Method and system for efficient spoken term detection using confusion networks

  • US 9,196,243 B2
  • Filed: 03/31/2014
  • Issued: 11/24/2015
  • Est. Priority Date: 03/31/2014
  • Status: Active Grant
First Claim

1. A method for spoken term detection, comprising:

  • receiving phone level out-of-vocabulary (OOV) keyword queries;

  • converting the phone level OOV keyword queries to words;

  • generating a confusion network (CN) based keyword searching (KWS) index; and

  • using the CN based KWS index for both in-vocabulary (IV) keyword queries and the OOV keyword queries;

  • wherein converting the phone level OOV keyword queries to words comprises:

      • converting the phone level OOV keyword queries to phonetic finite state acceptors, wherein phone sequences for IV terms are looked up in a recognition lexicon and phone sequences for OOV terms are generated with a grapheme-to-phoneme model;

      • expanding the phone level OOV keyword queries through composition with a weighted finite state transducer (WFST) that models probabilities of confusions between different phones;

      • extracting N-best hypotheses represented by each expanded WFST; and

      • mapping back the N-best hypotheses to a set of N or fewer word sequences through composition with a finite state transducer that maps from phone sequences to word sequences; and

  • wherein the receiving, converting, generating and using steps are performed by a computer system comprising a memory and at least one processor coupled to the memory.
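
The flow of the claimed steps can be illustrated with a small Python sketch. This is not the patented implementation and it does not use real WFST composition: phone confusions are approximated as independent per-phone substitutions (no insertions or deletions), and the phone-to-word mapping only matches whole single words. All names and data (`LEXICON`, `CONFUSIONS`, `NETWORKS`, `expand_query`, `map_to_words`, `build_cn_index`, `search`) are hypothetical and chosen only for illustration.

```python
import heapq
from collections import defaultdict
from itertools import product

# Hypothetical toy data, not taken from the patent.
LEXICON = {  # word -> phone sequence (stand-in for the recognition lexicon)
    "cat": ("k", "ae", "t"),
    "cap": ("k", "ae", "p"),
    "cut": ("k", "ah", "t"),
    "bat": ("b", "ae", "t"),
}
CONFUSIONS = {  # phone -> {confusable phone: probability}
    "k": {"k": 0.9, "g": 0.1},
    "ae": {"ae": 0.7, "ah": 0.3},
    "d": {"d": 0.6, "t": 0.4},
}
NETWORKS = {  # utterance id -> list of bins, each bin maps word -> posterior
    "utt1": [
        {"the": 0.9, "a": 0.1},
        {"cat": 0.6, "cut": 0.3, "bat": 0.1},
    ],
}


def expand_query(phones, n_best):
    """Return the n_best most probable (probability, phone sequence) pairs.

    Substitution-only approximation of composing the query with a
    phone confusion model and extracting the N-best paths.
    """
    options = [list(CONFUSIONS.get(p, {p: 1.0}).items()) for p in phones]
    candidates = []
    for combo in product(*options):
        seq = tuple(phone for phone, _ in combo)
        prob = 1.0
        for _, p in combo:
            prob *= p
        candidates.append((prob, seq))
    return heapq.nlargest(n_best, candidates, key=lambda c: c[0])


def map_to_words(hypotheses):
    """Keep only hypotheses whose phones spell a lexicon word (N or fewer)."""
    phones_to_word = {phones: word for word, phones in LEXICON.items()}
    return [
        (phones_to_word[seq], prob)
        for prob, seq in hypotheses
        if seq in phones_to_word
    ]


def build_cn_index(networks):
    """Index every word in every bin: word -> [(utterance, bin, posterior)]."""
    index = defaultdict(list)
    for utt_id, bins in networks.items():
        for pos, cn_bin in enumerate(bins):
            for word, posterior in cn_bin.items():
                index[word].append((utt_id, pos, posterior))
    return index


def search(index, word):
    """Look up one word; the same index serves IV words and OOV proxies."""
    return index.get(word, [])


# OOV query with phones /k ae d/, which is not in the toy lexicon.
proxies = map_to_words(expand_query(("k", "ae", "d"), n_best=4))
index = build_cn_index(NETWORKS)
oov_hits = {word: search(index, word) for word, _ in proxies}
iv_hits = search(index, "bat")  # an IV query uses the same index directly
```

With this toy data, the four best phone hypotheses map back to only two in-vocabulary words, `cat` and `cut`, which shows why the claim speaks of "N or fewer" word sequences. A production system would typically replace the brute-force enumeration with transducer composition and shortest-path search in a WFST toolkit.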
