×

Fast out-of-vocabulary search in automatic speech recognition systems

  • US 10,290,301 B2
  • Filed: 01/09/2017
  • Issued: 05/14/2019
  • Est. Priority Date: 12/29/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, on a computer system, a text search query;

    searching, on the computer system, a plurality of speech recognition processed audio files for instances of words of the text search query, the speech recognition processed audio files being associated with metadata including representations of one or more words detected in the audio files and one or more sub-words detected in the audio files, the metadata being generated by a speech recognition engine in accordance with a vocabulary, the searching comprising;

    identifying one or more query words from the text search query, the one or more identified query words not being in the vocabulary, each of the one or more query words comprising one or more sub-words;

    identifying segments of the speech recognition processed audio files, each of the segments comprising audio data, the segments being more likely than other portions of the audio file to include at least one of the identified query words, by searching the metadata for instances of the sub-words of the one or more query words to identify one or more anchor segments of the audio files and expanding the one or more anchor segments, each of the anchor segments including a start time and an end time within the audio file, the segments of the audio files comprising the anchor segments;

    performing speech recognition on the audio data of the identified segments for instances of the one or more identified query words using a constrained grammar comprising the one or more identified query words not in the vocabulary; and

    returning one or more search results comprising one or more audio files corresponding to segments containing instances of at least one of the identified query words.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×