FAST OUT-OF-VOCABULARY SEARCH IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
4 Assignments
0 Petitions
Accused Products
Abstract
A method including: receiving, on a computer system, a text search query, the query including one or more query words; generating, on the computer system, for each query word in the query, one or more anchor segments within a plurality of speech recognition processed audio files, the one or more anchor segments identifying possible locations containing the query word; post-processing, on the computer system, the one or more anchor segments, the post-processing including: expanding the one or more anchor segments; sorting the one or more anchor segments; and merging overlapping ones of the one or more anchor segments; and searching, on the computer system, the post-processed one or more anchor segments for instances of at least one of the one or more query words using a constrained grammar.
14 Citations
39 Claims
-
1-19. -19. (canceled)
-
20. A method comprising:
-
receiving, on a computer system, a text search query; searching, on the computer system, a plurality of speech recognition processed audio files for instances of words of the text search query, the speech recognition processed audio files being associated with metadata including representations of one or more words detected in the audio files, the metadata being generated by a speech recognition engine in accordance with a vocabulary, the searching comprising; identifying one or more query words from the text search query, the one or more identified query words not being in the vocabulary; identifying segments of the speech recognition processed audio files, the segments being more likely than other portions of the audio file to include at least one of the identified query words; performing speech recognition on the identified segments for instances of the one or more identified query words using a constrained grammar; and returning one or more search results comprising one or more audio files corresponding to segments containing instances of at least one of the identified query words. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A system comprising:
-
a processor; and memory, the memory having instructions that, when executed by the processor, cause the processor to; receive a text search query; search a plurality of speech recognition processed audio files for instances of words of the text search query, the speech recognition processed audio files being associated with metadata including representations of one or more words detected in the audio files, the metadata being generated by a speech recognition engine in accordance with a vocabulary, the searching comprising; identifying one or more query words from the text search query, the one or more identified query words not being in the vocabulary; identifying segments of the speech recognition processed audio files, the segments being more likely than other portions of the audio file to include at least one of the identified query words; performing speech recognition on the identified segments for instances of the identified query words using a constrained grammar; and returning one or more search results comprising one or more audio files corresponding to segments containing instances of at least one of the identified query words. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39)
-
Specification