Methods and apparatus relating to searching of spoken audio data
Abstract
This invention relates to a method of searching spoken audio data for one or more search terms comprising performing a phonetic search of the audio data to identify likely matches to a search term and producing textual data corresponding to a portion of the spoken audio data including a likely match. An embodiment of the method comprises the steps of taking phonetic index data corresponding to the spoken audio data, searching the phonetic index data for likely matches to the search term, wherein when a likely match is detected a portion of the spoken audio data or phonetic index data is selected which includes the likely match and said selected portion of the spoken audio data or phonetic index data is processed using a large vocabulary speech recognizer. The large vocabulary speech recognizer may derive textual data which can be used for further processing or may be used to present a transcript to a user. The present invention therefore combines the benefit of phonetic searching of audio data with the advantages of large vocabulary speech recognition.
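The two-stage flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `Phone`, `Match`, `phonetic_search`, `select_portion` and `hybrid_search` are hypothetical, the edit-distance score is a stand-in for whatever match scoring a real phonetic audio searcher uses, and the large vocabulary speech recogniser is represented by a caller-supplied function that takes a time range and returns text.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class Phone:
    """One entry of a toy phonetic index: a phone symbol with its time span."""
    symbol: str
    start_ms: int
    end_ms: int


@dataclass(frozen=True)
class Match:
    """A likely match to the search term, with a score in [0, 1]."""
    start_ms: int
    end_ms: int
    score: float


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two symbol sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def phonetic_search(index: Sequence[Phone], term: Sequence[str],
                    min_score: float = 0.7) -> List[Match]:
    """Slide the term over the index and keep windows that score well.

    Assumption: score = 1 - edit_distance / len(term). A real searcher
    would typically use acoustic or lattice-based scores instead.
    """
    n = len(term)
    if n == 0:
        return []
    matches = []
    for i in range(len(index) - n + 1):
        window = index[i:i + n]
        dist = edit_distance([p.symbol for p in window], term)
        score = 1.0 - dist / n
        if score >= min_score:
            matches.append(Match(window[0].start_ms, window[-1].end_ms, score))
    return matches


def select_portion(match: Match, audio_length_ms: int,
                   padding_ms: int = 2000) -> Tuple[int, int]:
    """Select the audio portion that includes the match plus some context."""
    return (max(0, match.start_ms - padding_ms),
            min(audio_length_ms, match.end_ms + padding_ms))


def hybrid_search(index: Sequence[Phone], term: Sequence[str],
                  audio_length_ms: int,
                  recognise: Callable[[int, int], str],
                  min_score: float = 0.7,
                  padding_ms: int = 2000) -> List[Tuple[Match, str]]:
    """Phonetic search first, then run the recogniser only on selected portions."""
    results = []
    for match in phonetic_search(index, term, min_score):
        start_ms, end_ms = select_portion(match, audio_length_ms, padding_ms)
        results.append((match, recognise(start_ms, end_ms)))
    return results
```

In this sketch the recogniser is never run over the whole recording, only over the padded windows around likely matches, which reflects the division of labour the abstract describes.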
22 Claims
1. A method of searching spoken audio data for a search term, said method comprising the steps of:
- taking phonetic index data corresponding to the spoken audio data,
- searching the phonetic index data for likely matches to the search term,
- selecting a portion of the spoken audio data when a likely match is detected, and
- processing said selected portion of the spoken audio data for producing textual data,
wherein said searching and selecting steps are implemented on a phonetic audio searcher and said processing step is implemented on a large vocabulary speech recogniser.
(Dependent claims: 2-19)
20. A method of searching spoken audio data for one or more search terms comprising the steps of:
- performing a phonetic search of the audio data to identify likely matches to a search term; and
- producing textual data corresponding to a portion of the spoken audio data including a likely match,
wherein said performing step is performed on a phonetic audio searcher and said producing step is performed on a large vocabulary speech recogniser.
(Dependent claims: 21)
22. A hybrid audio search engine comprising:
- a phonetic search engine for identifying portions of spoken audio data as likely matches to one or more search terms; and
- a large vocabulary search engine, wherein the large vocabulary search engine is configured to operate on said portions of spoken audio data.
Specification