Methods and apparatus relating to searching of spoken audio data
First Claim
1. A method of searching spoken audio data for a search term comprising the steps of taking phonetic index data corresponding to the spoken audio data, searching the phonetic index data for likely matches to the search term, wherein when a likely match is detected a portion of the spoken audio data or phonetic index data is selected which includes the likely match and said selected portion of the spoken audio data or phonetic index data is processed using a large vocabulary speech recogniser.
4 Assignments
0 Petitions
Accused Products
Abstract
This invention relates to a method of searching spoken audio data for one or more search terms comprising performing a phonetic search of the audio data to identify likely matches to a search term and producing textual data corresponding to a portion of the spoken audio data including a likely match. An embodiment of the method comprises the steps of taking phonetic index data corresponding to the spoken audio data, searching the phonetic index data for likely matches to the search term, wherein when a likely match is detected a portion of the spoken audio data or phonetic index data is selected which includes the likely match and said selected portion of the spoken audio data or phonetic index data is processed using a large vocabulary speech recogniser. The large vocabulary speech recogniser may derive textual data which can be used for further processing or may be used to present a transcript to a user. The present invention therefore combines the benefit of phonetic searching of audio data with the advantages of large vocabulary speech recognition.
-
Citations
24 Claims
- 1. A method of searching spoken audio data for a search term comprising the steps of taking phonetic index data corresponding to the spoken audio data, searching the phonetic index data for likely matches to the search term, wherein when a likely match is detected a portion of the spoken audio data or phonetic index data is selected which includes the likely match and said selected portion of the spoken audio data or phonetic index data is processed using a large vocabulary speech recogniser.
- 20. A method of searching spoken audio data for one or more search terms comprising performing a phonetic search of the audio data to identify likely matches to a search term and producing textual data corresponding to a portion of the spoken audio data including a likely match.
- 22. A method of processing spoken audio data comprising the steps of using a phonetic search engine to identify possible matches to at least one search term and using a large vocabulary speech engine on a portion of the spoken audio data including a likely match.
-
24. A hybrid audio search engine comprising a phonetic search engine and a large vocabulary search engine wherein the large vocabulary search engine is adapted to operate on portions of spoken audio data identified by the phonetic search engine as likely matches to one or more search terms.
Specification