Methods and apparatus relating to searching of spoken audio data
First Claim
1. A method of processing audio data to provide a searchable data file comprising the step of analysing the audio data with a phonetic recogniser wherein the phonetic recogniser acts on frames of the audio data and determines, for each frame, a score for each of a set of reference phones, the score being indicative of the likelihood that said frame corresponds to said phone characterised in that the score for each of the reference set of phones for each of said audio frames is stored in the searchable data file.
4 Assignments
0 Petitions
Accused Products
Abstract
Methods for processing audio data containing speech to produce a searchable index file and for subsequently searching such an index file are provided. The processing method uses a phonetic approach and models each frame of the audio data with a set of reference phones. A score for each of the reference phones, representing the difference of the audio from the phone model, is stored in the searchable data file for each of the phones in the reference set. A consequence of storing information regarding each of the reference phones is that the accuracy of searches carried out on the index file is not compromised by the rejection of information about particular phones. A subsequent search method is also provided which uses a simple and efficient dynamic programming search to locate instances of a search term in the audio. The methods of the present invention have particular application to the field of audio data mining.
97 Citations
21 Claims
- 1. A method of processing audio data to provide a searchable data file comprising the step of analysing the audio data with a phonetic recogniser wherein the phonetic recogniser acts on frames of the audio data and determines, for each frame, a score for each of a set of reference phones, the score being indicative of the likelihood that said frame corresponds to said phone characterised in that the score for each of the reference set of phones for each of said audio frames is stored in the searchable data file.
-
2. A method of processing audio data to provide a searchable data file comprising the step of analysing the audio data with a phonetic recogniser wherein the phonetic recogniser acts on frames of the audio data and determines, for each frame, a score for a plurality of reference phones, the score being indicative of the likelihood that said frame corresponds to said phone characterised in that the searchable data file stores scores for each said audio frame in a simple matrix format.
-
11. A method of searching audio data for a phonetic search sequence comprising the steps of;
-
(i) taking a searchable data file having a score for each of a set of reference phones for each of a series of frames of the audio data, the scores being indicative of the likelihood that that particular frame corresponds to that particular phone, (ii) searching said searchable data file to find likely matches to a phonetic search sequence wherein the scores for the reference phones for each audio frame are used to determine the likely matches. - View Dependent Claims (12, 13, 14, 15, 16)
-
- 17. A searchable data file comprising a phonetic index file, the phonetic index file corresponding to a series of frames of an audio data file and comprising for each frame a score for each of a set of reference phones.
-
21. An apparatus for acting on audio data to create a searchable data file comprising a reference set of phones, a phonetic recogniser adapted to compare a frame of audio data with the reference set of phones and to output a score indicative of the likelihood that said frame corresponds to each phone, and a data output for creating a searchable data file comprising, for each audio frame, the score for each of the set of reference phones.
Specification