Database annotation and retrieval including phoneme data
Abstract
The data structure is used in accessing a plurality of data files. The data structure comprises a plurality of annotation storage areas adapted to correspond with the data files, each annotation storage area containing an annotation representing a time sequential signal and each annotation storage area comprising a plurality of block storage areas each containing phoneme and word data forming a respective temporal block of the annotation and each block having an associated time index identifying a timing of the block within the corresponding annotation. Each block storage area includes a plurality of node storage areas, each associated with a node which represents a point in time at which a word and/or phoneme begins or ends within the corresponding annotation, and each node storage area having a time offset storage area containing a time offset defining the point in time represented by the node relative to the time index associated with the corresponding block. Each node storage area further has one or more phoneme link storage areas, each having a phoneme storage area containing data identifying a phoneme associated with the corresponding node; or one or more word link storage areas, each having a word storage area containing data identifying a word associated with the corresponding node. One or more of the node storage areas has at least one of the phoneme link storage areas and at least one of the word link storage areas.
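The nesting described in the abstract (annotation, blocks with a time index, nodes with a time offset, phoneme and word links on each node) can be illustrated with a minimal Python sketch. All class and field names here are assumptions chosen for illustration; the abstract does not specify a concrete layout, and it does not say how a link refers to the node where the phoneme or word ends, so that detail is omitted.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PhonemeLink:
    phoneme: str  # data identifying a phoneme associated with the node


@dataclass
class WordLink:
    word: str  # data identifying a word associated with the node


@dataclass
class Node:
    # Offset of this node relative to the time index of its block.
    time_offset: float
    phoneme_links: List[PhonemeLink] = field(default_factory=list)
    word_links: List[WordLink] = field(default_factory=list)


@dataclass
class Block:
    # Timing of this block within the annotation.
    time_index: float
    nodes: List[Node] = field(default_factory=list)

    def absolute_time(self, node: Node) -> float:
        """Point in time represented by a node, measured within the annotation."""
        return self.time_index + node.time_offset


@dataclass
class Annotation:
    blocks: List[Block] = field(default_factory=list)


def nodes_with_both_links(annotation: Annotation) -> List[Node]:
    """Nodes that carry at least one phoneme link and at least one word link."""
    return [
        node
        for block in annotation.blocks
        for node in block.nodes
        if node.phoneme_links and node.word_links
    ]


# Illustrative data only: one block starting at 5.0 s with two nodes.
first = Node(0.0, [PhonemeLink("s")], [WordLink("soup")])
second = Node(0.25, [PhonemeLink("uw")])
example_block = Block(time_index=5.0, nodes=[first, second])
example_annotation = Annotation(blocks=[example_block])
```

Storing offsets relative to the block's time index, as the abstract describes, means a node's absolute time is recovered by a single addition, as in `absolute_time`.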
47 Claims
1. An apparatus for generating annotation data for annotating a data file comprising audio data, the apparatus comprising:
an automatic speech recognizer operable to generate phoneme data for audio data in the data file;
a word decoder operable to process the phoneme data generated by the automatic speech recognizer to identify words within the phoneme data;
an annotation generator operable to generate annotation data for the data file by combining the generated phoneme data and the words identified by said word decoder, the annotation data being for use in subsequent retrieval of the data file;
an associator operable to associate the generated annotation data with the data file; and
a storage device operable to store the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17
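The elements of claim 1 can be read as a pipeline: recognizer, word decoder, annotation generator, associator, and storage. The following is a toy Python sketch of that flow, not the patented implementation. The recognizer is a stub returning canned phonemes, the lexicon and phoneme symbols are hypothetical, the greedy longest-match decoder is a stand-in technique, and the in-memory dictionary stands in for the database.

```python
from typing import Dict, List, Tuple

# Hypothetical pronunciation lexicon; the phoneme symbols are illustrative only.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("t", "ax", "m", "aa", "t", "ow"): "tomato",
    ("s", "uw", "p"): "soup",
}


def recognize_phonemes(audio: bytes) -> List[str]:
    """Stub for the automatic speech recognizer: returns canned phoneme data."""
    return ["t", "ax", "m", "aa", "t", "ow", "s", "uw", "p"]


def decode_words(phonemes: List[str]) -> List[str]:
    """Toy word decoder: greedy longest match of phoneme runs against LEXICON."""
    words: List[str] = []
    i = 0
    while i < len(phonemes):
        for j in range(len(phonemes), i, -1):
            word = LEXICON.get(tuple(phonemes[i:j]))
            if word is not None:
                words.append(word)
                i = j
                break
        else:
            i += 1  # no lexicon entry starts here; skip one phoneme
    return words


def generate_annotation(phonemes: List[str], words: List[str]) -> dict:
    """Annotation generator: combine the phoneme data and the decoded words."""
    return {"phonemes": list(phonemes), "words": list(words)}


class AnnotationDatabase:
    """In-memory stand-in for the database; keys associate annotations with files."""

    def __init__(self) -> None:
        self._by_file: Dict[str, dict] = {}

    def store(self, file_id: str, annotation: dict) -> None:
        self._by_file[file_id] = annotation

    def search_word(self, word: str) -> List[str]:
        return [f for f, a in self._by_file.items() if word in a["words"]]

    def search_phonemes(self, query: List[str]) -> List[str]:
        n = len(query)
        hits = []
        for file_id, a in self._by_file.items():
            ph = a["phonemes"]
            if any(ph[i:i + n] == query for i in range(len(ph) - n + 1)):
                hits.append(file_id)
        return hits


def annotate(file_id: str, audio: bytes, db: AnnotationDatabase) -> dict:
    """Run the whole pipeline for one data file and store the result."""
    phonemes = recognize_phonemes(audio)
    words = decode_words(phonemes)
    annotation = generate_annotation(phonemes, words)
    db.store(file_id, annotation)
    return annotation


db = AnnotationDatabase()
annotation = annotate("clip.wav", b"", db)
```

Because the stored annotation keeps both layers, the same file can be found with `db.search_word("soup")` or with a phoneme query such as `db.search_phonemes(["m", "aa", "t"])`.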
18. An apparatus for generating annotation data for annotating a data file, the apparatus comprising:
an input operable to receive an input voice signal;
a speech recognizer operable (i) to convert the input voice signal into phoneme data and (ii) to process the phoneme data to identify words within the phoneme data;
an annotation generator operable to generate annotation data for the data file by combining the phoneme data and the words obtained by said speech recognizer, the annotation data being for use in subsequent retrieval of the data file;
an associator operable to associate the generated annotation data with the data file; and
a storage device operable to store the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 19, 20, 21, 22, 23
24. An apparatus for generating annotation data for annotating a data file, the apparatus comprising:
an input operable to receive image data representative of text;
a character recognizer operable to convert the image data into text data;
a converter operable to convert words in the text data into phoneme data; and
an annotation generator operable to generate annotation data for the data file by combining the phoneme data and words in the text data, the annotation data being for use in subsequent retrieval of the data file;
an associator operable to associate the generated annotation data with the data file; and
a storage device operable to store the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 25, 26
27. A method of generating annotation data for annotating a data file comprising audio data, the method comprising the steps of:
using an automatic speech recognizer to generate phoneme data for audio data in the data file;
using a word decoder to process the phoneme data generated by the automatic speech recognizer to identify words within the phoneme data;
generating annotation data for the data file by combining the generated phoneme data and the words identified by the word decoder, the annotation data being for use in subsequent retrieval of the data file;
associating the generated annotation data with the data file; and
storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 28, 29, 30, 31, 32, 33, 34, 35, 36
37. A method of generating annotation data for annotating a data file, the method comprising the steps of:
receiving an input voice signal;
processing the input voice signal using a speech recognizer to generate phoneme data for the input voice signal;
processing the generated phoneme data to identify words within the phoneme data;
generating annotation data by combining the phoneme data and the word data generated for the input voice signal, the annotation data being for use in subsequent retrieval of the data file;
associating the generated annotation data with the data file; and
storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 38, 39
40. A method of generating annotation data for use in annotating a data file, the method comprising the steps of:
receiving image data representative of text;
converting the image data into text data using a character recognizer;
converting words in the text data into phoneme data;
generating annotation data for the data file by combining the phoneme data and words within the text data, the annotation data being for use in subsequent retrieval of the data file;
associating the generated annotation data with the data file; and
storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
Dependent claims: 41, 42
43. A computer readable medium storing computer program code for causing a programmable processing apparatus to become operable to generate annotation data for annotating a data file comprising audio data, the computer readable medium comprising:
code for causing an automatic speech recognizer to generate phoneme data for audio data in the data file;
code for causing a word decoder to process the phoneme data generated by the automatic speech recognizer to identify words within the phoneme data;
code for generating annotation data for the data file by combining the generated phoneme data and the words identified by the word decoder, the annotation data being for use in subsequent retrieval of the data file;
code for associating the generated annotation data with the data file; and
code for storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
44. A computer readable medium storing computer executable instructions for causing a programmable processing apparatus to become operable to generate annotation data for annotating a data file, the medium comprising:
code for receiving an input voice signal;
code for causing an automatic speech recognizer to convert the input voice signal into phoneme data;
code for processing the phoneme data to identify words within the phoneme data;
code for generating annotation data by combining the generated phoneme data and the identified words, the annotation data being for use in subsequent retrieval of the data file;
code for associating the generated annotation data with the data file; and
code for storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
45. A computer readable medium storing computer executable instructions for causing a programmable processing apparatus to become operable to generate annotation data for annotating a data file, the medium comprising:
code for receiving image data representative of text;
code for converting the image data into text data using a character recognizer;
code for converting words in the text data into phoneme data;
code for generating annotation data for the data file by combining the phoneme data and words within the text data, the annotation data being for use in subsequent retrieval of the data file;
code for associating the generated annotation data with the data file; and
code for storing the annotation data in a database to allow the associated data file to be retrieved by searching for words and phonemes within the stored annotation data.
46. An apparatus for generating annotation data for use in annotating a data file comprising audio data, the apparatus comprising:
an automatic speech recognizer operable to generate phoneme data for audio data in the data file;
a word decoder operable to identify words within the phoneme data generated by said automatic speech recognizer; and
an annotation generator operable to generate annotation data by combining the generated phoneme data and the words identified by said word decoder and operable to generate header data relating to the speech recognizer that generated the phoneme data for the annotation, which header data includes data identifying the language and phoneme set used by said automatic speech recognizer.
47. An apparatus for generating annotation data for use in annotating a data file, the apparatus comprising:
an input operable to receive an input voice signal;
a speech recognizer operable to convert the input voice signal into phoneme data and words; and
an annotation generator operable to generate annotation data by combining the phoneme data and the words and operable to generate header data relating to the speech recognizer that generated the phoneme data for the annotation, which header data includes data identifying the language and the phoneme set used by the speech recognizer.
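Claims 46 and 47 add header data identifying the language and phoneme set used by the recognizer. A minimal Python sketch of such a header follows. The field names, the example values, and the `same_recognizer_setup` check are assumptions for illustration; the claims state only what the header identifies, not how it is laid out or used.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class AnnotationHeader:
    language: str     # language used by the speech recognizer
    phoneme_set: str  # identifier of the phoneme set used by the recognizer


@dataclass
class HeadedAnnotation:
    header: AnnotationHeader
    phonemes: List[str]
    words: List[str]


def same_recognizer_setup(a: AnnotationHeader, b: AnnotationHeader) -> bool:
    """Hypothetical check that two annotations share language and phoneme set."""
    return a == b


# Illustrative values only; neither identifier comes from the patent.
header = AnnotationHeader(language="en-GB", phoneme_set="example-set-1")
headed = HeadedAnnotation(header, ["s", "uw", "p"], ["soup"])
other = AnnotationHeader(language="fr-FR", phoneme_set="example-set-1")
```

One plausible use, stated here as an assumption, is to confirm with `same_recognizer_setup` that phoneme data from two sources uses the same symbols before comparing them directly.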
Specification