Speech recognition
First Claim
Patent Images
1. A method of speech recognition comprising the steps of:
- a) comparing a first audio signal comprising a first unknown utterance with a first set of audio representations to generate a first measure of similarity for each audio representation of said set, each audio representation being associated with a corresponding first item of data, a first item of data being associated with an associated item of data, the associated item of data having an audio representation which is not one of said set;
b) comparing a second audio signal comprising a second unknown utterance with a second set of audio representations to generate a second measure of similarity for each audio representation of said second set, each audio representation of said second set being associated with a corresponding second item of data c) selecting from data defining associations between items of data, items of data which are defined as being associated with one another according to the first item of data for which the first generated measure indicates the greatest similarity;
an item of data associated with the first item of data; and
the second item of data for which the second measure indicates the greatest similarity.
1 Assignment
0 Petitions
Accused Products
Abstract
In this invention vocabulary size of a speech recognizer for a large task is reduced by providing a recognizer only for the most common vocabulary items. Uncommon items are catered for by providing aliases from the common items. This allows accuracy to remain high while also allowing uncommon items to be recognized when necessary.
-
Citations
10 Claims
-
1. A method of speech recognition comprising the steps of:
-
a) comparing a first audio signal comprising a first unknown utterance with a first set of audio representations to generate a first measure of similarity for each audio representation of said set, each audio representation being associated with a corresponding first item of data, a first item of data being associated with an associated item of data, the associated item of data having an audio representation which is not one of said set;
b) comparing a second audio signal comprising a second unknown utterance with a second set of audio representations to generate a second measure of similarity for each audio representation of said second set, each audio representation of said second set being associated with a corresponding second item of data c) selecting from data defining associations between items of data, items of data which are defined as being associated with one another according to the first item of data for which the first generated measure indicates the greatest similarity;
an item of data associated with the first item of data; and
the second item of data for which the second measure indicates the greatest similarity. - View Dependent Claims (2, 3, 9, 10)
-
-
5. A method according to claim 4 in which the comparing step f) uses a confusion matrix which characterises errors which occur in the comparing step a) for said audio representations.
-
6. A device for retrieving a data record from a database storing a plurality of data records each of which includes a data item of a first category and a data item of a second or subsequent category, wherein the data items in the first category are designated as being either common or uncommon in dependence upon the frequency with which they appear in the data records stored in the database, the device comprising:
-
audio representation storage means for storing an audio representation in respect of each of the common data items in the first category;
association storage means for storing associations between each common data item and a plurality of uncommon data items whose audio representations are similar to but different from the audio representation of the respective associated common data item;
comparison means for comparing a signal derived from an unknown utterance with each of the audio representations of common data items stored in the audio representation storage means, generating a measure of similarity at least in respect of one or more audio representations which are sufficiently similar to the compared signal to give rise to a measure of similarity above a predetermined threshold and designating as candidate first category data items both the common data items whose audio representations gave rise to a measure of similarity above the threshold and the uncommon data items associated with the designated common data items according to the association storage means;
selection means for selecting one or more data items of a second or subsequent category; and
retrieval means for retrieving one or more data records including a first category data item equal to one of the candidate first data items designated by the comparison means and a second or subsequent category data item selected by the selection means. - View Dependent Claims (7, 8)
-
Specification