Speaker-independent label coding apparatus
First Claim
1. A speech coding apparatus comprising:
- means for storing a plurality of classes each having an identifier represented by at least two of a plurality of prototypes, each of the plurality of prototypes having at least one prototype value;
transducer means for extracting from an utterance a feature vector signal having at least one feature value;
means for establishing a match between the feature vector signal and at least one of the classes by selecting from the plurality of prototypes at least one prototype having a prototype value that best matches the feature value of the feature vector signal; and
means for coding the feature vector signal with the identifier of the class represented by the selected at least one prototype vector.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention is related to speech recognition and particularly to a new type of vector quantizer and a new vector quantization technique in which the error rate of associating a sound with an incoming speech signal is drastically reduced. To achieve this end, the present invention technique groups the feature vectors in a space into different prototypes at least two of which represent a class of sound. Each of the prototypes may in turn have a number of subclasses or partitions. Each of the prototypes and their subclasses may be assigned respective identifying values. To identify an incoming speech feature vector, at least one of the feature values of the incoming feature vector is compared with the different values of the respective prototypes, or the subclasses of the prototypes. The class of sound whose group of prototypes, or at least one of the prototypes, whose combined value most closely matches the value of the feature value of the feature vector is deemed to be the class corresponding to the feature vector. The feature vector is then labeled with the identifier associated with that class.
51 Citations
76 Claims
-
1. A speech coding apparatus comprising:
-
means for storing a plurality of classes each having an identifier represented by at least two of a plurality of prototypes, each of the plurality of prototypes having at least one prototype value; transducer means for extracting from an utterance a feature vector signal having at least one feature value; means for establishing a match between the feature vector signal and at least one of the classes by selecting from the plurality of prototypes at least one prototype having a prototype value that best matches the feature value of the feature vector signal; and means for coding the feature vector signal with the identifier of the class represented by the selected at least one prototype vector. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A speech coding apparatus comprising:
-
means for storing a plurality of prototypes representative of a plurality of classes, each class having an identifier represented by at least two of the plurality of prototypes, each of the plurality of prototypes having at least one prototype value; transducer means for extracting from an utterance a feature vector signal having at least one feature value; means for establishing a match between the feature vector signal and at least one class by comparing the feature value of the feature vector signal against the respective prototype values of the prototypes; means for coding the feature vector signal with the identifier of the class represented by any of the prototypes having a prototype value most closely matching the feature value of the feature vector signal. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method of coding speech comprising the steps of:
-
(a) storing in a memory means a plurality of classes each having an identifier represented by at least two of a plurality of prototypes, each of the plurality of prototypes having at least one prototype value; (b) using transducer means to extract from an utterance a feature vector signal having at least one feature value; (c) establishing a correspondence between the feature vector signal and at least one class of the plurality of classes by selecting from among a plurality of prototypes at least one prototype whose prototype value most closely matches the feature value of the feature vector signal; and (d) coding the feature vector signal with the identifier of class represented by the selected at least one prototype. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A method of coding speech comprising the steps of:
-
(a) storing in a memory means a plurality of prototype vectors representative of a plurality of classes, each class having an identifier represented by at least one of the plurality of prototype vectors, each of the plurality of prototype vectors having at least one prototype value; (b) using transducer means to extract from an utterance a feature vector signal having a feature value; (c) establishing a correspondence between the feature vector signal and at least one class by comparing the feature value of the feature vector signal against the respective prototype values of the prototype vectors; (d) coding the feature vector signal with the identifier of the class represented by any of the prototype vectors having a prototype value that most closely matches the feature value of the feature vector signal. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42, 43)
-
-
44. A speech coding apparatus comprising:
-
means for storing two or more prototype vector signals, each prototype vector signal representing a prototype vector having an identifier and at least two partitions, each partition having at least one partition value; transducer means for measuring value of at least one feature of an utterance during a time interval to produce a feature vector signal representing the value of the at least one feature of the utterance; means for calculating a match score for each partition, each partition match score representing the value of a match between the partition value of the partition and the feature value of the feature vector signal; means for calculating a prototype match score for each prototype vector, each prototype match score representing a function of the partition match scores for all partitions in the prototype vector; and means for coding the feature vector signal with the identifier of the prototype vector signal having a best prototype match score. - View Dependent Claims (45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
-
-
55. A speech coding method comprising the steps of:
-
storing two or more prototype vector signals, each prototype vector signal representing a prototype vector having an identifier and at least two partitions, each partition having at least one partition value; using transducer means to measure a value of at least one feature of an utterance during a time interval to produce a feature vector signal representing the value of the at least one feature of the utterance; calculating a match score for each partition, each partition match score representing the value of a match between the partition value of the partition and the feature value of the feature vector signal; calculating a prototype match score for each prototype vector, each prototype match score representing a function of the partition match scores for all partitions in the prototype vector; and coding the feature vector signal with the identifier of the prototype vector signal having the a prototype match score. - View Dependent Claims (56, 57, 58, 59, 60, 61, 62, 63, 64, 65)
-
-
66. An article for configuring a machine to perform a method of speech coding comprising the steps of:
-
storing two or more prototype vector signals, each prototype vector signal representing a prototype vector having an identifier and at least two partitions, each partition having at least one partition value; using transducer means to measure a value of at least one feature of an utterance during a time interval to produce a feature vector signal representing the value of the at least one feature of the utterance; calculating a match score for each partition, each partition match score representing the value of a match between the partition value of the partition and the feature value of the feature vector signal; calculating a prototype match score for each prototype vector, each prototype match score representing a function of the partition match scores for all partitions in the prototype vector; and coding the feature vector signal with the identifier of the prototype vector signal having a best prototype match score. - View Dependent Claims (67, 68, 69, 70, 71, 72, 73, 74, 75, 76)
-
Specification