Normalization of speech by adaptive labelling
First Claim
1. A speech coding apparatus comprising:
- means for measuring the value of at least one feature of an utternace, said utternace occurring over a series of successive time intervals, said means measuring the feature value of the utterance during each time interval to produce a series of feature vector signals representing the feature values;
means for storing a plurality of prototype vector signals, each prototype vector signal having at least one parameter value and having a unique identification value;
means for generating a first modified feature vector signal having a modified feature value, said modified feature value being related, by a modification function, to the feature value of a first feature vector signal in the series of feature vector signals;
means for comparing the modified feature value of the first modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the first modified feature vector signal;
means for altering the modification function to improve the match between the modified feature vector signal and its associated prototype vector signal determined by the comparison;
means for generating a second modified feature vector signal having a modified feature value, said modified feature value of the second modified feature vector being related, by the altered modification function, to the feature value of a second feature vector signal in the series of feature vector signals, said second feature vector signal following the first feature vector signal;
means for comparing the modified feature value of the second modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the second modified feature vector signal; and
means for outputting the identification value of the prototype vector signal associated with the second modified feature vector as a coded representation of the second feature vector signal.
2 Assignments
0 Petitions
Accused Products
Abstract
In a speech processor system in which prototype vectors of speech are generated by an acoustic processor under reference noise and known ambient conditions and in which feature vectors of speech are generated during varying noise and other ambient and recording conditions, normalized vectors are generated to reflect the form the feature vectors would have if generated under the reference conditions. The normalized vectors are generated by: (a) applying an operator function Ai to a set of feature vectors x occurring at or before time interval i to yield a normalized vector yi =Ai (x); (b) determining a distance error vector Ei by which the normalized vector is projectively moved toward the closest prototype vector to the normalized vector yi ; (c) up-dating the operator function for next time interval to correspond to the most recently determined distance error vector; and (d) incrementing i to the next time interval and repeating steps (a) through (d) wherein the feature vector corresponding to the incremented i value has the most recent up-dated operator function applied thereto. With successive time intervals, successive normalized vectors are generated based on a successively up-dated operator function. For each normalized vector, the closest prototype thereto is associated therewith. The string of normalized vectors or the string of associated prototypes (or respective label identifiers thereof) or both provide output from the acoustic processor.
53 Citations
8 Claims
-
1. A speech coding apparatus comprising:
-
means for measuring the value of at least one feature of an utternace, said utternace occurring over a series of successive time intervals, said means measuring the feature value of the utterance during each time interval to produce a series of feature vector signals representing the feature values; means for storing a plurality of prototype vector signals, each prototype vector signal having at least one parameter value and having a unique identification value; means for generating a first modified feature vector signal having a modified feature value, said modified feature value being related, by a modification function, to the feature value of a first feature vector signal in the series of feature vector signals; means for comparing the modified feature value of the first modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the first modified feature vector signal; means for altering the modification function to improve the match between the modified feature vector signal and its associated prototype vector signal determined by the comparison; means for generating a second modified feature vector signal having a modified feature value, said modified feature value of the second modified feature vector being related, by the altered modification function, to the feature value of a second feature vector signal in the series of feature vector signals, said second feature vector signal following the first feature vector signal; means for comparing the modified feature value of the second modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the second modified feature vector signal; and means for outputting the identification value of the prototype vector signal associated with the second modified feature vector as a coded representation of the second feature vector signal. - View Dependent Claims (2, 3, 4)
-
-
5. A method of coding speech, said method comprising the steps of:
-
measuring the value of at least one feature of an utterance, said utternace occurring over a series of successive time intervals, the feature value of the utterance being measured during each time interval to produce a series of feature vector signals representing the feature values; storing a plurality of prototype vector signals, each prototype vector signal having at least one parameter value and having a unique identification value; generating a first modified feature vector signal having a modified feature value, said modified feature value being related, by a modification function, to the feature value of a first feature vector signal in the series of feature vector signals; comparing the modified feature value of the first modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the first modified feature vector signal; altering the modification function to improve the match between the modified feature vector signal and its associated prototype vector signal determined by the comparison; generating a second modified feature vector signal having a modified feature value, said modified feature value of the second modified feature vector being related, by the altered modification function, to the feature value of a second feature vector signal in the series of feature vector signals, said second feature vector signal following the first feature vector signal; comparing the modified feature value of the second modified feature vector signal to the parameter values of the prototype vector signals to determine the associated prototype vector signal which is best matched to the second modified feature vector signal; and outputting the identification value of the prototype vector signal associated with the second modified feature vector as a coded representation of the second feature vector signal. - View Dependent Claims (6, 7, 8)
-
Specification