Speech coding method and apparatus using mean squared error modifier for selected speech coder parameters using VSELP techniques
First Claim
Patent Images
1. A method of matching energy of speech coding vectors to an input speech vector comprising the steps of:
- choosing a codevector to represent the input speech vector;
optimizing a long term predictor coefficient and a gain term for the codevector, thereby forming an optimized long term predictor and an optimized gain term; and
determining a gain bias factor to more closely match an energy of the code vector to an energy of the input speech vector; and
altering the optimal long term predictor coefficient and the optimal gain term using the gain bias factor.
2 Assignments
0 Petitions
Accused Products
Abstract
An improved speech coder provides a more natural sounding replication of speech by modifying the mean-squared error criterion for the selected speech coder parameters. Specifically, the modification emphasizes the signal components that the speech coder has difficulty matching, i.e. the high frequencies. This emphasis is constrained to certain limitations to avoid over-emphasizing the speech.
27 Citations
13 Claims
-
1. A method of matching energy of speech coding vectors to an input speech vector comprising the steps of:
-
choosing a codevector to represent the input speech vector; optimizing a long term predictor coefficient and a gain term for the codevector, thereby forming an optimized long term predictor and an optimized gain term; and determining a gain bias factor to more closely match an energy of the code vector to an energy of the input speech vector; and altering the optimal long term predictor coefficient and the optimal gain term using the gain bias factor. - View Dependent Claims (2, 3, 4)
-
-
5. A method of speech coding comprising the steps of:
-
receiving a speech data signal; providing excitation vectors in response to said step of receiving; determining an excitation gain coefficient and a long term predictor coefficient for use by a long term predictor filter and a Pth-order short term predictor filter; filtering said excitation vectors utilizing said long term predictor filter and said short term predictor filter, forming filtered excitation vectors; comparing said filtered excitation vectors to said speech data signal, forming difference vectors; calculating energy of said filtered difference vectors, forming an error signal; choosing an excitation code, I, using the error signals, which best represents the received speech data; calculating optimal excitation gain and optimal long term predictor gain for the chosen excitation codebook vector; forming a synthetic excitation signal using said chosen excitation code, the optimal excitation gain and said optimal long term predictor gain; calculating an energy of the speech data signal, forming a speech data energy value; calculating an energy of the synthetic excitation signal, forming a synthetic excitation energy value; determining a gain bias factor to more closely match the speech data energy value and the synthetic excitation energy value; and quantizing the optimal excitation gain and the optimal long term predictor gain to minimize the error between the speech data signal and the synthetic excitation signal.
-
-
6. A speech coder for providing a codevector and associated gain terms in response to an input speech vector, the speech coder comprising:
-
a codebook search controller for choosing a codevector to represent the input speech vector; a mean square error (MSE) modifier comprising; an optimizer for optimizing a long term predictor coefficient and a gain term for the codevector, thereby forming an optimized long term predictor and an optimized gain term; a bias generator for determining a gain bias factor to more closely match an energy of the code vector to the input speech vector; and an alterer for altering the optimal long term predictor coefficient and the optimal gain term using the gain bias factor.
-
-
7. A method of matching energy of a reconstructed speech vector to an input speech vector comprising the steps of:
-
choosing at least one codevector to represent the input speech vector; determining a gain term for each of the at least one codevector; combining the chosen codevector, using the corresponding codevector gain term(s), to produce a combined excitation vector; filtering the combined excitation vector to produce a reconstructed speech vector, determining a gain bias factor to more closely match an energy of the reconstructed speech vector to an energy of the input speech vector; and altering the gain term using the gain bias factor.
-
-
8. A method of matching energy of a reconstructed speech vector to an input speech vector comprising the steps of:
-
choosing at least one codevector to represent the input speech vector; determining a long term predictor coefficient and a gain term for each of the at least one codevectors; combining a long term predictor vector and the chosen codevector(s), using the long term predictor coefficient and the codevector gain term(s) to produce a combined excitation vector; filtering the combined excitation vector to produce a reconstructed speech vector; determining a gain bias factor to more closely match an energy of the reconstructed speech vector to an energy of the input speech vector; and altering the long term predictor coefficient and the gain term using the gain bias factor. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A method of speech coding comprising the steps of:
-
receiving a speech data signal; providing excitation vectors in response to said step of receiving; determining an excitation gain coefficient and a long term predictor coefficient for use by a long term predictor filter and a Pth-order short term predictor filter; filtering said excitation vectors utilizing said long term predictor filter and said short term predictor filter, forming filtered excitation vectors; comparing said filtered excitation vectors to said speech data signal, forming difference vectors; calculating energy of said difference vectors, forming an error signal; choosing an excitation code, I, using the error signals, which best represents the received speech data; calculating optimal excitation gain and optimal long term predictor gain for the chosen excitation codebook vector; forming a synthetic excitation signal using said chosen excitation code, the optimal excitation gain and said optimal long term predictor gain; filtering a synthetic excitation signal to form a synthetic speech signal, calculating an energy of the speech data signal, forming a speech data energy value; calculating an energy of the synthetic speech signal, forming a synthetic speech energy value; determining a gain bias factor to more closely match the speech data energy value and the synthetic speech energy value; adjusting speech data signal based on a gain bias factor; and quantizing the excitation gain and the long term predictor gain to minimize the error between the adjusted speech data signal and the synthetic speech signal.
-
Specification