Codebook tables for encoding and decoding
Abstract
A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.
165 Citations
47 Claims
1. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
an encoding system operable to determine a rate selection and a type classification for the speech signal, the encoding system comprising:
a line spectrum frequency prediction error quantization table selectable as a function of the rate selection, the line spectrum frequency prediction error quantization table associated with encoding short-term predictor parameters of the speech signal;
a 2D gain quantization table associated with jointly encoding an adaptive codebook gain and a fixed codebook gain of the speech signal when the type classification is a first type;
a pre-gain quantization table selectable as a function of the rate selection, the pre-gain quantization table associated with exclusively encoding the adaptive codebook gain when the type classification is a second type;
a delayed gain quantization table selectable as a function of the rate selection, the delayed gain quantization table associated with exclusively encoding the fixed codebook gain when the type classification is the second type; and
a decoding system in communication with the encoding system, the decoding system operable to decode the speech signal with the line spectrum frequency prediction error quantization table and at least one of:
the 2D gain quantization table, the pre-gain quantization table, and the delayed gain quantization table, as a function of the rate selection and the type classification.
Dependent claims: 2-23
a first set of predictor coefficients; and
a second set of predictor coefficients;
wherein the first and the second set of predictor coefficients are selectable by the encoding system when the rate selection is a half rate.
22. The variable rate speech compression system of claim 21, where the first set of predictor coefficients comprises a first vector represented as {0.45782564, 0.59002827, 0.73704688, 0.73388197, 0.75903791, 0.74076479, 0.65966007, 0.58070788, 0.52280647, 0.42738207} and a second vector represented as {0.19087084, 0.26721569, 0.38110463, 0.39655069, 0.43984539, 0.42178869, 0.34869783, 0.28691864, 0.23847475, 0.17468375}.
23. The variable rate speech compression system of claim 21, where the second set of predictor coefficients comprises a first vector represented as {0.14936742, 0.25397094, 0.42536339, 0.40318214, 0.39778242, 0.34731435, 0.22773174, 0.17583478, 0.12497067, 0.11001108} and a second vector represented as {0.09932127, 0.15389237, 0.24021347, 0.24507006, 0.26478926, 0.23018456, 0.15178193, 0.11368182, 0.07674584, 0.06122567}.
24. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
an encoding system operable to determine a bit rate and a type classification for the speech signal, the bit rate comprising a first rate and a second rate, and the type classification comprising a first type and a second type, the encoding system comprising:
a line spectrum frequency prediction error quantization table selectable as a function of the bit rate, wherein the encoding system is operable to encode short-term predictor parameters of the speech signal with the line spectrum frequency prediction error quantization table;
an interpolation module operable with the line spectrum frequency prediction error quantization table to encode short-term predictor parameters, when the bit rate is the first rate and the type classification is the first type;
a line spectrum frequency predictor coefficient table selectable as a function of the bit rate, wherein the encoding system is operable to generate predicted line spectrum frequencies with the line spectrum frequency predictor coefficient table; and
a predictor switch module operable with the line spectrum frequency predictor coefficient table to generate predicted line spectrum frequencies, when the bit rate is the second rate.
Dependent claims: 25-32
33. A method of processing a speech signal with a variable rate speech compression system, the method comprising:
determining a rate and a type for the speech signal;
encoding short-term predictor parameters of the speech signal with a line spectrum frequency prediction error quantization table as a function of the rate;
jointly encoding an adaptive codebook gain and a fixed codebook gain of the speech signal with a 2D gain quantization table when the type is a first type;
encoding the adaptive codebook gain with a pre-gain quantization table as a function of the rate when the type is a second type; and
encoding the fixed codebook gain with a delayed gain quantization table as a function of the rate when the type is the second type.
Dependent claims: 34-43
selecting one of a first set of predictor coefficients and a second set of predictor coefficients from a line spectrum frequency predictor coefficients table; and
determining predicted line spectrum frequencies with the selected set of predictor coefficients.
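As an illustration only, the gain-encoding branches of claim 33 can be sketched as a dispatch on the type. The names FrameType and gain_tables_for are hypothetical and do not come from the patent; table contents and the rate-dependent choice among table variants are left out.

```python
from enum import Enum


class FrameType(Enum):
    """Hypothetical labels for the two type classifications in the claims."""
    FIRST = 1   # gains are quantized jointly
    SECOND = 2  # gains are quantized separately


def gain_tables_for(frame_type: FrameType) -> dict:
    """Map each gain to the kind of table that encodes it (illustrative helper)."""
    if frame_type is FrameType.FIRST:
        # One 2D table encodes the (adaptive, fixed) gain pair jointly.
        return {"adaptive": "2d_gain", "fixed": "2d_gain"}
    # Second type: the adaptive gain uses the pre-gain table and the
    # fixed gain uses the delayed gain table.
    return {"adaptive": "pre_gain", "fixed": "delayed_gain"}


print(gain_tables_for(FrameType.SECOND))
```

In a fuller sketch the returned names would also be keyed by the rate, since the claims describe the pre-gain and delayed gain tables as selectable as a function of the rate.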
36. The method of claim 33, where encoding with the line spectrum frequency prediction error quantization table when the rate is a full rate and the type is the first type comprises:
selecting one of a plurality of interpolation paths; and
adjusting the weighting of previously quantized line spectrum frequencies and the weighting of currently quantized line spectrum frequencies with the interpolation path.
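A minimal sketch of the weighting described in claim 36, assuming an interpolation path amounts to a weight w on the previously quantized line spectrum frequencies and 1 - w on the currently quantized ones. The function name, the linear form, and the example weights are assumptions; the claim does not give the actual paths.

```python
def interpolate_lsf(prev_lsf, cur_lsf, weight_prev):
    """Blend previous and current quantized LSFs element by element."""
    if len(prev_lsf) != len(cur_lsf):
        raise ValueError("LSF vectors must have the same length")
    if not 0.0 <= weight_prev <= 1.0:
        raise ValueError("weight_prev must lie in [0, 1]")
    w = weight_prev
    return [w * p + (1.0 - w) * c for p, c in zip(prev_lsf, cur_lsf)]


# Two hypothetical paths: one leans on the previous frame, one on the current.
prev_frame = [0.0, 1.0]
cur_frame = [1.0, 1.0]
for w in (0.5, 0.0):
    print(w, interpolate_lsf(prev_frame, cur_frame, w))
```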
37. The method of claim 33, where encoding with the pre-gain quantization table when the rate is a full rate comprises:
determining the adaptive codebook gain for each of four subframes of a frame of the speech signal; and
analyzing vectors in the pre-gain quantization table comprising a first vector represented as {0.60699869, 0.59090763, 0.64920781, 0.64610492} to select one of the vectors with elements representing the adaptive codebook gain of each of the subframes.
38. The method of claim 33, where encoding with the pre-gain quantization table when the rate is a half rate comprises:
determining the adaptive codebook gain for each of three subframes of a frame of the speech signal; and
analyzing vectors in the pre-gain quantization table comprising a first vector represented as {1.16184904, 1.16859789, 1.13656320} to select one of the vectors with elements representing the adaptive codebook gain of each of the subframes.
39. The method of claim 33, where encoding with the delayed gain quantization table when the rate is a half rate comprises:
completing the search in a fixed codebook for each of three subframes of a frame of the speech signal;
determining the fixed codebook gain for each of the subframes; and
analyzing vectors in the delayed gain quantization table comprising a first vector represented as {0.18423671, 0.06523999, 0.13390472} to select one of the vectors with elements representing the fixed codebook gain of each of the subframes.
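Claims 37 to 39 each select one table vector whose elements represent the per-subframe gains. The sketch below shows one common way to do such a selection, a minimum squared error search; the claims do not state the selection criterion, so that choice is an assumption. In the toy table only the first entry is quoted from claim 38, and the second entry is a made-up placeholder.

```python
def nearest_vector(table, target):
    """Return (index, vector) of the table entry with the least squared error."""
    for vec in table:
        if len(vec) != len(target):
            raise ValueError("table vectors and target must have equal length")

    def squared_error(vec):
        return sum((v - t) ** 2 for v, t in zip(vec, target))

    index = min(range(len(table)), key=lambda i: squared_error(table[i]))
    return index, table[index]


toy_pre_gain_table = [
    [1.16184904, 1.16859789, 1.13656320],  # first vector quoted in claim 38
    [0.5, 0.5, 0.5],                       # placeholder, NOT from the patent
]

# Hypothetical adaptive codebook gains for three subframes of a frame.
index, vector = nearest_vector(toy_pre_gain_table, [1.1, 1.2, 1.1])
print(index, vector)
```

The returned index is what an encoder of this kind would typically place in the bitstream, with the decoder looking up the same table entry.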
40. The method of claim 33, where encoding with the delayed gain quantization table when the type is the second type comprises:
representing the fixed codebook gain for each of a plurality of subframes of a frame of the speech signal with a fixed codebook energy;
generating a predicted fixed codebook energy for each of the subframes with quantized fixed codebook energy errors from a plurality of subframes of a previous frame and a plurality of predictor coefficients;
forming a vector with the difference in the fixed codebook energy and the predicted fixed codebook energy; and
selecting a corresponding vector from the delayed gain quantization table.
41. The method of claim 40, where generating the predicted fixed codebook energy comprises multiplying the quantized fixed codebook energy errors by the predictor coefficients, the predictor coefficients comprising a first subframe predictor coefficient represented as {0.7, 0.6, 0.4, 0.2}, a second subframe predictor coefficient represented as {0.4, 0.2, 0.1, 0.05}, a third subframe predictor coefficient represented as {0.3, 0.2, 0.075, 0.025} and a fourth subframe predictor coefficient represented as {0.2, 0.075, 0.025, 0.0}.
42. The method of claim 40, where generating the predicted fixed codebook energy comprises multiplying the quantized fixed codebook energy errors by the predictor coefficients, the predictor coefficients comprising:
a first predictor coefficient represented as {0.6, 0.3, 0.1};
a second predictor coefficient represented as {0.4, 0.25, 0.1}; and
a third predictor coefficient represented as {0.3, 0.15, 0.075};
wherein the rate selection is a half rate.
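The prediction in claims 40 to 42 can be sketched as a weighted sum of the previous frame's quantized energy errors, one coefficient set per subframe. The coefficient values below are quoted from claims 41 and 42; the function names and the ordering of the previous errors relative to the coefficients are assumptions, since the claims do not specify them.

```python
import math

# Per-subframe predictor coefficients quoted from claim 41 (four subframes).
FOUR_SUBFRAME_COEFFS = [
    [0.7, 0.6, 0.4, 0.2],
    [0.4, 0.2, 0.1, 0.05],
    [0.3, 0.2, 0.075, 0.025],
    [0.2, 0.075, 0.025, 0.0],
]

# Predictor coefficients quoted from claim 42 (half rate).
THREE_SUBFRAME_COEFFS = [
    [0.6, 0.3, 0.1],
    [0.4, 0.25, 0.1],
    [0.3, 0.15, 0.075],
]


def predict_energies(prev_errors, coeffs):
    """Predicted energy per subframe: weighted sum of previous-frame errors."""
    return [sum(c * e for c, e in zip(row, prev_errors)) for row in coeffs]


def energy_error_vector(energies, prev_errors, coeffs):
    """Difference between the actual and the predicted energies."""
    predicted = predict_energies(prev_errors, coeffs)
    return [e - p for e, p in zip(energies, predicted)]


# Hypothetical inputs: a single nonzero error in the previous frame.
predicted = predict_energies([1.0, 0.0, 0.0, 0.0], FOUR_SUBFRAME_COEFFS)
print(predicted)
```

The resulting error vector is what the last step of claim 40 would match against the delayed gain quantization table.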
43. The method of claim 33, where jointly encoding with the 2D gain quantization table when the rate is the full rate comprises analyzing vectors within the 2D gain quantization table, the vectors comprising a first vector represented as {1.13718400, 2.00167200}.
44. A method of processing a speech signal, the method comprising:
selecting a bit rate and a type classification;
converting short-term predictor parameters extracted from the speech signal to line spectrum frequencies;
determining predicted line spectrum frequencies with a line spectrum frequency predictor coefficients table when the bit rate selected is a first rate;
determining predicted line spectrum frequencies with a line spectrum frequency predictor coefficients table and a predictor switch module when the bit rate selected is a second rate;
subtracting predicted line spectrum frequencies from line spectrum frequencies to generate a line spectrum frequencies prediction error;
quantizing the line spectrum frequencies prediction error to produce quantized line spectrum frequencies; and
modifying the quantized line spectrum frequencies with an interpolation module when the bit rate selected is the first rate and the type classification is a first type;
wherein when the bit rate is the second rate, determining predicted line spectrum frequencies comprises selecting one of:
a first set of predictor coefficients, the first set of predictor coefficients including a first vector represented as {0.45782564, 0.59002827, 0.73704688, 0.73388197, 0.75903791, 0.74076479, 0.65966007, 0.58070788, 0.52280647, 0.42738207}; and
a second set of predictor coefficients, the second set of predictor coefficients including a first vector represented as {0.14936742, 0.25397094, 0.42536339, 0.40318214, 0.39778242, 0.34731435, 0.22773174, 0.17583478, 0.12497067, 0.11001108}.
Dependent claims: 45-47
analyzing the degree of spectral variations between a plurality of subframes of a frame of the speech signal; and
selecting an interpolation path as a function of the spectral variations.
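A rough sketch of the predict-and-subtract steps of claim 44. It assumes, without support in the claim text, that each coefficient vector weights the quantized prediction error of one earlier frame, element by element. The coefficient values are the first set quoted in claim 22; the function names and inputs are hypothetical.

```python
# First set of predictor coefficients, quoted from claim 22.
FIRST_SET = [
    [0.45782564, 0.59002827, 0.73704688, 0.73388197, 0.75903791,
     0.74076479, 0.65966007, 0.58070788, 0.52280647, 0.42738207],
    [0.19087084, 0.26721569, 0.38110463, 0.39655069, 0.43984539,
     0.42178869, 0.34869783, 0.28691864, 0.23847475, 0.17468375],
]


def predict_lsf(coeff_vectors, past_errors):
    """Element-wise weighted sum of past quantized prediction errors.

    past_errors[k] is assumed to be the error vector paired with
    coeff_vectors[k]; this pairing is an assumption of the sketch.
    """
    order = len(coeff_vectors[0])
    return [
        sum(c[i] * e[i] for c, e in zip(coeff_vectors, past_errors))
        for i in range(order)
    ]


def prediction_error(lsf, predicted):
    """Subtract predicted LSFs from the actual LSFs."""
    return [x - p for x, p in zip(lsf, predicted)]


# Hypothetical history: unit error one frame back, zero error before that.
history = [[1.0] * 10, [0.0] * 10]
predicted = predict_lsf(FIRST_SET, history)
print(predicted)
```

Under the same assumption, a predictor switch for the second rate could run this computation once per coefficient set and keep whichever set gives the smaller prediction error; the claims leave the switching rule unspecified.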
Specification