Encoding and decoding speech signals variably based on signal classification
Abstract
A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.
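The selection logic described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names `Rate` and `select_codec` are hypothetical, and the type values 0 and 1 are assumed from the "type zero" and "type one" wording used in the claims.

```python
from enum import Enum


class Rate(Enum):
    """The four rate selections named in the abstract."""
    FULL = "full"
    HALF = "half"
    QUARTER = "quarter"
    EIGHTH = "eighth"


def select_codec(rate, frame_type=None):
    """Return a label for the codec path chosen for one frame.

    Hypothetical helper: only the full- and half-rate codecs also
    depend on the type classification, as the abstract states.
    """
    if rate in (Rate.FULL, Rate.HALF):
        if frame_type not in (0, 1):
            raise ValueError("full- and half-rate frames need type 0 or 1")
        return f"{rate.value}-rate codec, type {frame_type}"
    # Quarter and eighth rate ignore the type classification.
    return f"{rate.value}-rate codec"
```

For example, `select_codec(Rate.HALF, 1)` picks the half-rate, type-one path, while `select_codec(Rate.EIGHTH)` needs no type at all.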
71 Claims
1. A variable rate speech compression system for processing a frame of a speech signal to form an encoded speech signal, the speech compression system comprising:
means for generating a first portion of the encoded speech signal as a function of a type classification and a rate selection of the frame;
means for generating a second portion of the encoded speech signal as a function of the type classification and the rate selection;
means for receiving the encoded speech signal and reconstructing linear prediction coefficients for the frame as a function of the rate selection;
means for receiving the encoded speech signal and reconstructing short term excitation as a function of the rate selection and the type classification of the frame; and
means for assembling the short-term excitation and the linear prediction coefficients to generate synthesized speech;
where the means for receiving the encoded speech signal and reconstructing the excitation is operable to reconstruct the short term excitation on a subframe basis when the type classification of the frame is type zero.
Dependent claims: 2, 3, 4, 5, 6, 7.
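The final "assembling" step of claim 1 corresponds, in conventional linear-prediction decoders, to passing the excitation through an all-pole synthesis filter. The sketch below assumes the common convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, so that s[n] = e[n] - sum_k a_k s[n-k]; the function name `synthesize`, the `history` argument, and the sign convention are illustrative assumptions, not taken from the patent.

```python
def synthesize(excitation, lpc, history=None):
    """All-pole synthesis: s[n] = e[n] - sum_k lpc[k-1] * s[n-k].

    `lpc` holds a_1..a_p (assumed convention). `history` holds at
    least p previous output samples, most recent last, so the filter
    memory can be carried from one subframe to the next.
    """
    p = len(lpc)
    past = list(history) if history is not None else [0.0] * p
    out = []
    for e in excitation:
        # past[-k] is the output sample k steps back.
        s = e - sum(a * past[-k] for k, a in enumerate(lpc, start=1))
        out.append(s)
        past.append(s)
    return out
```

Calling `synthesize` once per subframe and passing the previous output as `history` gives the same result as filtering the whole frame in one call, which is one way subframe-based excitation could be assembled into continuous speech.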
8. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the post-processing module comprises a long-term filter module operable to perform a fine-tuning search for a pitch period of the synthesized speech.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16, 17, 18.
19. A system for processing a speech signal to generate synthesized speech, the speech compression system comprising:
a first decoder operable to decode a first frame of the speech signal as a function of a rate selected during encoding of the first frame, the first decoder comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients of the speech signal; and
a plurality of excitation reconstruction modules operable to reconstruct short-term excitation of the speech signal as a function of a type classification selected during encoding of the first frame; and
a second decoder operable to decode a second frame of the speech signal as a function of the rate selected during encoding of the second frame, the second decoder comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients of the encoded speech signal; and
an excitation reconstruction module operable to reconstruct short term excitation of the speech signal absent the type classification;
where the second decoder is one of a quarter-rate decoder operable at a rate of 2 kilobits per second and an eighth-rate decoder operable at a rate of 0.8 kilobits per second.
Dependent claims: 20, 21, 22.
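To put the two rates named in claim 19 in perspective, the short sketch below converts a bit rate into a per-frame bit budget. The 20 ms frame duration is an assumption made only for this illustration (the claims do not state a frame length), and `bits_per_frame` is a hypothetical helper.

```python
FRAME_MS = 20  # assumed frame duration in milliseconds; not stated in the claims


def bits_per_frame(bits_per_second, frame_ms=FRAME_MS):
    """Bits available to encode one frame at the given bit rate."""
    return bits_per_second * frame_ms // 1000


quarter_rate_bits = bits_per_frame(2000)  # 2 kilobits per second
eighth_rate_bits = bits_per_frame(800)    # 0.8 kilobits per second
```

Under that assumed frame length, the quarter-rate decoder would consume 40 bits per frame and the eighth-rate decoder 16 bits per frame.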
23. A method of decoding a frame of a speech signal previously encoded with a variable rate encoding system, the method comprising:
a) reconstructing short-term excitation as a function of a bit rate and a type classification selected when the frame was encoded;
b) reconstructing linear prediction coefficients as a function of the bit rate;
c) generating synthesized speech as a function of the short-term excitation and the linear prediction coefficients; and
d) filtering and compensating the synthesized speech as a function of the bit rate;
where d) comprises performing a fine-tuning search for a pitch period of the synthesized speech as a function of the bit rate.
Dependent claims: 24, 25, 26, 27, 28, 29, 30.
31. A variable rate speech compression system for processing a frame of a speech signal to form an encoded speech signal, the speech compression system comprising:
means for generating a first portion of the encoded speech signal as a function of a type classification and a rate selection of the frame;
means for generating a second portion of the encoded speech signal as a function of the type classification and the rate selection;
means for receiving the encoded speech signal and reconstructing linear prediction coefficients for the frame as a function of the rate selection;
means for receiving the encoded speech signal and reconstructing short term excitation as a function of the rate selection and the type classification of the frame; and
means for assembling the short-term excitation and the linear prediction coefficients to generate synthesized speech;
where the means for receiving the encoded speech signal and reconstructing the excitation is operable to reconstruct the short term excitation on a subframe basis and on a frame basis when the type classification of the frame is type one.
Dependent claims: 32, 33, 34, 35, 36, 37.
38. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the linear prediction coefficient reconstruction module further comprises an interpolation module when the selected bit rate is a full rate and the type classification is type zero.
Dependent claims: 39, 40, 41, 42, 43, 60.
44. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the linear prediction coefficient reconstruction module further comprises a predictor switch module when the selected bit rate is a half rate.
Dependent claims: 45, 46, 47, 48.
49. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the excitation reconstruction module is operable to reconstruct the short-term excitation on a subframe basis when the type classification is type zero.
Dependent claims: 50, 51, 52.
53. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the excitation reconstruction module is operable to reconstruct the short-term excitation on a subframe basis and on a frame basis when the type classification is type one.
Dependent claims: 54, 55.
56. A speech compression system for processing a speech signal, the speech compression system comprising:
a decoding system operable to receive a selected bit rate and decode the speech signal to generate synthesized speech, the decoding system comprising;
a linear prediction coefficient reconstruction module operable to reconstruct linear prediction coefficients as a function of the selected bit rate;
an excitation reconstruction module operable to reconstruct short-term excitation as a function of the selected bit rate and a type classification of the speech signal;
a synthesis filter module operable to assemble the short-term excitation and the linear prediction coefficients to generate synthesized speech; and
a post-processing module operable to filter and compensate the synthesized speech as a function of the selected bit rate;
where the excitation reconstruction module comprises an adaptive codebook, a fixed codebook, a 2D/VQ gain codebook, a 3D/4D open loop VQ codebook and a 3D/4D VQ gain codebook.
Dependent claims: 57.
58. A method of decoding a frame of a speech signal previously encoded with a variable rate encoding system, the method comprising:
a) reconstructing short-term excitation as a function of a bit rate and a type classification selected when the frame was encoded;
b) reconstructing linear prediction coefficients as a function of the bit rate;
c) generating synthesized speech as a function of the short-term excitation and the linear prediction coefficients; and
d) filtering and compensating the synthesized speech as a function of the bit rate;
where d) comprises performing a fine-tuning search as a function of pitch correlation and gain controlled harmonic filtering, where at least one of the fine-tuning search and the gain controlled harmonic filtering is dependent on the bit rate.
Dependent claims: 59, 61, 62.
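One generic way to perform a fine-tuning search driven by pitch correlation is to evaluate the normalized correlation of the synthesized speech at a few lags around a coarse pitch estimate and keep the best one. The sketch below illustrates that general technique only; the function `refine_pitch`, the `radius` parameter, and the choice of normalized cross-correlation as the score are assumptions, and the claim does not specify this procedure.

```python
import math


def refine_pitch(signal, coarse_lag, radius=2):
    """Return the lag near `coarse_lag` with the highest normalized correlation.

    Hypothetical helper: searches lags in [coarse_lag - radius,
    coarse_lag + radius] and scores each by
    sum(s[n] * s[n-lag]) / sqrt(energy(s[n]) * energy(s[n-lag])).
    """
    best_lag, best_score = coarse_lag, float("-inf")
    low = max(1, coarse_lag - radius)
    for lag in range(low, coarse_lag + radius + 1):
        if lag >= len(signal):
            break  # not enough samples to compare at this lag
        idx = range(lag, len(signal))
        num = sum(signal[n] * signal[n - lag] for n in idx)
        e_now = sum(signal[n] ** 2 for n in idx)
        e_past = sum(signal[n - lag] ** 2 for n in idx)
        if e_now == 0 or e_past == 0:
            continue  # silent segment; correlation is undefined
        score = num / math.sqrt(e_now * e_past)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag
```

A rate-dependent variant could, for instance, widen or narrow `radius` depending on the bit rate, which is one possible reading of a search that is "dependent on the bit rate".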
63. A method of decoding a frame of a speech signal previously encoded with a variable rate encoding system, the method comprising:
a) reconstructing short-term excitation as a function of a bit rate and a type classification selected when the frame was encoded;
b) reconstructing linear prediction coefficients as a function of the bit rate;
c) generating synthesized speech as a function of the short-term excitation and the linear prediction coefficients; and
d) filtering and compensating the synthesized speech as a function of the bit rate;
where a) comprises reconstructing the short-term excitation on a subframe basis when the type classification is type zero.
Dependent claims: 64, 65, 66.
67. A method of decoding a frame of a speech signal previously encoded with a variable rate encoding system, the method comprising:
a) reconstructing short-term excitation as a function of a bit rate and a type classification selected when the frame was encoded;
b) reconstructing linear prediction coefficients as a function of the bit rate;
c) generating synthesized speech as a function of the short-term excitation and the linear prediction coefficients; and
d) filtering and compensating the synthesized speech as a function of the bit rate;
where a) comprises reconstructing short-term excitation on a subframe basis and on a frame basis when the type classification is type one.
Dependent claims: 68, 69.
70. A method of decoding a frame of a speech signal previously encoded with a variable rate encoding system, the method comprising:
a) reconstructing short-term excitation as a function of a bit rate and a type classification selected when the frame was encoded;
b) reconstructing linear prediction coefficients as a function of the bit rate;
c) generating synthesized speech as a function of the short-term excitation and the linear prediction coefficients; and
d) filtering and compensating the synthesized speech as a function of the bit rate;
where b) comprises reconstructing the linear prediction coefficients as a function of the type classification when the rate is a full rate.
Dependent claims: 71.
Specification