Method and apparatus for hybrid coding of speech at 4KBPS having phase alignment between mode-switched frames
First Claim
1. A method for encoding speech in an encoder for communication to a decoder for reproduction thereof, said speech comprising a plurality of frames of speech, said method comprising the steps of:
- (a) classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character;
(b) representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) quantizing said parameters for communication to said decoder; and
(e) phase aligning reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonic coded.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.
159 Citations
10 Claims
-
1. A method for encoding speech in an encoder for communication to a decoder for reproduction thereof, said speech comprising a plurality of frames of speech, said method comprising the steps of:
-
(a) classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character;
(b) representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) quantizing said parameters for communication to said decoder; and
(e) phase aligning reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonic coded.
-
-
2. A method for encoding speech in an encoder for communication to a decoder for reproduction thereof, said speech comprising a plurality of frames of speech, said method comprising the steps of:
-
(a) classifying each frame of speech into three or more classes wherein one or more of said classes is transitory character;
(b) representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) computing parameter values of said harmonic model where said parameter values characteristic of the frame;
(d) quantizing said parameters for communication to said decoder; and
(e) phase aligning a harmonic coded frame of speech in the decoder when the preceding frame of speech has been waveform coded for pairs of adjacent frames of speech comprising a waveform coded frame of speech adjacent to a harmonic coded frame of speech.
-
-
3. A method for encoding speech in an encoder for communication to a decoder for reproduction thereof, said speech comprising a plurality of frames of speech, said method comprising the steps of:
-
(a) classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character;
(b) representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) quantizing said parameters for communication to said decoder; and
(e) phase aligning the frame in the encoder to be waveform coded when the subsequent frame is to be harmonic coded for pairs of adjacent frames comprising a waveform coded frame adjacent to a harmonic coded frame.
-
-
4. A hybrid method of encoding speech in an encoder for transmission to a decoder for reproduction thereof, comprising the steps of:
-
(a) classifying frames of the speech signal into steady state voiced, stationary unvoiced, or transitory speech segments;
(b) coding a frame with harmonic coding if the frame is classified as steady state voiced speech;
(c) coding a frame with “
noise-like”
coding if the frame is classified as stationary unvoiced speech;
(d) coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding; and
(e) phase aligning a harmonic coded frame of speech in the decoder when the preceding frame of speech had been waveform coded for pairs of adjacent frames of speech comprising a waveform coded frame of speech adjacent to a harmonic coded frame of speech.
-
-
5. A hybrid method of encoding speech in an encoder for transmission to a decoder for reproduction thereof, comprising the steps of:
-
(a) classifying frames of the speech signal into steady state voiced, stationary unvoiced, or transitory speech segments;
(b) coding a frame with harmonic coding if the frame is classified as steady state voiced speech;
(c) coding a frame with “
noise-like”
coding if the frame is classified as stationary unvoiced speech;
(d) coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding; and
(e) phase aligning the frame in the encoder to be waveform coded when the subsequent frame is to be harmonic coded for pairs of adjacent frames comprising a waveform coded frame adjacent to a harmonic coded frame.
-
-
6. A hybrid speech encoder for communication to a decoder for reproduction of speech, said speech comprising a plurality of frames of speech, said encoder comprising:
-
(a) means for classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character;
(b) means for representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) means for computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) means for quantizing said parameters for communication to said decoder; and
(e) means for phase aligning reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonic coded.
-
-
7. A hybrid speech encoder for communication to a decoder for reproduction of speech, said speech comprising a plurality of frames of speech, said encoder comprising:
-
(a) means for classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character;
(b) means for representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) means for computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) means for quantizing said parameters for communication to said decoder; and
(e) means for phase aligning a harmonic coded frame of speech in the decoder when the preceding frame of speech has been waveform coded for pairs of adjacent frames of speech comprising a waveform coded frame of speech adjacent to a harmonic coded frame of speech.
-
-
8. A hybrid speech encoder for communication to a decoder for reproduction of speech, said speech comprising a plurality of frames of speech, said encoder comprising:
-
(a) means for classifying each frame of speech into three or more classes wherein one or more of said classes in transitory in character;
(b) means for representing the speech in a frame of speech associated with at least one of said classes with a harmonic model;
(c) means for computing parameter values of said harmonic model where said parameter values are characteristic of the frame;
(d) means for quantizing said parameters for communication to said decoder; and
(e) means for phase aligning the frame in the encoder to be waveform coded when the subsequent frame is to be harmonic coded for pairs of adjacent frames comprising a waveform coded frame adjacent to a harmonic coded frame.
-
-
9. An apparatus for encoding speech for transmission to a decoder for reproduction thereof, comprising:
-
(a) means for classifying frames of the speech signal as steady state voiced, stationary unvoiced, or transitory speech;
(b) means for coding a frame with harmonic coding if the frame is classified as steady state voiced speech;
(c) means for coding a frame with “
noise-like”
coding if the frame is classified as stationary unvoiced speech;
(d) means for coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding; and
(e) means for phase aligning a harmonic coded frame of speech in the decoder when the preceding frame of speech has been waveform coded for pairs of adjacent frames of speech comprising a waveform coded frame of speech adjacent to a harmonic coded frame of speech.
-
-
10. An apparatus for encoding speech for transmission to a decoder for reproduction thereof, comprising:
-
(a) means for classifying frames of the speech signal as steady state voiced, stationary unvoiced, or transitory speech;
(b) means for coding a frame with harmonic coding if the frame is classified as steady state voiced speech;
(c) means for coding a frame with “
noise-like”
coding if the frame is classified as stationary unvoiced speech;
(d) means for coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding; and
(e) means for phase aligning the frame in the encoder to be waveform coded when the subsequent frame is to be harmonic coded for pairs of adjacent frames comprising a waveform coded frame adjacent to a harmonic coded frame.
-
Specification