Celp-Based speech coding for fine grain scalability by altering sub-frame pitch-pulse
First Claim
1. A method of encoding a speech signal in a code excited linear prediction (CELP)-based speech processing system that includes an adaptive codebook and a fixed codebook, wherein the speech signal is divided into frames and each frame is further divided into sequential sub-frames, the method comprising:
- generating linear prediction coding (LPC) coefficients for a frame;
generating pitch-related information by using the adaptive codebook, for the sequential sub-frames of the frame;
generating fixed-code pulse information by using the fixed codebook, for a plurality of selected sub-frames of the frame;
generating a first bit-stream corresponding to the frame for the LPC coefficients, the pitch-related information, and the fixed-code pulse information for the plurality of selected sub-frames;
generating fixed-code pulse information by using the fixed codebook, for unselected sub-frames; and
separately generating a second bit-stream corresponding to speech enhancement of the frame from the fixed-code pulse information for the unselected sub-frames.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for providing a CELP-based speech coding with fine grain scalability include a parameter encoder that generates a basic bit-stream from LPC coefficients for a frame, pitch-related information for all the sub-frames obtained by searching an adaptive codebook, and first pulse-related information for even sub-frames obtained by searching a fixed codebook. The parameter encoder also generates enhancement bits, which are preceded by the basic bit-stream, from second pulse-related information for odd sub-frames. The quality of synthesized speech is improved on a basis of one additional odd sub-frame pulse, as more of the second pulse-related information in the enhancement bits is received by a decoder.
-
Citations
18 Claims
-
1. A method of encoding a speech signal in a code excited linear prediction (CELP)-based speech processing system that includes an adaptive codebook and a fixed codebook, wherein the speech signal is divided into frames and each frame is further divided into sequential sub-frames, the method comprising:
-
generating linear prediction coding (LPC) coefficients for a frame; generating pitch-related information by using the adaptive codebook, for the sequential sub-frames of the frame; generating fixed-code pulse information by using the fixed codebook, for a plurality of selected sub-frames of the frame; generating a first bit-stream corresponding to the frame for the LPC coefficients, the pitch-related information, and the fixed-code pulse information for the plurality of selected sub-frames; generating fixed-code pulse information by using the fixed codebook, for unselected sub-frames; and separately generating a second bit-stream corresponding to speech enhancement of the frame from the fixed-code pulse information for the unselected sub-frames. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of synthesizing speech in a code excited linear prediction (CELP)-based speech processing system that includes an adaptive codebook and a fixed codebook, wherein a speech signal is divided into frames and each frame is further divided into sub-frames, the method comprising:
-
receiving a basic bit-stream which includes linear prediction coding (LPC) coefficients for a frame, pitch-related information for all sub-frames of the frame, and first pulse-related information for a plurality of selected sub-frames of the frame; receiving enhancement bits which include second pulse-related information for unselected sub-frames of the frame; generating an excitation by referring to the adaptive codebook based on the pitch-related information included in the basic bit-stream; and by referring to the fixed codebook based on the first pulse-related information included in the basic bit-stream; generating an excitation by referring to the adaptive codebook based on the pitch-related information included in the basic bit-stream and by referring to the fixed codebook based on the part or the whole of the second pulse-related information included in the enhancement bits; and outputting synthesized speech according to the excitations and the LPC coefficients. - View Dependent Claims (9, 10, 11)
-
-
12. A speech processing system based on code excited linear prediction (CELP) for encoding a speech signal, wherein the speech signal is divided into frames and each frame is further divided into sub-frames, the system comprising:
-
a generator of linear prediction coding (LPC) coefficients for a frame; a first portion including an adaptive codebook for generating pitch-related information for each sub-frame of the frame; a second portion including a fixed codebook for generating fixed-code pulse information for each sub-frame of the frame, the pulse-related information including first fixed-code pulse information for a first kind of sub-frame and second fixed-code pulse information for a second kind of sub-frame; and a parameter encoder for generating a basic bit-stream from the LPC coefficients, the pitch-related information, and the first fixed-code pulse information, and for generating enhancement bits from the second pulse-related information. - View Dependent Claims (13, 14, 15)
-
-
16. A speech processing system based on code excited linear prediction (CELP) for synthesizing speech, wherein a speech signal is divided into frames and each frame is further divided into sub-frames, the system comprising:
-
a parameter decoder for extracting linear prediction coding (LPC) coefficients for a frame, pitch-related information for all the sub-frames of the frame, and first pulse-related information for a plurality of selected sub-frames of the frame, from a basic bit-stream received, and for extracting a second pulse-related information for unselected sub-frames of the frame from enhancement bits received; a first portion including an adaptive codebook for generating an excitation based on the pitch-related information; a second portion including a fixed codebook for generating an excitation based on the first pulse-related information or based on the second pulse-related information; and a synthesizer for outputting synthesized speech according to the excitations and the LPC coefficients. - View Dependent Claims (17, 18)
-
Specification