Speech compression system and method
First Claim
Patent Images
1. A speech compression system for processing a speech signal, the speech compression system comprising:
- a mode selection module configured to select one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal; and
a prediction module configured to predict a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, the prediction module is further configured to predict the fixed codebook characteristic as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame;
wherein a fixed codebook gain for each of the subframe is represented with respective predicted fixed codebook characteristic when the second framing structure is selected;
wherein the speech compression system derives a pitch gain for each of a plurality of subframes during pitch pre-processing, quantizes the pitch gain of each of the subframes, and performs a delayed joint quantization of the fixed codebook gains for each of the subframes as a function of the stored quantized pitch gain of each of the subframes.
10 Assignments
0 Petitions
Accused Products
Abstract
The invention improves the encoding and decoding of speech by focusing the encoding on the perceptually important characteristics of speech. The system analyzes selected features of an input speech signal, and first performing a common frame based speech coding of an input speech signal. The system then performs a speech coding based on either a first speech coding mode or a second speech coding mode. The selection of a mode is based on characteristics of the input speech signal. The first speech coding mode uses a first framing structure and the second speech coding mode uses a second framing structure.
36 Citations
16 Claims
-
1. A speech compression system for processing a speech signal, the speech compression system comprising:
-
a mode selection module configured to select one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal; and a prediction module configured to predict a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, the prediction module is further configured to predict the fixed codebook characteristic as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame; wherein a fixed codebook gain for each of the subframe is represented with respective predicted fixed codebook characteristic when the second framing structure is selected; wherein the speech compression system derives a pitch gain for each of a plurality of subframes during pitch pre-processing, quantizes the pitch gain of each of the subframes, and performs a delayed joint quantization of the fixed codebook gains for each of the subframes as a function of the stored quantized pitch gain of each of the subframes. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of processing a speech signal, the method comprising:
-
selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal; predicting a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, wherein the predicting is performed as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame; representing a fixed codebook gain for each of the subframe with respective predicted fixed codebook characteristic when the second framing structure is selected; deriving a pitch gain for each of a plurality of subframes during pitch pre-processing; quantizing the pitch gain of each of the subframes; and performing a delayed joint quantization of the fixed codebook gains for each of the subframes as a function of the stored quantized pitch gain of each of the subframes. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method of processing a speech signal, the method comprising:
-
selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal; predicting a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, wherein the predicting is performed as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame; and representing a fixed codebook gain for each of the subframe with respective predicted fixed codebook characteristic when the second framing structure is selected; wherein the prediction coefficients comprise a first subframe predictor coefficient represented as {0.6, 0.3, 0.1}, a second subframe predictor coefficient represented as {0.4, 0.25, 0.1}, and a third subframe predictor coefficient represented as {0.3, 0.015, 0.075}. - View Dependent Claims (14, 15, 16)
-
Specification