Speech compression system and method

US 7,191,122 B1
Filed: 04/22/2005
Issued: 03/13/2007
Est. Priority Date: 09/22/1999
Status: Expired due to Term

10 Assignments

0 Petitions

Abstract

The invention improves the encoding and decoding of speech by focusing the encoding on the perceptually important characteristics of speech. The system analyzes selected features of an input speech signal and first performs a common frame-based speech coding of the input speech signal. The system then performs speech coding based on either a first speech coding mode or a second speech coding mode, with the mode selected according to characteristics of the input speech signal. The first speech coding mode uses a first framing structure and the second speech coding mode uses a second framing structure.

36 Citations

16 Claims

1. A speech compression system for processing a speech signal, the speech compression system comprising:
- a mode selection module configured to select one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal; and
  
  a prediction module configured to predict a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, the prediction module is further configured to predict the fixed codebook characteristic as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame;
  
  wherein a fixed codebook gain for each of the subframe is represented with respective predicted fixed codebook characteristic when the second framing structure is selected;
  
  wherein the speech compression system derives a pitch gain for each of a plurality of subframes during pitch pre-processing, quantizes the pitch gain of each of the subframes, and performs a delayed joint quantization of the fixed codebook gains for each of the subframes as a function of the stored quantized pitch gain of each of the subframes.
  - 2. The speech compression system of claim 1, wherein the first framing structure has a first bit allocation and the second framing structure has a second bit allocation, wherein the first bit allocation is different than the second bit allocation, and wherein each of the first bit allocation and the second bit allocation includes a bit allocation indicative of the selected framing structure.
  - 3. The speech compression system of claim 1, wherein the prediction module applies a third order moving average prediction.
  - 4. The speech compression system of claim 1, wherein the prediction coefficients comprise a first subframe predictor coefficient represented as {0.6, 0.3, 0.1}, a second subframe predictor coefficient represented as {0.4, 0.25, 0.1}, and a third subframe predictor coefficient represented as {0.3, 0.015, 0.075}.
  - 5. The speech compression system of claim 1, wherein the fixed codebook characteristic comprises fixed codebook energy.
  - 6. The speech compression system of claim 1, wherein a fixed codebook characteristic for each of a plurality of subframes is not predicted when the first framing structure is selected, and wherein a fixed codebook gain for each of the subframe is not represented with respective predicted fixed codebook characteristic when the first framing structure is selected.
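
The prediction recited in claims 3 to 5 can be illustrated with a minimal Python sketch. It assumes, beyond what the claims state, that the fixed codebook characteristic is one energy value per subframe, that a frame has three subframes, and that each coefficient set weights the previous frame's subframe energies with the most recent one first. The function and variable names are hypothetical. The coefficient values are copied as printed in claim 4.

```python
# Coefficient sets as printed in claim 4, one set per subframe of the current frame.
PREDICTOR_COEFFS = (
    (0.6, 0.3, 0.1),
    (0.4, 0.25, 0.1),
    (0.3, 0.015, 0.075),
)


def predict_fixed_codebook_energy(prev_frame_energies):
    """Predict one energy per subframe from the previous frame's subframe energies.

    prev_frame_energies: three energies of the previous frame in time order
    (assumed layout). Each prediction is a weighted sum of those three values,
    i.e. a third-order moving average, with the most recent previous subframe
    weighted first (assumed ordering).
    """
    if len(prev_frame_energies) != 3:
        raise ValueError("expected three subframe energies from the previous frame")
    most_recent_first = list(reversed(prev_frame_energies))
    return [
        sum(c * e for c, e in zip(coeffs, most_recent_first))
        for coeffs in PREDICTOR_COEFFS
    ]


# Example with made-up energies for the previous frame's three subframes.
predicted = predict_fixed_codebook_energy([10.0, 20.0, 30.0])
```

Because each subframe has its own coefficient set, the later subframes, which are further from the previous frame's data, apply smaller weights and so lean less on the past values.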

7. A method of processing a speech signal, the method comprising:
- selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal;
  
  predicting a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, wherein the predicting is performed as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame;
  
  representing a fixed codebook gain for each of the subframe with respective predicted fixed codebook characteristic when the second framing structure is selected;
  
  deriving a pitch gain for each of a plurality of subframes during pitch pre-processing;
  
  quantizing the pitch gain of each of the subframes; and
  
  performing a delayed joint quantization of the fixed codebook gains for each of the subframes as a function of the stored quantized pitch gain of each of the subframes.
  - 8. The method of claim 7, wherein the predicting and the representing are not performed when the first framing structure is selected.
  - 9. The method of claim 7, wherein the predicting the fixed codebook characteristic comprises applying a third order moving average prediction.
  - 10. The method of claim 7, wherein the prediction coefficients comprise a first subframe predictor coefficient represented as {0.6, 0.3, 0.1}, a second subframe predictor coefficient represented as {0.4, 0.25, 0.1}, and a third subframe predictor coefficient represented as {0.3, 0.015, 0.075}.
  - 11. The method of claim 7, wherein the fixed codebook characteristic comprises fixed codebook energy.
  - 12. The method of claim 7, wherein the first framing structure has a first bit allocation and the second framing structure has a second bit allocation, wherein the first bit allocation is different than the second bit allocation, and wherein each of the first bit allocation and the second bit allocation includes a bit allocation indicative of the selected framing structure.
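
The last three steps of claim 7 describe a two-pass ordering: pitch gains are quantized and stored first, and the fixed codebook gains are quantized jointly afterwards. The Python sketch below shows one possible reading of that ordering. Everything concrete in it is an assumption rather than the patent's method: the scalar pitch-gain quantizer, the least-squares gain estimate, the toy joint codebook, and all field and function names are hypothetical.

```python
def quantize_scalar(value, levels):
    """Return the level closest to value (stand-in for a real pitch-gain quantizer)."""
    return min(levels, key=lambda level: abs(level - value))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def delayed_joint_gain_quantization(subframes, pitch_levels, gain_codebook):
    """subframes: dicts with a 'pitch_gain' derived earlier plus 'target',
    'adaptive' and 'fixed' vectors (hypothetical field names)."""
    # Pass 1: quantize and store the pitch gain of every subframe.
    stored_pitch_gains = [
        quantize_scalar(sf["pitch_gain"], pitch_levels) for sf in subframes
    ]

    # Pass 2 (delayed): with the stored pitch gains held fixed, estimate each
    # subframe's ideal fixed codebook gain by least squares on the residual target.
    ideal_gains = []
    for sf, gp in zip(subframes, stored_pitch_gains):
        residual = [t - gp * a for t, a in zip(sf["target"], sf["adaptive"])]
        ideal_gains.append(dot(residual, sf["fixed"]) / dot(sf["fixed"], sf["fixed"]))

    # Joint quantization: choose one codebook entry covering all subframes at once.
    def error(entry):
        return sum((g - q) ** 2 for g, q in zip(ideal_gains, entry))

    index = min(range(len(gain_codebook)), key=lambda i: error(gain_codebook[i]))
    return stored_pitch_gains, index, gain_codebook[index]


# Made-up data for three subframes.
frame = [
    {"pitch_gain": 0.72, "target": [1.0, 2.75], "adaptive": [1.0, 1.0], "fixed": [0.0, 1.0]},
    {"pitch_gain": 0.40, "target": [0.0, 1.5], "adaptive": [1.0, 1.0], "fixed": [0.0, 1.0]},
    {"pitch_gain": 0.10, "target": [0.0, 0.5], "adaptive": [1.0, 1.0], "fixed": [0.0, 1.0]},
]
levels = (0.0, 0.25, 0.5, 0.75, 1.0)
codebook = [(1.0, 1.0, 1.0), (2.0, 1.0, 0.5), (0.5, 0.5, 0.5)]
pitch_q, best_index, best_entry = delayed_joint_gain_quantization(frame, levels, codebook)
```

In this sketch the fixed codebook gains depend on the stored quantized pitch gains only through the residual target, which is a design choice made for the example.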

13. A method of processing a speech signal, the method comprising:
- selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the speech signal;
  
  predicting a fixed codebook characteristic for each of a plurality of subframes when the second framing structure is selected, wherein the predicting is performed as a function of prediction coefficients associated with each subframe and a fixed codebook characteristic from each of plurality of subframes of a previous frame; and
  
  representing a fixed codebook gain for each of the subframe with respective predicted fixed codebook characteristic when the second framing structure is selected;
  
  wherein the prediction coefficients comprise a first subframe predictor coefficient represented as {0.6, 0.3, 0.1}, a second subframe predictor coefficient represented as {0.4, 0.25, 0.1}, and a third subframe predictor coefficient represented as {0.3, 0.015, 0.075}.
  - 14. The method of claim 13, wherein the predicting the fixed codebook characteristic comprises applying a third order moving average prediction.
  - 15. The method of claim 13, wherein the fixed codebook characteristic comprises fixed codebook energy.
  - 16. The method of claim 13, wherein the first framing structure has a first bit allocation and the second framing structure has a second bit allocation, wherein the first bit allocation is different than the second bit allocation, and wherein each of the first bit allocation and the second bit allocation includes a bit allocation indicative of the selected framing structure.
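
Claims 2, 12 and 16 require that both bit allocations carry a field indicating the selected framing structure. A minimal Python sketch of that idea follows. The field names, the field widths, and the choice to place the mode bit first are all hypothetical. Only the presence of a mode field in both allocations, and the fact that the allocations differ, come from the claims.

```python
# Hypothetical field widths in bits for each framing structure.
BIT_ALLOCATIONS = {
    0: (("mode", 1), ("lsf", 10), ("pitch", 8), ("gains", 7)),   # first framing structure
    1: (("mode", 1), ("lsf", 10), ("pitch", 5), ("gains", 10)),  # second framing structure
}


def pack_frame(mode, params):
    """Pack parameters into a bit string using the allocation for the given mode."""
    bits = ""
    for name, width in BIT_ALLOCATIONS[mode]:
        value = mode if name == "mode" else params[name]
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name}={value} does not fit in {width} bits")
        bits += format(value, f"0{width}b")
    return bits


def unpack_frame(bits):
    """Read the mode bit first, then parse the rest with the matching allocation."""
    mode = int(bits[0], 2)
    fields, pos = {}, 0
    for name, width in BIT_ALLOCATIONS[mode]:
        fields[name] = int(bits[pos:pos + width], 2)
        pos += width
    return fields


packed = pack_frame(1, {"lsf": 5, "pitch": 3, "gains": 9})
decoded = unpack_frame(packed)
```

Reading the mode field before anything else lets a decoder pick the right allocation without side information.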

Current Assignee: DigiMedia Tech, LLC (IP Investments Group LLC)
Original Assignee: Mindspeed Technologies Inc. (MACOM Technology Solutions Holdings, Inc.)
Inventors: Thyssen, Jes; Shlomot, Eyal; Su, Huan-Yu; Gao, Yang; Benyassine, Adil
Primary Examiner(s): Dorvil, Richemond
Assistant Examiner(s): Opsasnick, Michael N
Application Number: US11/112,394
Time in Patent Office: 690 Days
Field of Search: 704/223, 704/233, 704/230
US Class Current: 704/223
CPC Class Codes:
G10L 19/00   Speech or audio signals ana...
G10L 19/167   Audio streaming, i.e. forma...
G10L 19/20   using sound class specific ...
G10L 19/22   Mode decision, i.e. based o...
G10L 19/24   Variable rate codecs, e.g. ...
G10L 2019/0001   Codebooks
H03G 3/00   Gain control in amplifiers ...