Low bit-rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization

US 6,345,248 B1
Filed: 11/02/1999
Issued: 02/05/2002
Est. Priority Date: 09/26/1996
Status: Expired due to Term

First Claim

1. A system for coding speech, the speech being represented as plural speech samples segregated into a frame, the frame being formed of a plurality of subframes, wherein linear predictive coding (LPC) analysis and quantization of the speech samples in the frame are performed to determine an LPC residual signal, the system comprising:

lag means for estimating an unquantized pitch lag value within a predetermined minimum-allowed pitch lag and a predetermined maximum-allowed pitch lag for each subframe within the frame;

means for obtaining a pitch lag vector comprising the unquantized pitch lag values for each subframe within the frame;

a vector quantizer for quantizing the pitch lag vector to generate a quantized pitch lag vector;

means for determining a pitch contribution vector for a current subframe, the pitch contribution vector being adapted to the quantized pitch lag vector;

codebook means for generating an excitation signal representative of the speech samples of the current subframe; and

means for applying the excitation signal of each current subframe to subsequent subframes to provide coded speech for the frame.
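
The vector quantizer element of the claim can be illustrated with a minimal sketch. It assumes a plain nearest-neighbour search under squared error; the claim does not state the distortion measure or the codebook design, and the function name `quantize_lag_vector` and the toy codebook below are hypothetical.

```python
def quantize_lag_vector(lags, codebook):
    """Return (index, codevector) of the codebook entry closest to `lags`.

    Assumption: squared Euclidean distance is the distortion measure.
    """
    best_index, best_err = 0, float("inf")
    for i, codevector in enumerate(codebook):
        err = sum((lag - c) ** 2 for lag, c in zip(lags, codevector))
        if err < best_err:
            best_index, best_err = i, err
    return best_index, codebook[best_index]


# Hypothetical toy codebook: one entry per candidate pitch contour,
# one component per subframe (three subframes here).
codebook = [
    [40.0, 40.0, 40.0],
    [40.0, 42.0, 44.0],
    [60.0, 58.0, 56.0],
    [80.0, 80.0, 80.0],
]

# Unquantized pitch lags estimated for the three subframes of one frame.
index, quantized = quantize_lag_vector([41.0, 42.5, 43.0], codebook)
```

Only `index` would need to be transmitted; a decoder holding the same codebook recovers the quantized lag for every subframe from that single value.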

Abstract

A pitch lag coding device and method using interframe correlation inherent in pitch lag values to reduce coding bit requirements. A pitch lag value is extracted for a given speech frame, and then refined for each subframe. For every speech frame having N samples of speech, LPC analysis and vector quantization are performed for the whole coding frame. The LPC residual obtained for each frame is then processed such that pitch lag values for all subframes within the coding frame are analyzed concurrently. The remaining coding parameters, i.e., the codebook search, gain parameters, and excitation signal, are then analyzed sequentially according to their respective subframes.
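
As a rough illustration of estimating a pitch lag for every subframe from the LPC residual, the sketch below runs a generic normalized-correlation search per subframe. It is not the adaptive open-loop procedure of the patent, whose details are not given on this page; the function name, the `history` buffer of past residual samples, and the test signal are assumptions made for illustration.

```python
import math


def estimate_subframe_lags(residual, history, num_subframes, lag_min, lag_max):
    """Pick, for each subframe, the lag maximizing normalized correlation.

    residual: LPC residual samples of the current frame.
    history:  past residual samples; must hold at least lag_max samples.
    """
    if len(history) < lag_max:
        raise ValueError("history must contain at least lag_max samples")
    signal = list(history) + list(residual)
    offset = len(history)
    sub_len = len(residual) // num_subframes
    lags = []
    for s in range(num_subframes):
        start = offset + s * sub_len
        best_lag, best_score = lag_min, -float("inf")
        for lag in range(lag_min, lag_max + 1):
            num = 0.0  # correlation between the subframe and its delayed copy
            den = 0.0  # energy of the delayed copy
            for n in range(start, start + sub_len):
                num += signal[n] * signal[n - lag]
                den += signal[n - lag] ** 2
            score = num / math.sqrt(den) if den > 0.0 else 0.0
            if score > best_score:
                best_lag, best_score = lag, score
        lags.append(best_lag)
    return lags


# Synthetic "residual" with a period of 50 samples (illustrative only).
x = [math.sin(2 * math.pi * m / 50) + 0.5 * math.sin(4 * math.pi * m / 50)
     for m in range(320)]
lags = estimate_subframe_lags(x[160:], x[:160], num_subframes=4,
                              lag_min=20, lag_max=90)
```

The resulting list of per-subframe lags is the kind of pitch lag vector that the claims then pass to a vector quantizer.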

16 Claims

1. A system for coding speech, the speech being represented as plural speech samples segregated into a frame, the frame being formed of a plurality of subframes, wherein linear predictive coding (LPC) analysis and quantization of the speech samples in the frame are performed to determine an LPC residual signal, the system comprising:
- lag means for estimating an unquantized pitch lag value within a predetermined minimum-allowed pitch lag and a predetermined maximum-allowed pitch lag for each subframe within the frame;
  
  means for obtaining a pitch lag vector comprising the unquantized pitch lag values for each subframe within the frame;
  
  a vector quantizer for quantizing the pitch lag vector to generate a quantized pitch lag vector;
  
  means for determining a pitch contribution vector for a current subframe, the pitch contribution vector being adapted to the quantized pitch lag vector;
  
  codebook means for generating an excitation signal representative of the speech samples of the current subframe; and
  
  means for applying the excitation signal of each current subframe to subsequent subframes to provide coded speech for the frame.
2. The system of claim 1, further comprising:
3. The system of claim 1, wherein the codebook means comprises a codebook having plural codevectors individually representative of characteristics of the speech, each codevector having an associated gain, further wherein the codevector which best represents the speech samples in the current subframe is selected to generate the excitation signal.
4. The system of claim 3, further comprising:
- means for transmitting the coded speech;
  
  a decoder for receiving and processing the coded speech, the decoder including;
  
  means for retrieving the vector quantized pitch lag, the pitch prediction coefficient, and the codevector and gain;
  
  means for reverse quantizing the retrieved vector quantized pitch lag, the pitch prediction coefficient, and the codevector and gain to produce synthesized speech.

5. A system for coding speech, the speech being represented as plural speech samples segregated into a frame, the frame being formed of a plurality of subframes, wherein linear predictive coding (LPC) analysis and quantization of the speech samples in the frame are performed to determine an LPC residual signal r(n), the system comprising:
- means for estimating an open-loop pitch lag value Lag_op based on the LPC residual signal for the frame of speech;
  
  means for generating a pitch prediction vector R_Lag representing speech samples of a first subframe within the frame, including;
  
  means for constructing an LPC residual signal vector
- - 6. The system of claim 5, wherein the minimum-allowed pitch lag and the maximum-allowed pitch lag are limited by the open-loop pitch lag value.
  - 7. The system of claim 5, wherein the pitch prediction coefficient is selected to minimize error criteria
  - 8. The system of claim 5, wherein the vector quantizer is a multiple-stage vector quantizer.
  - 9. The system of claim 5, wherein the representative codevector having index i and its associated gain α are calculated by minimizing $[C_{i}, \alpha] = \arg\min_{j \in [0, Nc],\, \alpha} \left(Tg - \beta P_{Lag} - \alpha C_{j}^{'}\right)^{2}$.
  - 10. The system of coding speech of claim 5, wherein the system is included in a speech synthesizer and further comprises:
    - means for transmitting the coded speech;
      
      a decoder for receiving and processing the coded speech, the decoder including;
      
      means for retrieving the vector quantized pitch lag, the pitch prediction coefficient, and the codevector index i and gain;
      
      means for reverse quantizing the retrieved vector quantized pitch lag, the pitch prediction coefficient, and the codevector index and gain to produce synthesized speech.
  - 11. The system of claim 5, wherein the unquantized lag value Lag for each subframe in the frame is determined simultaneously for all subframes using an adaptive open-loop searching technique.
  - 12. The system of claim 5, wherein the system of coding speech is implemented in a computer.
  - 13. The system of claim 5, further comprising a filter for filtering the speech signals before LPC analysis and quantization.
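
The minimization recited in claim 9 can be sketched as an exhaustive search. For each candidate codevector, the gain α that minimizes the squared error has a closed form, so only the index needs to be searched. This is a sketch under assumptions: `search_codebook` is a hypothetical name, the codebook entries stand in for the codevectors C_j' of the claim, and the numbers are invented for illustration.

```python
def search_codebook(target, pitch_vec, beta, codebook):
    """Return (index, gain, error) minimizing (Tg - beta*P_Lag - alpha*C_j')^2.

    target:    target vector Tg for the current subframe.
    pitch_vec: pitch prediction vector P_Lag.
    beta:      pitch prediction coefficient.
    codebook:  candidate codevectors C_j' (assumed already in the target domain).
    """
    # Remove the pitch contribution first; the codebook models what is left.
    e = [t - beta * p for t, p in zip(target, pitch_vec)]
    best = None
    for j, c in enumerate(codebook):
        energy = sum(v * v for v in c)
        if energy == 0.0:
            continue  # an all-zero codevector cannot reduce the error
        corr = sum(a * b for a, b in zip(e, c))
        alpha = corr / energy  # least-squares optimal gain for this codevector
        err = sum((a - alpha * b) ** 2 for a, b in zip(e, c))
        if best is None or err < best[2]:
            best = (j, alpha, err)
    return best


# Illustrative values only.
target = [1.0, 2.0, 3.0, 4.0]
pitch_vec = [1.0, 1.0, 1.0, 1.0]
codebook = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
    [1.0, 3.0, 5.0, 7.0],
    [1.0, -1.0, 1.0, -1.0],
]
index, gain, error = search_codebook(target, pitch_vec, 0.5, codebook)
```

Solving for the gain per candidate keeps the search one-dimensional over the index j; a practical coder would typically also quantize the gain, which this sketch omits.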

14. A method of coding input speech using pitch lag information, the speech having a linear predictive coding (LPC) residual signal defined by a plurality of LPC residual samples, wherein the current LPC residual sample is determined in the time domain according to a linear combination of past LPC residual samples, further wherein the input speech has a pitch lag which falls within a minimum and maximum range of pitch lag values, the method comprising the steps of:
- processing the input speech;
  
  segregating N samples of the input speech into a frame, dividing the frame into a plurality of subframes, determining the LPC residual signal for each frame;
  
  lag means for estimating an unquantized pitch lag value within the minimum and maximum range of pitch lags for each subframe within the frame based upon the LPC residual signal for the frame;
  
  obtaining a pitch lag vector comprising the unquantized pitch lag values for each subframe within the frame;
  
  generating a quantized pitch lag vector;
  
  determining a pitch contribution vector for a current subframe, the pitch contribution vector being adapted to the quantized pitch lag vector;
  
  generating an excitation signal representative of the speech samples of the current subframe; and
  
  applying the excitation signal of each current subframe to subsequent subframes to provide coded speech for the frame.
15. The method of claim 14, further comprising the steps of:
16. The method of claim 14, further comprising:
- transmitting the coded speech;
  
  decoding the coded speech, including the steps of;
  
  receiving and processing the coded speech, retrieving the vector quantized pitch lag and the pitch prediction coefficient, reverse quantizing the retrieved vector quantized pitch lag and the pitch prediction coefficient to produce synthesized speech.

Current Assignee
Samsung Electronics Co. Ltd.
Original Assignee
Conexant Systems Incorporated (Synaptics Incorporated)
Inventors
Li, Tom Hong; Su, Huan-Yu
Primary Examiner(s)
Šmits, Tālivaldis Ivars

Application Number

US09/433,002
Time in Patent Office

826 Days
Field of Search

704/207, 704/219, 704/223
US Class Current

704/223
CPC Class Codes

G10L 19/06   Determination or coding of ...

G10L 19/08   Determination or coding of ...

G10L 2019/0011   Long term prediction filter...
