Prototype waveform magnitude quantization for a frequency domain interpolative speech codec system

US 6,996,523 B1
Filed: 02/13/2002
Issued: 02/07/2006
Est. Priority Date: 02/13/2001
Status: Expired due to Term

First Claim

Patent Images

1. A frequency domain interpolative CODEC system for low bit rate coding of speech, comprising:

a linear prediction (LP) front end adapted to process an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;

an open loop pitch estimator adapted to process said LP residual signal, a pitch quantizer, and a pitch interpolator and provide a pitch contour within the predetermined intervals; and

a signal processor responsive to said LP residual signal and the pitch contour and adapted to perform the following steps;

extract a prototype waveform (PW) from the LP residual and the open loop pitch contour for a number of equal sub-intervals within the predetermined intervals;

normalize the PW by said PW'"'"'s gain;

represent a variable dimension PW in a magnitude domain without further decomposition of said PW into complex components in a mean plus deviations form in multiple bands;

compute a voicing measure, said voicing measure characterizing a degree of voicing of said input speech signal and is derived from several input parameters that are correlated to degrees of periodicity of the signal over the predetermined intervals;

provide for a voicing classification for the predetermined intervals based on the computed voicing measure; and

quantize the PW multi-band mean plus deviations for all speech frames in a magnitude domain using a hierarchical quantization method that employs fixed dimension vector quantizers (VQ) with parameters based on the voicing classification.

View all claims

13 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method is provided that employs a frequency domain interpolative CODEC system for low bit rate coding of speech which comprises a linear prediction (LP) front end adapted to process an input signal that provides LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal. An open loop pitch estimator adapted to process the LP residual signal, a pitch quantizer, and a pitch interpolator and provide a pitch contour within the predetermined intervals is also provided. Also provided is a signal processor responsive to the LP residual signal and the pitch contour and adapted to perform the following: provide a voicing measure, where the voicing measure characterizes a degree of voicing of the input speech signal and is derived from several input parameters that are correlated to degrees of periodicity of the signal over the predetermined intervals; extract a prototype waveform (PW) from the LP residual and the open loop pitch contour for a number of equal sub-intervals within the predetermined intervals; normalize the PW by a gain value of the PW; encode a magnitude of the PW; and directly quantize the PW in a magnitude domain without further decomposition of the PW into complex components, where the direct quantization is performed by a hierarchical quantization method based on a voicing classification using fixed dimension vector quantizers (VQ'"'"'s).

Citations

8 Claims

1. A frequency domain interpolative CODEC system for low bit rate coding of speech, comprising:
- a linear prediction (LP) front end adapted to process an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
  
  an open loop pitch estimator adapted to process said LP residual signal, a pitch quantizer, and a pitch interpolator and provide a pitch contour within the predetermined intervals; and
  
  a signal processor responsive to said LP residual signal and the pitch contour and adapted to perform the following steps;
  
  extract a prototype waveform (PW) from the LP residual and the open loop pitch contour for a number of equal sub-intervals within the predetermined intervals;
  
  normalize the PW by said PW'"'"'s gain;
  
  represent a variable dimension PW in a magnitude domain without further decomposition of said PW into complex components in a mean plus deviations form in multiple bands;
  
  compute a voicing measure, said voicing measure characterizing a degree of voicing of said input speech signal and is derived from several input parameters that are correlated to degrees of periodicity of the signal over the predetermined intervals;
  
  provide for a voicing classification for the predetermined intervals based on the computed voicing measure; and
  
  quantize the PW multi-band mean plus deviations for all speech frames in a magnitude domain using a hierarchical quantization method that employs fixed dimension vector quantizers (VQ) with parameters based on the voicing classification.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. A system as recited in claim 1 wherein the representation of the variable dimension PW is in fixed but unequal bands at each sub-interval, and the means are computed as a spectrally weighted average of the PW magnitude in each band and at each sub-interval.
  - 3. A system as recited in claim 2, wherein in the quantization step a fixed dimensional PW mean vector is derived using all the PW means as its elements at each sub-interval.
  - 4. A system as recited in claim 3, wherein for frames classified as voiced the quantization step comprises:
    - a backward predictive vector quantization of the fixed dimensional PW means vector for a last sub-interval;
      
      reconstruction of the quantized PW means vector for the last sub-interval by inverse backward vector quantization; and
      
      reconstruction of the quantized PW means vector for intermediate sub-intervals by linear interpolation.
  - 5. A systems as recited in claim 3 wherein for frames classified as unvoiced the quantization step comprises:
    - a backward predictive vector quantization of the fixed dimensional PW means vector for a middle sub-interval;
      
      a backward predictive vector quantization of the fixed dimensional PW means vector for a last sub-interval;
      
      reconstruction of the quantized PW means vector for the middle sub-interval by inverse backward predictive vector quantization;
      
      reconstruction of the quantized PW means vector for the last sub-interval by inverse backward predictive vector quantization; and
      
      reconstruction of the quantized PW means vector for intermediate sub-intervals by linear interpolation.
  - 6. A system as recited in claim 3 wherein the quantization step comprises:
    - derivation of a variable dimensional PW deviations vector as a difference between the PW magnitude spectra and a reconstructed quantized means in each band and for each sub-interval;
      
      selection of a fixed number of perceptually significant harmonics at each of a plurality of selected time instants by a procedure that emphasizes low frequencies while precluding frequencies below 200 Hz at each said selected time instant; and
      
      conversion of the variable dimensional PW deviations vector to a fixed dimensional PW deviations vector comprising elements that are PW deviations at the selected harmonics.
  - 7. A system as recited in claim 6 further comprising of the following steps for frames classified as voiced:
    - backward predictive multi-stage vector quantization of the fixed dimensional PW deviations vector for a middle sub-interval;
      
      backward predictive multi-stage vector quantization of the fixed dimensional PW deviations vector for a last sub-interval;
      
      reconstruction of the fixed dimensional quantized PW deviations vector for the middle sub-interval by inverse backward predictive vector quantization;
      
      reconstruction of the fixed dimensional quantized PW deviations vector for the last sub-interval by inverse backward predictive vector quantization;
      
      reconstruction of the variable dimensional quantized PW vector for the middle and last sub-intervals as a sum of the reconstructed quantized PW mean at each harmonic frequency plus a harmonic deviation if the harmonic frequency is one of the selected harmonics; and
      
      reconstruction of the variable dimensional quantized PW vector for intermediate sub-intervals by linear interpolation.
  - 8. A system as recited in claim 6 further comprising of the following steps for frames classified as unvoiced:
    - vector quantization of the fixed dimensional PW deviations vector for a middle sub-interval;
      
      vector quantization of the fixed dimensional PW deviations vector for a last sub-interval;
      
      reconstruction of the fixed dimensional quantized PW deviations vector for the middle sub-interval by inverse vector quantization;
      
      reconstruction of the fixed dimensional quantized PW deviations vector for the last sub-frame by inverse vector quantization;
      
      reconstruction of the variable dimensional quantized PW vector for the middle and last sub-intervals as a sum of the reconstructed quantized PW mean at each harmonic frequency plus a harmonic deviation if the harmonic frequency is one of the selected harmonics; and
      
      reconstruction of the variable dimensional quantized PW vector for intermediate sub-intervals by linear interpolation.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hughes Network Systems LLC (Echostar Corporation)
Original Assignee
Hughes Electronics Corporation (AT&T, Inc.)
Inventors
Bhaskar, Udaya, Swaminathan, Kumar
Primary Examiner(s)
Azad, Abul K.

Application Number

US10/073,128
Time in Patent Office

1,455 Days
Field of Search

704/207, 704/216, 704/221, 704/222, 704/223, 704/230, 704/219, 704/265
US Class Current

704/222
CPC Class Codes

G10L 19/032 Quantisation or dequantisat...

G10L 19/097 using prototype waveform de...

Prototype waveform magnitude quantization for a frequency domain interpolative speech codec system

First Claim

13 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Prototype waveform magnitude quantization for a frequency domain interpolative speech codec system

First Claim

13 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links