Voicing measure as an estimate of signal periodicity for a frequency domain interpolative speech codec system

US 6,691,092 B1
Filed: 04/04/2000
Issued: 02/10/2004
Est. Priority Date: 04/05/1999
Status: Expired due to Term

First Claim

Patent Images

1. A frequency domain interpolative coding system for low bit-rate coding of speech signals, comprising:

a linear prediction (LP) front end, responsive to an input signal, providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual;

an open loop pitch estimator, responsive to said LP residual signal, a pitch quantizer, and a pitch interpolator yielding a pitch contour within the predetermined interval;

a voice activity detector (VAD) mechanism responsive to said LP parameters and open loop pitch, generating a VAD flag for every predetermined interval;

a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval; and

said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;

a separation of the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component using a low-pass filter along every pitch harmonic track;

a representation of one or more of the components of the normalized PW in spectral magnitude-phase form; and

a characterization of the degree of periodicity of the input signal by a voicing measure, derived from certain parameters that are correlated to signal periodicity and computed from the input signal, PW, SEW and REW over the predetermined interval.

View all claims

13 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system determines a voicing measure as a measure of the degree of signal periodicity and uses the determined voicing measure to quantize the spectral magnitude of the slowly evolving waveform (SEW) and the modeling of the SEW and rapidly evolving waveform (REW) phase spectra.

Citations

12 Claims

1. A frequency domain interpolative coding system for low bit-rate coding of speech signals, comprising:
- a linear prediction (LP) front end, responsive to an input signal, providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual;
  
  an open loop pitch estimator, responsive to said LP residual signal, a pitch quantizer, and a pitch interpolator yielding a pitch contour within the predetermined interval;
  
  a voice activity detector (VAD) mechanism responsive to said LP parameters and open loop pitch, generating a VAD flag for every predetermined interval;
  
  a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval; and
  
  said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
  
  a separation of the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component using a low-pass filter along every pitch harmonic track;
  
  a representation of one or more of the components of the normalized PW in spectral magnitude-phase form; and
  
  a characterization of the degree of periodicity of the input signal by a voicing measure, derived from certain parameters that are correlated to signal periodicity and computed from the input signal, PW, SEW and REW over the predetermined interval.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. A system as recited in claim 1, comprising a decoder using a voicing measure for regenerating the phase spectra of the SEW and REW components for every sub-interval.
  - 3. A system as recited in claim 2, wherein said decoder voicing measure is used for improved quantization of the SEW magnitude component by selecting the codebook used for quantization from a set of codebooks based on the degree of periodicity as represented by the voicing measure.
  - 4. A system as recited in claim 2, further comprising:
5. A system as recited in claim 4, wherein a set of the neural network input parameters for voicing measure determination comprises the SEW variance, root-mean square value of SEW, and open loop pitch gain.
6. A system as recited in claim 4, wherein an auxiliary set of the neural network input parameters comprises a relative power level of the input signal, root-mean-square value of the REW component, a measure of peakiness of the prediction residual over a pitch cycle and the normalized autocorrelation coefficient of the input signal at unit lag.
7. A system as recited in claim 1, wherein said signal processor performs an error concealment procedure for the voicing measure to increase the robustness of the speech codec in the presence of transmission errors by computing a VAD likelihood measure based on previously received VAD flags, comprising:
- a state machine relying on the correlation between the voicing measure and the VAD likelihood measure; and
  
  a second state machine relying on the correlation between the root-mean-square value of SEW in a predetermined low frequency band and the voicing measure.

8. A frequency domain interpolative coding system for low bit-rate coding of speech signals, comprising:
- a linear prediction (LP) front end responsive to an input signal, providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
  
  an open loop pitch estimator responsive to said LP residual signal, a pitch quantizer, and a pitch interpolator yielding a pitch contour within the predetermined interval;
  
  a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval; and
  
  said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
  
  a separation of the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component using a low pass filter along every pitch harmonic track;
  
  a characterization of the degree of periodicity of the input signal by a voicing measure, derived from certain parameters that are correlated to signal periodicity and computed from the input signal, PW, SEW and REW over the predetermined interval;
  
  a representation of the SEW component in spectral magnitude-phase form and transmission of only the spectral magnitude information of the SEW component; and
  
  a reconstruction of the SEW and REW phase components at the decoder using the received SEW and REW magnitude components, the voicing measure, and pitch frequency contour information.
- View Dependent Claims (9, 10, 11, 12)
- - 9. A system as recited in claim 8, comprising a decoder using a voicing measure processing the input parameters with a neural network.
  - 10. A system as recited in claim 9, comprising a state machine for performing error concealment for voicing measure at the decoder.
  - 11. A system as recited in claim 10, wherein said signal processor correlates the voicing measure and a voice activity detection (VAD) likelihood measure derived from previously received VAD flags for error concealment of the voicing measure.
  - 12. A system as recited in claim 10, wherein said signal processor correlates the voicing measure and the root-mean-square value of SEW in a predetermined low frequency band for error concealment of the voicing measure.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hughes Network Systems LLC (Echostar Corporation)
Original Assignee
Hughes Electronics Corporation (AT&T, Inc.)
Inventors
Swaminathan, Kumar, Zakaria, Gaguk, Udaya Bhaskar, Bangalore R., Nandkumar, Srinivas
Primary Examiner(s)
CHAWAN, VIJAY B

Application Number

US09/542,390
Time in Patent Office

1,407 Days
Field of Search

704/208, 704/219, 704/223, 704/220, 704/206, 704/222, 704/230, 704/205, 704/207, 704/225, 704/226, 704/265
US Class Current

704/265
CPC Class Codes

G10L 19/097 using prototype waveform de...

G10L 2025/783 based on threshold decision

Voicing measure as an estimate of signal periodicity for a frequency domain interpolative speech codec system

First Claim

13 Assignments

0 Petitions

Accused Products

Abstract

Citations

12 Claims

Specification

Solutions

Use Cases

Quick Links

Voicing measure as an estimate of signal periodicity for a frequency domain interpolative speech codec system

First Claim

13 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

12 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links