Spectral magnitude modeling and quantization in a frequency domain interpolative speech codec system
First Claim
Patent Images
1. A frequency domain interpolative coding system for magnitude modeling and quantization of prototype waveforms, comprising:
- a linear prediction (LP) front end responsive to an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
an open loop pitch estimator responsive to said LP residual signal;
a pitch quantizer;
a pitch interpolator;
said open loop pitch estimator, said pitch quantizer, and said pitch interpolator yielding a pitch contour within the predetermined interval;
a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval;
said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
a low-pass filter for separating the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component along every pitch harmonic track;
a voicing measure for characterizing the degree of signal periodicity over the predetermined interval, derived from the input signal, PW, SEW and REW characteristics; and
a quantizer for quantization of the variable dimension SEW magnitude vector by a hierarchical approach, comprising fixed dimension vector quantizers (VQs) for each predetermined interval.
13 Assignments
0 Petitions
Accused Products
Abstract
Encoding of prototype waveform components applicable to telecommunication systems provides improved voice quality enabling a dual-channel mode of operation which permits more users to communicate over the same physical channel. A prototype word (PW) gain is vector quantized using a vector quantizer (VQ) that explicitly populates a codebook by representative steady state and transient vectors of PW gain for tracking the abrupt variations in speech levels during onsets and other non-stationary events, while maintaining the accuracy of the speech level during stationary conditions.
97 Citations
19 Claims
-
1. A frequency domain interpolative coding system for magnitude modeling and quantization of prototype waveforms, comprising:
-
a linear prediction (LP) front end responsive to an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
an open loop pitch estimator responsive to said LP residual signal;
a pitch quantizer;
a pitch interpolator;
said open loop pitch estimator, said pitch quantizer, and said pitch interpolator yielding a pitch contour within the predetermined interval;
a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval;
said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
a low-pass filter for separating the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component along every pitch harmonic track;
a voicing measure for characterizing the degree of signal periodicity over the predetermined interval, derived from the input signal, PW, SEW and REW characteristics; and
a quantizer for quantization of the variable dimension SEW magnitude vector by a hierarchical approach, comprising fixed dimension vector quantizers (VQs) for each predetermined interval. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of joint vector quantization of rms and shape components, comprising:
-
computation of a fixed dimension SEW mean vector by averaging the SEW harmonic magnitudes over a set of sub-bands that span the bandwidth of the input signal;
computation of a variable dimension SEW deviation vector by subtracting the SEW mean from the SEW magnitude vector;
determination of a fixed dimension SEW deviation subvector from the variable dimension SEW deviation vector based on a dynamic frequency selection approach which uses the quantized LP parameter based spectral characteristics of the input signal during the current interval; and
joint quantization of the rms and shape components of the fixed dimension SEW deviation subvector using a frequency weighted distortion measure.
-
-
8. A method of fixed dimension quantization of the mean component of the SEW magnitude vector, comprising:
-
determination of the full dimension SEW deviation vector from the quantized fixed dimension SEW deviation subvector by padding with zero values or the output of the SEW shape predictor;
computation of a full dimension target SEW vector by sub-band computing the difference between the SEW magnitude vector and the reconstructed full dimension SEW deviation vector;
quantization of the fixed dimension SEW mean vector, so that when converted to a full dimension vector by a piecewise-constant construction, a frequency weighted distortion measure is minimized with respect to the target SEW vector; and
selection of a SEW mean codebook from two possible codebooks based on the degree of voicing using the encoded voicing measure. - View Dependent Claims (9)
-
-
10. A frequency domain interpolative coding system for magnitude modeling and quantization of prototype waveforms, comprising:
-
a linear prediction (LP) front end responsive to an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
an open loop pitch estimator responsive to said LP residual signal;
a pitch quantizer;
a pitch interpolator;
said open loop pitch estimator, said pitch quantizer, and said pitch interpolator yielding a pitch contour within the predetermined interval;
a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval;
said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
a low-pass filter for separating the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component along every pitch harmonic track;
a voicing measure for characterizing the degree of signal periodicity over the predetermined interval, derived from input signal, PW, SEW and REW characteristics; and
a fixed dimension vector quantizer for quantization of a variable dimension REW magnitude component using a hierarchical approach for each predetermined interval. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A frequency domain interpolative coding method for magnitude modeling and quantization of prototype waveforms, comprising:
-
a linear prediction (LP) front end responsive to an input signal providing LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal;
an open loop pitch estimator responsive to said LP residual signal;
a pitch quantizer;
a pitch interpolator;
said open loop pitch estimator, said pitch quantizer, and said pitch interpolator yielding a pitch contour within the predetermined interval;
a signal processor responsive to said LP residual signal and the pitch contour for extracting a prototype waveform (PW) for a number of equal sub-intervals within the predetermined interval;
said signal processor computing a PW gain for generating a normalized PW for each sub-interval and a PW gain vector for the predetermined interval;
a low-pass filter for separating the normalized PW into a slowly evolving waveform (SEW) component and a rapidly evolving waveform (REW) component along every pitch harmonic track;
a voicing measure for characterizing the degree of signal periodicity over the predetermined interval, derived from the input signal, PW, SEW and REW characteristics; and
quantization of the variable dimension SEW magnitude vector by a hierarchical approach, using fixed dimension vector quantizers (VQs) for each predetermined interval. - View Dependent Claims (17, 18, 19)
-
Specification