Low bit rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization
First Claim
1. A system for coding speech, the speech being represented as plural speech samples segregated into a frame, the frame being formed of a plurality of subframes, wherein linear predictive coding (LPC) analysis and quantization of the speech samples in the frame are performed to determine an LPC residual signal, the system comprising:
- lag means for estimating an unquantized pitch lag value within a predetermined minimum-allowed pitch lag and a predetermined maximum-allowed pitch lag for each subframe within the frame, including;
means for constructing an LPC residual signal vector for the frame of speech,means for estimating an open-loop pitch lag value based on the LPC residual signal vector, the open-loop pitch lag value lying within the predetermined minimum-allowed pitch lag and the predetermined maximum-allowed pitch lag;
a synthesis filter for filtering the LPC residual signal vector to produce a target signal;
means for generating a residual-based pitch contribution vector for each subframe within the frame;
means for perceptually filtering each residual-based pitch contribution vector to obtain a perceptually-filtered residual-based pitch contribution vector; and
means for estimating the unquantized pitch lag value for each subframe by considering a plurality of pitch lag values that are located around the open-loop pitch lag value within a subset of values that are within the predetermined minimum and maximum-allowed pitch lags and determining which corresponds to a perceptually-filtered residual-based pitch contribution vector that is closest to the target signal;
means for obtaining an unquantized pitch lag vector comprising the unquantized pitch lag values for each subframe within the frame;
a vector quantizer for quantizing the unquantized pitch lag vector to generate a quantized pitch lag vector containing quantized pitch lag values corresponding to each subframe;
means for determining an excitation-based pitch contribution vector for a current subframe based on the corresponding quantized pitch lag vector;
codebook means for generating an excitation signal representative of the speech samples of the current subframe; and
means for applying the excitation signal of each current subframe to subsequent subframes to provide coded speech for the frame.
11 Assignments
0 Petitions
Accused Products
Abstract
A pitch lag coding device and method using interframe correlation inherent in pitch lag values to reduce coding bit requirements. A pitch lag value is extracted for a given speech frame, and then refined for each subframe. For every speech frame having N samples of speech, LPC analysis and vector quantization are performed for the whole coding frame. The LPC residual obtained for each frame is then processed such that pitch lag values for all subframes within the coding frame are analyzed concurrently. The remaining coding parameters, i.e., the codebook search, gain parameters, and excitation signal, are then analyzed sequentially according to their respective subframes.
45 Citations
3 Claims
-
1. A system for coding speech, the speech being represented as plural speech samples segregated into a frame, the frame being formed of a plurality of subframes, wherein linear predictive coding (LPC) analysis and quantization of the speech samples in the frame are performed to determine an LPC residual signal, the system comprising:
- lag means for estimating an unquantized pitch lag value within a predetermined minimum-allowed pitch lag and a predetermined maximum-allowed pitch lag for each subframe within the frame, including;
means for constructing an LPC residual signal vector for the frame of speech, means for estimating an open-loop pitch lag value based on the LPC residual signal vector, the open-loop pitch lag value lying within the predetermined minimum-allowed pitch lag and the predetermined maximum-allowed pitch lag; a synthesis filter for filtering the LPC residual signal vector to produce a target signal; means for generating a residual-based pitch contribution vector for each subframe within the frame; means for perceptually filtering each residual-based pitch contribution vector to obtain a perceptually-filtered residual-based pitch contribution vector; and means for estimating the unquantized pitch lag value for each subframe by considering a plurality of pitch lag values that are located around the open-loop pitch lag value within a subset of values that are within the predetermined minimum and maximum-allowed pitch lags and determining which corresponds to a perceptually-filtered residual-based pitch contribution vector that is closest to the target signal; means for obtaining an unquantized pitch lag vector comprising the unquantized pitch lag values for each subframe within the frame; a vector quantizer for quantizing the unquantized pitch lag vector to generate a quantized pitch lag vector containing quantized pitch lag values corresponding to each subframe; means for determining an excitation-based pitch contribution vector for a current subframe based on the corresponding quantized pitch lag vector; codebook means for generating an excitation signal representative of the speech samples of the current subframe; and means for applying the excitation signal of each current subframe to subsequent subframes to provide coded speech for the frame. - View Dependent Claims (2, 3)
- lag means for estimating an unquantized pitch lag value within a predetermined minimum-allowed pitch lag and a predetermined maximum-allowed pitch lag for each subframe within the frame, including;
Specification