Comb filter speech coding with preselected excitation code vectors
First Claim
1. A speech encoder comprising:
means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
means for determining a pitch period of said speech signal from said weighted speech samples;
excitation codebook means for storing excitation code vectors;
first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;
a comb filter for filtering said candidate code vectors, said comb filter having a delay time set equal to said pitch period;
second selector means for selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion;
gain codebook means having a plurality of gain code vectors; and
gain calculator means, responsive to the comb filtered excitation code vector selected by the second selector means, for selecting one of said gain code vectors from said gain codebook means so that the selected gain code vector further minimizes distortion.
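The claim does not say how the pitch period is determined from the weighted speech samples. As a minimal sketch only, the block below assumes a plain autocorrelation search over a lag range. The function name `estimate_pitch_period` and the parameters `min_lag` and `max_lag` are hypothetical and do not come from the patent.

```python
def estimate_pitch_period(weighted_samples, min_lag, max_lag):
    """Return the lag (in samples) with the largest autocorrelation.

    Assumption: an unnormalized autocorrelation peak search stands in for
    the pitch determination step; the patent text shown here does not
    specify the method.
    """
    def autocorr(lag):
        # Correlate the signal with a copy of itself delayed by `lag`.
        return sum(
            weighted_samples[n] * weighted_samples[n - lag]
            for n in range(lag, len(weighted_samples))
        )

    return max(range(min_lag, max_lag + 1), key=autocorr)


# A pulse train that repeats every 5 samples.
pulse_train = [1.0, 0.0, 0.0, 0.0, 0.0] * 4
pitch = estimate_pitch_period(pulse_train, min_lag=2, max_lag=12)
```

For the pulse train above the search returns a lag of 5, which would then serve as the comb filter delay.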
Abstract
In a code-excited speech encoder, an input speech signal is segmented into speech samples at first intervals, and a spectral parameter representing the characteristic spectral feature of the signal is derived from the speech samples at second intervals longer than the first intervals. Each speech sample is weighted with the spectral parameter to produce weighted speech samples, and the pitch period of the speech signal is determined from the weighted speech samples. A predetermined number of excitation code vectors having smaller amounts of distortion are selected from excitation codebooks as candidate code vectors. The candidate code vectors are comb-filtered with a delay time set equal to the pitch period, and the filtered code vector having minimum distortion is selected. In response to the selected filtered code vector, a gain code vector is selected from a gain codebook so that distortion is further minimized. Index signals representing the spectral parameter, the pitch period, and the selected excitation and gain code vectors are multiplexed for transmission or storage.
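The search order described in the abstract (preselect candidates, comb-filter only those, pick the best filtered vector, then pick a gain) can be illustrated with a short sketch. Several things here are assumptions rather than the patent's method: the comb filter is taken to be a feedback filter y[n] = x[n] + w·y[n − T] with a fixed weight, distortion is a plain squared error against a target vector (no synthesis or weighting filter), and gains are scalars. All function and parameter names are hypothetical.

```python
def comb_filter(vec, pitch_period, weight=0.5):
    """Feedback comb filter: y[n] = x[n] + weight * y[n - pitch_period]."""
    out = []
    for n, x in enumerate(vec):
        delayed = out[n - pitch_period] if n >= pitch_period else 0.0
        out.append(x + weight * delayed)
    return out


def distortion(target, vec, gain=1.0):
    """Squared error between the target and the gain-scaled vector."""
    return sum((t - gain * v) ** 2 for t, v in zip(target, vec))


def encode_subframe(target, codebook, gains, pitch_period, n_candidates=2):
    # First selector: keep the n_candidates unfiltered vectors with the
    # smallest distortion.
    ranked = sorted(range(len(codebook)),
                    key=lambda i: distortion(target, codebook[i]))
    candidates = ranked[:n_candidates]

    # Comb-filter only the preselected candidates (delay = pitch period).
    filtered = {i: comb_filter(codebook[i], pitch_period) for i in candidates}

    # Second selector: the filtered candidate with minimum distortion.
    best = min(candidates, key=lambda i: distortion(target, filtered[i]))

    # Gain selection for the chosen filtered vector.
    best_gain = min(range(len(gains)),
                    key=lambda k: distortion(target, filtered[best], gains[k]))
    return best, best_gain


codebook = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]]
indices = encode_subframe([2, 0, 1, 0], codebook, [0.5, 1.0, 2.0], pitch_period=2)
```

Filtering only the preselected candidates keeps the number of comb filter passes at `n_candidates` instead of the full codebook size, which is the apparent motivation for the two-stage selection.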
53 Citations
27 Claims
1. A speech encoder comprising:
means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
means for determining a pitch period of said speech signal from said weighted speech samples;
excitation codebook means for storing excitation code vectors;
first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;
a comb filter for filtering said candidate code vectors, said comb filter having a delay time set equal to said pitch period;
second selector means for selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion;
gain codebook means having a plurality of gain code vectors; and
gain calculator means, responsive to the comb filtered excitation code vector selected by the second selector means, for selecting one of said gain code vectors from said gain codebook means so that the selected gain code vector further minimizes distortion.
Dependent claims: 2, 3, 16, 22

4. A speech encoder comprising:
means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
means for determining a pitch period of said speech signal from said weighted speech samples;
excitation codebook means for storing excitation code vectors;
first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate vectors from said excitation codebook means according to said pitch period;
a comb filter for filtering said candidate code vectors and for producing comb filtered code vectors, said comb filter having a delay time set equal to said pitch period;
gain codebook means having a plurality of gain code vectors;
gain calculator means, responsive to each of the comb filtered excitation code vectors selected for minimum distortion, for selecting a gain code vector corresponding to each of the comb filtered excitation code vectors from said gain codebook means so that the selected gain code vector minimizes distortion; and
second selector means for selecting one of said candidate code vectors from the first selector means and selecting one of the gain code vectors selected by the gain calculator means so that the selected candidate code vector and the selected gain code vector further minimize distortion.
Dependent claims: 5, 6, 17, 23
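Claim 4 differs from claim 1 in the order of the search: a gain code vector is chosen for every comb filtered candidate first, and the candidate and gain pair is chosen last. The sketch below illustrates that ordering under simplifying assumptions (feedback comb filter with a fixed weight, scalar gains, squared error against a target vector). The names `joint_search`, `sq_error` and `comb_filter` are hypothetical.

```python
def comb_filter(vec, pitch_period, weight=0.5):
    """Feedback comb filter: y[n] = x[n] + weight * y[n - pitch_period]."""
    out = []
    for n, x in enumerate(vec):
        delayed = out[n - pitch_period] if n >= pitch_period else 0.0
        out.append(x + weight * delayed)
    return out


def sq_error(target, vec, gain):
    """Squared error between the target and the gain-scaled vector."""
    return sum((t - gain * v) ** 2 for t, v in zip(target, vec))


def joint_search(target, candidates, gains, pitch_period):
    """Return (candidate index, gain index, error) of the best pair."""
    best = None
    for j, cand in enumerate(candidates):
        filtered = comb_filter(cand, pitch_period)
        # Gain calculator: best gain for this filtered candidate.
        errors = [sq_error(target, filtered, g) for g in gains]
        k = min(range(len(gains)), key=errors.__getitem__)
        # Second selector: keep the best (candidate, gain) pair so far.
        if best is None or errors[k] < best[2]:
            best = (j, k, errors[k])
    return best


result = joint_search([0, 3, 0, 1.5], [[1, 0, 0, 0], [0, 1, 0, 0]],
                      [1.0, 3.0], pitch_period=2)
```

Deferring the candidate choice until each candidate has its own best gain avoids discarding a candidate that only looks poor at unit gain.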

7. A speech encoder comprising:
means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
means for determining a pitch period of said speech signal from said weighted speech samples;
excitation codebook means having excitation code vectors;
first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;
gain codebook means having a plurality of gain code vectors;
a comb filter for filtering said candidate code vectors with a delay time equal to said pitch period and with a plurality of weighting functions respectively set equal to gain code vectors stored in said gain codebook means and for producing a plurality of sets of filtered excitation code vectors, said sets corresponding respectively to said candidate code vectors;
gain calculator means, responsive to the filtered excitation code vectors of each set and for selecting, for each set, a gain code vector from the gain code vectors stored in said gain codebook means so that each of the selected gain code vectors minimizes distortion; and
second selector means for selecting one of said candidate code vectors selected by the first selector means and one of the gain code vectors selected by the gain calculator means so that the selected candidate code vector and the selected gain code vector further minimize distortion.
Dependent claims: 8, 9, 18, 24
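In claim 7 the comb filter weighting is itself taken from the gain codebook, so each candidate yields a set of filtered vectors, one per gain code vector. The following sketch shows that structure only. It assumes each gain code vector reduces to a single scalar used as the comb filter feedback weight, and it measures distortion as a squared error against a target vector; neither detail is stated in the claim. All names are hypothetical.

```python
def comb_filter(vec, pitch_period, weight):
    """Feedback comb filter: y[n] = x[n] + weight * y[n - pitch_period]."""
    out = []
    for n, x in enumerate(vec):
        delayed = out[n - pitch_period] if n >= pitch_period else 0.0
        out.append(x + weight * delayed)
    return out


def sq_error(target, vec):
    """Squared error between the target and a filtered vector."""
    return sum((t - v) ** 2 for t, v in zip(target, vec))


def search_with_gain_weighted_comb(target, candidates, gain_codebook, pitch_period):
    """Return (candidate index, gain index) minimizing the squared error."""
    per_candidate = []
    for j, cand in enumerate(candidates):
        # One filtered vector per gain code vector (assumed scalar weight).
        filtered_set = [comb_filter(cand, pitch_period, g) for g in gain_codebook]
        errors = [sq_error(target, f) for f in filtered_set]
        # Gain calculator: best gain within this candidate's set.
        k = min(range(len(errors)), key=errors.__getitem__)
        per_candidate.append((errors[k], j, k))
    # Second selector: best (candidate, gain) pair across all sets.
    _, j, k = min(per_candidate)
    return j, k


choice = search_with_gain_weighted_comb(
    [0, 2, 0, 1], [[1, 0, 0, 0], [0, 2, 0, 0]], [0.25, 0.5], pitch_period=2)
```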

10. A method for encoding a speech signal, comprising the steps of:
a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
b) deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
d) determining a pitch period of said speech signal from said weighted speech samples;
e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
f) comb filtering said candidate code vectors with a delay time equal to said pitch period;
g) selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion; and
h) calculating the selected filtered excitation code vector for minimum distortions and determining a gain code vector so that the gain code vector further minimizes distortion, using either a first equation;
##EQU9##
where hw(n) is an impulse response; β'k is the gain of a k-th code vector; q(n) is a pitch index indicating the pitch period; C1jz and C2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
##EQU10##
Dependent claims: 11, 19, 25

12. A method for encoding a speech signal, comprising the steps of:
a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
b) deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
d) determining a pitch period of said speech signal from said weighted speech samples;
e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
f) comb filtering said candidate code vectors with a delay time equal to said pitch period;
g) calculating each of the filtered excitation code vectors for minimum distortion and, selecting a gain code vector from a plurality of gain code vectors so that the selected gain code vector minimizes distortion; and
h) selecting one of said candidate code vectors so that the selected candidate vector and the selected gain code vector further minimize distortion, using either a first equation;
##EQU11##
where hw(n) is an impulse response; β'k is the gain of a k-th code vector; q(n) is a pitch index indicating the pitch period; C1jz and C2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
##EQU12##
Dependent claims: 13, 20, 26

14. A method for encoding a speech signal, comprising the steps of:
a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
b) deriving a spectral parameter from said speech samples at intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
d) determining a pitch period of said speech signal from said weighted speech samples;
e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
f) comb filtering said candidate code vectors with a delay time equal to said pitch period and with a plurality of weighting functions respectively set equal to gain code vectors stored in a gain codebook and producing a plurality of sets of filtered excitation code vectors, said sets corresponding respectively to said candidate code vectors;
g) calculating the filtered excitation code vectors of each set for minimum distortion and, selecting, for each set, a gain code vector from the gain code vectors stored in said gain codebook so that each of the selected gain code vectors minimizes distortion, using either a first equation;
##EQU13##
where hw(n) is an impulse response; β'k is the gain of a k-th code vector; g(n) is a pitch index indicating the pitch period; C1jz and C2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
##EQU14##
h) selecting one of said candidate code vectors selected by the step (e) and one of the gain code vectors selected by the step (g) so that the selected candidate code vector and the selected gain code vector further minimize distortion.
Dependent claims: 15, 21, 27
Specification