Comb filter speech coding with preselected excitation code vectors

US 5,797,119 A
Filed: 02/03/1997
Issued: 08/18/1998
Est. Priority Date: 07/29/1993
Status: Expired due to Term

First Claim

Patent Images

1. A speech encoder comprising:

means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;

means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;

means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;

means for determining a pitch period of said speech signal from said weighted speech samples;

excitation codebook means for storing excitation code vectors;

first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;

a comb filter for filtering said candidate code vectors, said comb filter having a delay time set equal to said pitch period;

second selector means for selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion;

gain codebook means having a plurality of gain code vectors; and

gain calculator means, responsive to the comb filtered excitation code vector selected by the second selector means, for selecting one of said gain code vectors from said gain codebook means so that the selected gain code vector further minimizes distortion.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

In a code excited speech encoder, an input speech signal is segmented into speech samples at first intervals and a spectral parameter is derived from the speech samples that occur at second intervals longer than the first intervals, the spectral parameter representing the characteristic spectral feature. Each speech sample is weighted with the spectral parameter for producing weighted speech samples. The pitch period of the speech signal is determined from the weighted speech samples. A predetermined number of excitation code vectors having smaller amounts of distortion are selected from excitation codebooks as candidate code vectors. The candidate vectors are comb-filtered with a delay time set equal to the pitch period. One of the filtered code vectors having a minimum distortion is selected. The selected filtered code vector is calculated for minimum distortion and, in response thereto, a gain code vector is selected from a gain codebook. Index signals representing the spectral parameter, the pitch period, the selected excitation and gain code vectors are multiplexed for transmission or storage.

53 Citations

View as Search Results

27 Claims

1. A speech encoder comprising:
- means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  means for determining a pitch period of said speech signal from said weighted speech samples;
  
  excitation codebook means for storing excitation code vectors;
  
  first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;
  
  a comb filter for filtering said candidate code vectors, said comb filter having a delay time set equal to said pitch period;
  
  second selector means for selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion;
  
  gain codebook means having a plurality of gain code vectors; and
  
  gain calculator means, responsive to the comb filtered excitation code vector selected by the second selector means, for selecting one of said gain code vectors from said gain codebook means so that the selected gain code vector further minimizes distortion.
- View Dependent Claims (2, 3, 16, 22)
- - 2. A speech encoder as claimed in claim 1, wherein said comb filter is a moving average comb filter.
  - 3. A speech encoder as claimed in claim 1, further comprising a multiplexer for multiplexing signals representative of said spectra parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 16. The speech encoder of claim 1 further comprising a mode classifier means wherein said mode classifier means, responsive to results of the means for deriving a spectral parameter, produces a mode classifier signal of one of a first and second level, and said first selector means selects said excitation code vectors in accordance with a first equation when said mode classifier signal is of the first level and selects said excitation vectors in accordance with a second equation when said mode classifier signal is of the second level.
  - 22. The speech encoder of claim 16, wherein when said mode classifier signal is of the first level, said gain calculator means selects said gain code vector to minimize distortion D_k according to the formula:
    - ##EQU15## where h_w (n) is an impulse response;
      
      β
      
      '"'"'_k is the gain of a k-th code vector;
      
      q(n) is a pitch index indicating the pitch period;
      
      C_1jz and C_2iz are the excitation code vectors of a first and second vector stage, respectively;
      
      g'"'"'_1k and g'"'"'_2k are gains of the k-th excitation code vectors of the first and second vector stages, respectively; and
      
      X'"'"'_w (n) is an error-corrected sample from said weighted speech samples; and
      
      wherein when said mode classifier signal is of the second level, said gain calculator means selects said gain code vectors to minimize distortion D_k according to the formula;
      
      ##EQU16##

4. A speech encoder comprising:
- means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  means for determining a pitch period of said speech signal from said weighted speech samples;
  
  excitation codebook means for storing excitation code vectors;
  
  first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate vectors from said excitation codebook means according to said pitch period;
  
  a comb filter for filtering said candidate code vectors and for producing comb filtered code vectors, said comb filter having a delay time set equal to said pitch period;
  
  gain codebook means having a plurality of gain code vectors;
  
  gain calculator means, responsive to each of the comb filtered excitation code vectors selected for minimum distortion, for selecting a gain code vectors corresponding to each of the comb filtered excitation code vector from said gain codebook means so that the selected gain code vector minimizes distortion; and
  
  second selector means for selecting one of said candidate code vectors from the first selector means and selecting one of the gain code vectors selected by the gain calculator means so that the selected candidate code vector and the selected gain code vectors further minimize distortion.
- View Dependent Claims (5, 6, 17, 23)
- - 5. A speech encoder as claimed in claim 4, wherein said comb filter is a moving average comb filter.
  - 6. A speech encoder as claimed in claim 4, further comprising a multiplexer for multiplexing signals representative of said spectra parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 17. The speech encoder of claim 4 further comprising a mode classifier means wherein said mode classifier means, responsive to results of the means for deriving a spectral parameter, produces a mode classifier signal of one of a first and second level, and said first selector means selects said excitation code vectors in accordance with a first equation when said mode classifier signal is of the first level and selects said excitation vectors in accordance with a second equation when said mode classifier signal is of the second level.
  - 23. The speech encoder of claim 17, wherein when said mode classifier signal is of the first level, said gain calculator means selects said gain code vector to minimize distortion D_k according to the formula:
    - ##EQU17## where h_w (n) is an impulse response;
      
      β
      
      '"'"'_k is the gain of a k-th code vector;
      
      q(n) is a pitch index indicating the pitch period;
      
      C_1jz and C_2iz are the excitation code vectors of a first and second vector stage, respectively;
      
      g'"'"'_1k and g'"'"'_2k are gains of the k-th excitation code vectors of the first and second vector stages, respectively; and
      
      X'"'"'_w (n) is an error-corrected sample from said weighted speech samples; and
      
      wherein when said mode classifier signal is of the second level, said gain calculator means selects said gain code vectors to minimize distortion D_k according to the formula;
      
      ##EQU18##

7. A speech encoder comprising:
- means for segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  means for deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  means for weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  means for determining a pitch period of said speech signal from said weighted speech samples;
  
  excitation codebook means having excitation code vectors;
  
  first selector means for selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors from said excitation codebook means according to said pitch period;
  
  gain codebook means having a plurality of gain code vectors;
  
  a comb filter for filtering said candidate code vectors with a delay time equal to said pitch period and with a plurality of weighting functions respectively set equal to gain code vectors stored in said gain codebook means and for producing a plurality of sets of filtered excitation code vectors, said sets corresponding respectively to said candidate code vectors;
  
  gain calculator means, responsive to the filtered excitation code vectors of each set and for selecting, for each set, a gain code vectors from the gain code vectors stored in said gain codebook means so that each of the selected gain code vectors minimizes distortion; and
  
  second selector means for selecting one of said candidate code vectors selected by the first selector means and one of the gain code vectors selected by the gain calculator means so that the selected candidate code vector and the selected gain code vector further minimize distortion.
- View Dependent Claims (8, 9, 18, 24)
- - 8. A speech encoder as claimed in claim 7, wherein said comb filter is a moving average comb filter.
  - 9. A speech encoder as claimed in claim 7, further comprising a multiplexer for multiplexing signals representative of said spectra parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 18. The speech encoder of claim 7 further comprising a mode classifier means wherein said mode classifier means, responsive to results of the means for deriving a spectral parameter, produces a mode classifier signal of one of a first and second level, and said first selector means selects said excitation code vectors in accordance with a first equation when said mode classifier signal is of the first level and selects said excitation vectors in accordance with a second equation when said mode classifier signal is of the second level.
  - 24. The speech encoder of claim 18, wherein when said mode classifier signal is of the first level, said gain calculator means selects said gain code vector to minimize distortion D_k according to the formula:
    - ##EQU19## where h_w (n) is an impulse response;
      
      β
      
      '"'"'_k is the gain of a k-th code vector;
      
      q(n) is a pitch index indicating the pitch period;
      
      C_1j and C_2iz are the excitation code vectors of a first and second vector stage, respectively;
      
      g'"'"'_1k and g'"'"'_2k are gains of the k-th excitation code vectors of the first and second vector stages, respectively; and
      
      X'"'"'_w (n) is an error-corrected sample from said weighted speech samples; and
      
      wherein when said mode classifier signal is of the second level, said gain calculator means selects said gain code vectors to minimize distortion D_k according to the formula;
      
      ##EQU20##

10. A method for encoding a speech signal, comprising the steps of:
- a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  b) deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  d) determining a pitch period of said speech signal from said weighted speech samples;
  
  e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
  
  f) comb filtering said candidate code vectors with a delay time equal to said pitch period;
  
  g) selecting one of said comb filtered excitation code vectors so that the selected excitation code vector minimizes distortion; and
  
  h) calculating the selected filtered excitation code vector for minimum distortions and determining a gain code vector so that the gain code vector further minimizes distortion, using either a first equation;
  
  ##EQU9## where h_w (n) is an impulse response;
  
  β
  
  '"'"'_k is the gain of a k-th code vector;
  
  q(n) is a pitch index indicating the pitch period;
  
  C_1jz and C_2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
  
  ##EQU10##
- View Dependent Claims (11, 19, 25)
- - 11. A method as claimed in claim 10, further comprising the step of multiplexing signals representative of said spectral parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 19. The method for encoding a speech signal according to claim 10 further comprising the step of classifying a mode signal in one of a first and second level based on results of said step for deriving a spectral parameter, and wherein in said step for selecting excitation code vectors, said selection is based on the first equation when said mode signal is said first level and said selection is based on the second equation when said mode signal is said second level.
  - 25. A method for encoding a speech signal according to claim 19, wherein when said mode classifier signal is of the first level, the determination to minimize distortion of said step (h) is determined according to the first equation;
    - andwherein when said mode classifier signal is of the second level, the determination to minimize distortion in said step (h) is determined according to the second equation.

12. A method for encoding a speech signal, comprising the steps of:
- a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  b) deriving a spectral parameter from said speech samples at second intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  d) determining a pitch period of said speech signal from said weighted speech samples;
  
  e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
  
  f) comb filtering said candidate code vectors with a delay time equal to said pitch period;
  
  g) calculating each of the filtered excitation code vectors for minimum distortion and, selecting a gain code vector from a plurality of gain code vectors so that the selected gain code vector minimizes distortion; and
  
  h) selecting one of said candidate code vectors so that the selected candidate vector and the selected gain code vector further minimize distortion, using either a first equation;
  
  ##EQU11## where h_w (n) is an impulse response;
  
  β
  
  '"'"'_k is the gain of a k-th code vector;
  
  q(n) is a pitch index indicating the pitch period;
  
  C_1jz and C_2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
  
  ##EQU12##
- View Dependent Claims (13, 20, 26)
- - 13. A method as claimed in claim 12, further comprising the step of multiplexing signals representative of said spectral parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 20. The method for encoding a speech signal according to claim 12 further comprising the step of classifying a mode signal in one of a first and second level based on results of said step for deriving a spectral parameter, and wherein in said step for selecting excitation code vectors, said selection is based on the first equation when said mode signal is said first level and said selection is based on the second equation when said mode signal is said second level.
  - 26. A method for encoding a speech signal according to claim 20, wherein when said mode classifier signal is of the first level, the determination to minimize distortion of said step (h) is determined according to the first equation;
    - andwherein when said mode classifier signal is of the second level, the determination to minimize distortion in said step (h) is determined according to the second equation.

14. A method for encoding a speech signal, comprising the steps of:
- a) segmenting an input speech signal having a characteristic spectral feature into speech samples at first intervals;
  
  b) deriving a spectral parameter from said speech samples at intervals longer than said first intervals, and wherein said spectral parameter represents said characteristic spectral feature;
  
  c) weighting each of said speech samples with said spectral parameter for producing weighted speech samples;
  
  d) determining a pitch period of said speech signal from said weighted speech samples;
  
  e) selecting a predetermined number of excitation code vectors having smaller amounts of distortion, relative to other code vectors, as candidate code vectors according to said pitch period from a plurality of excitation codebooks, each codebook having a plurality of excitation code vectors;
  
  f) comb filtering said candidate code vectors with a delay time equal to said pitch period and with a plurality of weighting functions respectively get equal to gain code vectors stored in a gain codebook and producing a plurality of sets of filtered excitation code vectors., said sets corresponding respectively to said candidate code vectors;
  
  g) calculating the filtered excitation code vectors of each set for minimum distortion and, selecting, for each set, a gain code vector from the gain code vectors stored in said gain codebook so that each of the selected gain code vectors minimizes distortion, using either a first equation;
  
  ##EQU13## where h_w (n) is an impulse response;
  
  β
  
  '"'"'_k is the gain of a k-th code vector;
  
  g(n) is a pitch index indicating the pitch period;
  
  C_1jz and C_2jz are the excitation code vectors of a first and second vector stage, respectively, or a second equation;
  
  ##EQU14## h) selecting one of said candidate code vectors selected by the step (e) and one of the gain code vectors selected by the step (g) so that the selected candidate code vector and the selected gain code vector further minimize distortion.
- View Dependent Claims (15, 21, 27)
- - 15. A method as claimed in claim 14, further comprising the step of multiplexing signals representative of said spectral parameter, said pitch period, said selected excitation code vector and said selected gain code vector, respectively, into a composite signal.
  - 21. The method for encoding a speech signal according to claim 14 further comprising the step of classifying a mode signal in one of a first and second level based on results of said step for deriving a spectral parameter, and wherein in said step for selecting excitation code vectors, said selection is based on the first equation when said mode signal is said first level and said selection is based on the second equation when said mode signal is said second level.
  - 27. A method for encoding a speech signal according to claim 21, wherein when said mode classifier signal is of the first level, said selection in said step (h) to minimize distortion is selected according to the first equation, andwherein when said mode classifier signal is of the second level, said selection in said step (h) to minimize distortion is selected according to the second equation.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
NEC Corporation
Original Assignee
NEC Corporation
Inventors
Ozawa, Kazunori
Primary Examiner(s)
Hudspeth, David R.
Assistant Examiner(s)
RICHARDSON, SCO

Application Number

US08/791,547
Time in Patent Office

561 Days
Field of Search

395/2.26, 395/2.28, 395/2.29, 395/2.3, 395/2.31, 395/2.32, 395/2.17, 704/219, 704/217, 704/220, 704/221, 704/222, 704/223, 704/208, 704/216
US Class Current

704/223
CPC Class Codes

G10L 19/0018   Speech coding using phoneti...

G10L 19/12   the excitation function bei...

G10L 2019/0002   Codebook adaptations

G10L 2019/0013   Codebook search algorithms

G10L 25/18   the extracted parameters be...

Comb filter speech coding with preselected excitation code vectors

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

53 Citations

27 Claims

Specification

Solutions

Use Cases

Quick Links

Comb filter speech coding with preselected excitation code vectors

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

53 Citations

27 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links