Speech coding

US 6,470,313 B1
Filed: 03/04/1999
Issued: 10/22/2002
Est. Priority Date: 03/09/1998
Status: Expired due to Term

First Claim

Patent Images

1. A method of coding a speech signal which signal comprises a sequence of subframes containing digitised speech samples, the method comprising, for each subframe:

(a) selecting a quantised vector d(i) comprising at least one pulse, wherein the number m and position of pulses in the vector d(i) may vary between subframes;

(b) determining a gain value g_cfor scaling the amplitude of the quantised vector d(i) or of a further vector c(i) derived from the quantised vector d(i), wherein the scaled vector synthesizes a weighted residual signal {tilde over (s)};

(c) determining a scaling factor k which is a function of the ratio of a predetermined energy level to the energy in the quantised vector d(i);

(d) determining a predicted gain value ĝ

_con the basis of one or more previously processed subframes, and as a function of the energy E_cof the quantised vector d(i) or said further vector c(i) when the amplitude of the vector is scaled by said scaling factor k; and

(e) determining a quantised gain correction factor {circumflex over (γ

)}_gcusing said gain value g_cand said predicted gain value ĝ

_c.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A variable bit-rate speech coding method determines for each subframe a quantised vector d(i) comprising a variable number of pulses. An excitation vector c(i) for exciting LTP and LPC synthesis filters is derived by filtering the quantised vector d(i), and a gain value g_cis determined for scaling the pulse amplitude excitation vector c(i) such that the scaled excitation vector represents the weighted residual signal {tilde over (s)} remaining in the subframe speech signal after removal of redundant information by LPC and LTP analysis. A predicted gain value ĝ_cis determined from previously processed subframes, and as a function of the energy E_ccontained in the excitation vector c(i) when the amplitude of that vector is scaled in dependence upon the number of pulses m in the quantised vector d(i). A quantised gain correction factor {circumflex over (γ)}_gcis then determined using the gain value g_cand the predicted gain value ĝ_c.

Citations

16 Claims

1. A method of coding a speech signal which signal comprises a sequence of subframes containing digitised speech samples, the method comprising, for each subframe:
- (a) selecting a quantised vector d(i) comprising at least one pulse, wherein the number m and position of pulses in the vector d(i) may vary between subframes;
  
  (b) determining a gain value g_cfor scaling the amplitude of the quantised vector d(i) or of a further vector c(i) derived from the quantised vector d(i), wherein the scaled vector synthesizes a weighted residual signal {tilde over (s)};
  
  (c) determining a scaling factor k which is a function of the ratio of a predetermined energy level to the energy in the quantised vector d(i);
  
  (d) determining a predicted gain value ĝ
  
  _con the basis of one or more previously processed subframes, and as a function of the energy E_cof the quantised vector d(i) or said further vector c(i) when the amplitude of the vector is scaled by said scaling factor k; and
  
  (e) determining a quantised gain correction factor {circumflex over (γ
  
  )}_gcusing said gain value g_cand said predicted gain value ĝ
  
  _c.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
- - 2. A method according to claim 1, the method being a variable bit-rate coding method and comprising:
3. A method according to claim 1 and comprising:
- generating a set of linear predictive coding (LPC) coefficients a for each frame and a set of long term prediction (LTP) parameters b for each subframe, wherein a frame comprises a plurality of speech subframes; and
  
  producing a coded speech signal on the basis of the LPC coefficients, the LTP parameters, the quantised vector d(i), and the quantised gain correction factor {circumflex over (γ
  
  )}_gc.
4. A method according to claim 1 and comprising defining the quantised vector d(i) in the coded signal by an algebraic code u.
5. A method according to claim 1, wherein the predicted gain value is determined according to the equation:
6. A method according to claim 1, wherein said predicted gain value ĝ
- _cis a function of the mean removed excitation energy E(n) of the quantised vector d(i) or said further vector c(i), of each of said previously processed subframes, when the amplitude of the vector is scaled by said scaling factor k.
7. A method according to claim 1, wherein the gain value g_cis used to scale said further vector c(i), and that further vector is generated by filtering the quantised vector d(i).
8. A method according to claim 5, wherein:
- said predicted gain value ĝ
  
  _cis a function of the mean removed excitation energy E(n) of the quantised vector d(i) or said further vector c(i), of each of said previously processed subframes, when the amplitude of the vector is scaled by said scaling factor k;
  
  the gain value g_cis used to scale said further vector c(i), and that further vector is generated by filtering the quantised vector d(i); and
  
  the predicted energy is determined using the equation;
  
  $\hat{E} (n) = \sum_{i = 1}^{p} b_{i} \hat{R} (n - i)$ where b_iare the moving average prediction coefficients, p is the prediction order, and {circumflex over (R)}(j) is the error in the predicted energy Ê
  
  (j) at previous subframe j, given by;
9. A method according to claim 5, wherein the term E_cis determined using the equation:
- $E_{c} = 10 \log (\frac{1}{N} \sum_{i = 0}^{N - 1} {(kc (i))}^{2})$ where N is the number of samples in the subframe.
10. A method according to claim 1, wherein, if the quantisation vector d(i) comprises two or more pulses, all of the pulses have the same amplitude.
11. A method according to claim 1, wherein the scaling factor is given by:
- $k = \sqrt{\frac{M}{m}}$ where M is the maximum permissible number of pulses in the quantised vector d(i).
12. A method according to claim 1 and comprising searching a gain correction factor codebook to determine the quantised gain correction factor {circumflex over (γ
- )}_gcwhich minimises the error;

13. A method of decoding a sequence of coded subframes of a digitised sampled speech signal, the method comprising for each subframe:
- (a) recovering from the coded signal a quantised vector d(i) comprising at least one pulse, wherein the number m and position of pulses in the vector d(i) may vary between subframes;
  
  (b) recovering from the coded signal a quantised gain correction factor {circumflex over (γ
  
  )}_gc;
  
  (c) determining a scaling factor k which is a function of the ratio of a predetermined energy level to the energy in the quantised vector d(i);
  
  (d) determining a predicted gain value ĝ
  
  _con the basis of one or more previously processed subframes, and as a function of the energy E_cof the quantised vector d(i) or a further vector c(i) derived from the quantised vector, when the amplitude of the vector is scaled by said scaling factor k; and
  
  (e) correcting the predicted gain value ĝ
  
  _cusing the quantised gain correction factor {circumflex over (γ
  
  )}_gcto provide a corrected gain value g_c; and
  
  (f) scaling the quantised vector d(i) or said further vector c(i) using the gain value g_cto generate an excitation vector synthesizing a residual signal {tilde over (s)} remaining in the original subframe speech signal after removal of substantially redundant information therefrom.
- View Dependent Claims (14)
- - 14. A method according to claim 13, wherein each coded subframe of the received signal comprises an algebraic code u defining the quantised vector d(i) and an index addressing a quantised gain correction factor codebook from where the quantised gain correction factor {circumflex over (γ
    - )}_gcis obtained.

15. Apparatus for coding a speech signal which signal comprises a sequence of subframes containing digitised speech samples, the apparatus having means for coding each of said subframes in turn, which means comprises:
- vector selecting means for selecting a quantised vector d(i) comprising at least one pulse, wherein the number m and position of pulses in the vector d(i) may vary between subframes;
  
  first signal processing means for determining a gain value g_cfor scaling the amplitude of the quantised vector d(i) or a further vector c(i) derived from the quantised vector d(i), wherein the scaled vector synthesizes a weighted residual signal {tilde over (s)};
  
  second signal processing means for determining a scaling factor k which is a function of the ratio of a predetermined energy level to the energy in the quantised vector d(i);
  
  third signal processing means for determining a predicted gain value ĝ
  
  _con the basis of one or more previously processed subframes, and as a function of the energy E_cof the quantised vector d(i) or said further vector c(i), when the amplitude of the vector is scaled by said scaling factor k; and
  
  fourth signal processing means for determining a quantised gain correction factor {circumflex over (γ
  
  )}_gcusing said gain value g_cand said predicted gain value {circumflex over (γ
  
  )}_gc.

16. Apparatus for decoding a sequence of coded subframes of a digitised sampled speech signal, the apparatus having means for decoding each of said subframes in turn, the means comprising:
- first signal processing means for recovering from the coded signal a quantised vector d(i) comprising at least one pulse, wherein the number m and position of pulses in the vector d(i) may vary between subframes;
  
  second signal processing means for recovering from the coded signal a quantised gain correction factor {circumflex over (γ
  
  )}_gc;
  
  third signal processing means for determining a scaling factor k which is a function of the ratio of a predetermined energy level to the energy in the quantised vector d(i);
  
  fourth signal processing means for determining a predicted gain value ĝ
  
  _con the basis of one or more previously processed subframes, and as a function of the energy E_cof the quantised vector d(i) or a further vector c(i) derived from the quantised vector when the amplitude of the vector is scaled by said scaling factor k; and
  
  correcting means for correcting the predicted gain value ĝ
  
  _cusing the quantised gain correction factor {circumflex over (γ
  
  )}_gcto provide a corrected gain value g_c; and
  
  scaling means for scaling the quantised vector d(i) or said further vector c(i) using the gain value g_cto generate an excitation vector synthesizing a residual signal {tilde over (s)} remaining in the original subframe speech signal after removal of substantially redundant information therefrom.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Nokia Technologies Oy (Nokia Corporation)
Original Assignee
Nokia Mobile Phones UK Limited (Nokia Corporation)
Inventors
Ojala, Pasi
Primary Examiner(s)
Knepper, David D.

Application Number

US09/263,439
Time in Patent Office

1,328 Days
Field of Search

704/219-225, 704/200, 704/201
US Class Current

704/223
CPC Class Codes

G10L 19/002 Dynamic bit allocation for ...

G10L 19/06 Determination or coding of ...

Speech coding

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

Speech coding

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links