Perceptual speech coding using prediction residuals, having harmonic magnitude codebook for voiced and waveform codebook for unvoiced frames

US 5,848,387 A
Filed: 10/25/1996
Issued: 12/08/1998
Est. Priority Date: 10/26/1995
Status: Expired due to Term

First Claim

Patent Images

1. A speech encoding method for an input speech signal divided on the time axis into blocks as units and for encoding the divided signal on a block-by-block basis, comprising the steps of:

finding short-term prediction residuals at least for a voiced portion of the input speech signal;

finding sinusoidal analytic encoding parameters based on the short-term prediction residuals thus found;

performing perceptually weighted vector quantization for each harmonic magnitude on the sinusoidal analytic encoding parameters to produce an encoded voiced portion of the input speech signal; and

encoding an unvoiced portion of the input speech signal by waveform encoding to produce an encoded unvoiced portion of the input speech signal.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A speech encoding method and apparatus for encoding an input speech signal on a block-by-block or frame-by-frame basis wherein short-term prediction residuals are found and then sinusoidal analytic encoding parameters are produced based on those short-term prediction residuals. Perceptually weighted vector quantization is performed for voiced blocks or frames by encoding their sinusoidal frequency or analytic harmonic magnitudes and, in the case of unvoiced blocks or frames, the time waveforms of the unvoiced blocks are encoded.

46 Citations

View as Search Results

8 Claims

1. A speech encoding method for an input speech signal divided on the time axis into blocks as units and for encoding the divided signal on a block-by-block basis, comprising the steps of:
- finding short-term prediction residuals at least for a voiced portion of the input speech signal;
  
  finding sinusoidal analytic encoding parameters based on the short-term prediction residuals thus found;
  
  performing perceptually weighted vector quantization for each harmonic magnitude on the sinusoidal analytic encoding parameters to produce an encoded voiced portion of the input speech signal; and
  
  encoding an unvoiced portion of the input speech signal by waveform encoding to produce an encoded unvoiced portion of the input speech signal.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The speech signal encoding method as claimed in claim 1 wherein it is judged whether the input speech signal is voiced or unvoiced and, based on the results of judgment, the portion of the input speech signal found to be voiced is processed with said sinusoidal analytic encoding and the portion of the input speech signal found to be unvoiced is vector quantized by a closed-loop optimum vector search using an analysis-by-synthesis method.
  - 3. The speech signal encoding method as claimed in claim 1 wherein one of the analytic encoding parameters comprises data representing a spectral envelope that is used as the sinusoidal analysis parameter used in the step of performing perceptually weighted vector quantization.
  - 4. The speech encoding method as claimed in claim 1 wherein the step of performing perceptually weighted vector quantization includes:
    - at least comprising;
      
      performing a first vector quantization operation on the input speech signal; and
      
      performing a second quantization step of quantizing a quantization error vector produced at the time of performing said first vector quantization.
  - 5. The speech signal encoding method as claimed in claim 4 wherein for a low bit rate an output of the first vector quantization step is taken out, and for a high bit rate an output of said first vector quantization step and an output of said second vector quantization step are taken out.

6. A speech encoding apparatus receiving an input speech signal divided on the time axis into blocks for encoding the divided signal on a block-by-block basis, comprising:
- means for finding short-term prediction residuals of at least a voiced portion of the input speech signal;
  
  means for finding sinusoidal analytic encoding parameters including a spectral harmonic magnitude envelope from the short-term prediction residuals thus found;
  
  means for performing perceptually weighted vector quantization at least on the spectral harmonic magnitude envelope; and
  
  means for encoding an unvoiced portion of the input speech signal by waveform encoding.

7. A speech encoding apparatus receiving an input speech signal divided on the time axis into blocks for encoding the signal on a block-by-block basis, comprising:
- means for finding short-term prediction residuals at least for a voiced portion of the input speech signal;
  
  means for finding linear spectral pairs of encoding parameters including a spectral magnitude harmonic envelope from the short-term prediction residuals; and
  
  means performing perceptually weighted multiple-stage vector quantization on the linear spectral pairs of encoding parameters limited in the frequency axis.

8. A portable radio terminal device comprising:
- amplifying means for amplifying input speech signals;
  
  A/D converting means for A/D conversion of the amplified speech signals;
  
  speech encoding means for encoding a speech signal output from said A/D converting means;
  
  transmission path encoding means for channel encoding the encoded speech signal;
  
  modulating means for modulating an output of said transmission path encoding means;
  
  D/A converting means for D/A converting the resulting modulated signal to an analog signal; and
  
  amplifier means for amplifying the analog signal from said D/A converting means and supplying the resulting amplified signal to an antenna, whereinsaid speech encoding means includesmeans for finding a short-term prediction residual of at least a voiced portion of said input speech signal;
  
  means for finding sinusoidal analytic encoding parameters from the short-term prediction residuals thus found;
  
  means for performing perceptually weighted vector quantization on said sinusoidal analytic encoding parameters; and
  
  means for encoding an unvoiced portion of said input speech signal by waveform encoding.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Sony Corporation (Sony Group Corp.)
Original Assignee
Sony Corporation (Sony Group Corp.)
Inventors
Nishiguchi, Masayuki, Matsumoto, Jun, Omori, Shiro, Iijima, Kazuyuki
Primary Examiner(s)
Hudspeth, David R.
Assistant Examiner(s)
Smits, Talivaldis Ivars

Application Number

US08/736,987
Time in Patent Office

774 Days
Field of Search

704/214, 704/219, 704/220, 704/222, 704/223
US Class Current

704/214
CPC Class Codes

G10L 19/06 Determination or coding of ...

Perceptual speech coding using prediction residuals, having harmonic magnitude codebook for voiced and waveform codebook for unvoiced frames

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

46 Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Perceptual speech coding using prediction residuals, having harmonic magnitude codebook for voiced and waveform codebook for unvoiced frames

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

46 Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links