Apparatus and method for speech coding

US 7,383,176 B2
Filed: 04/01/2005
Issued: 06/03/2008
Est. Priority Date: 08/23/1999
Status: Expired due to Term

First Claim

Patent Images

1. A CELP-based speech encoder that performs encoding by decomposing one frame into a plurality of subframes, comprising:

an LPC synthesizer that obtains synthesized speech by filtering an adaptive excitation vector and a stochastic excitation vector stored in an adaptive codebook and in a stochastic codebook using LPC coefficients obtained from input speech;

a gain calculator that calculates gains of said adaptive excitation vector and said stochastic excitation vector;

a parameter coder that performs vector quantization of the adaptive excitation vector and the stochastic excitation vector obtained by comparing distortions between said input speech and said synthesized speech;

a pitch analyzer that calculates correlation values by performing pitch analyses of the plurality of subframes before performing an adaptive codebook search for a first subframe and finds a value most approximate to a pitch period using said correlation values; and

a search range setter that determines a lag search range using at least one of said correlation values and a value calculated using said correlation values.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

CELP-based speech encoder that performs encoding by decomposing one frame into a plurality of subframes, includes an LPC synthesizer that obtains synthesized speech by filtering an adaptive excitation vector and a stochastic excitation vector stored in an adaptive codebook and in an stochastic codebook using LPC coefficients obtained from input speech. A gain calculator calculates gains of the adaptive excitation vector and the stochastic excitation vector. A parameter coder performs vector quantization of the adaptive excitation vector and the stochastic excitation vector obtained by comparing distortions between the input speech and the synthesized speech. A pitch analyzer performs pitch analyses of a plurality of subframes in the frame respectively, before performing an adaptive codebook search for the first subframe, calculating correlation values and finding a value most approximate to the pitch period using the correlation values.

Citations

8 Claims

1. A CELP-based speech encoder that performs encoding by decomposing one frame into a plurality of subframes, comprising:
- an LPC synthesizer that obtains synthesized speech by filtering an adaptive excitation vector and a stochastic excitation vector stored in an adaptive codebook and in a stochastic codebook using LPC coefficients obtained from input speech;
  
  a gain calculator that calculates gains of said adaptive excitation vector and said stochastic excitation vector;
  
  a parameter coder that performs vector quantization of the adaptive excitation vector and the stochastic excitation vector obtained by comparing distortions between said input speech and said synthesized speech;
  
  a pitch analyzer that calculates correlation values by performing pitch analyses of the plurality of subframes before performing an adaptive codebook search for a first subframe and finds a value most approximate to a pitch period using said correlation values; and
  
  a search range setter that determines a lag search range using at least one of said correlation values and a value calculated using said correlation values.
- View Dependent Claims (2, 3, 4, 5, 6)
- - 2. The CELP-based speech encoder according to claim 1, wherein the search range setter determines the lag search range based on the at least one correlation value and the value most approximate to the pitch period obtained by said pitch analyzer.
  - 3. The CELP-based speech encoder according to claim 2, wherein said search range setter determines a provisional pitch that becomes the center of the search range using the correlation values and the value most approximate to the pitch period obtained by said pitch analyzer.
  - 4. The CELP-based speech encoder according to claim 3, wherein the search range setter sets a lag search section in a specified range around the provisional pitch.
  - 5. The CELP-based speech encoder according to claim 2, wherein the search range setter sets a lag search section by reducing a number of candidates for short pitch periods.
  - 6. The CELP-based speech encoder according to claim 2, wherein the search range setter performs a lag search within a set range during an adaptive codebook search.

7. A computer-readable recording medium that stores a speech encoding program, an adaptive codebook storing part used for synthesizing an excitation vector signal and a stochastic codebook storing a plurality of stochastic excitation vectors, said speech encoding program comprising:
- code for obtaining a synthesized speech by filtering an adaptive excitation vector and a stochastic excitation vector stored in said adaptive codebook and said stochastic codebook using decoded LPC coefficients obtained from an input speech;
  
  code for calculating gains of said adaptive excitation vector and said stochastic excitation vector;
  
  code for performing vector quantization on the adaptive excitation vector and the stochastic excitation vector determined by comparing distortions between said input speech and said synthesized speech;
  
  code for calculating correlation values by performing pitch analyses of a plurality of subframes in a processing frame before performing an adaptive codebook search of a first subframe and calculating a value most approximate to a pitch period using said correlation values; and
  
  code for determining a lag search range using at least one of said correlation values and a value calculated using said correlation values.

8. A CELP-based speech encoding method for performing encoding by decomposing one frame into a plurality of subframes, comprising:
- obtaining a synthesized speech by filtering an adaptive excitation vector and by filtering a stochastic excitation vector stored in an adaptive codebook and in a stochastic codebook using decoded LPC coefficients obtained from an input speech;
  
  calculating gains of said adaptive excitation vector and said stochastic excitation vector;
  
  performing vector quantization on the adaptive excitation vector and the stochastic excitation vector obtained by comparing distortions between said input speech and said synthesized speech;
  
  calculating correlation values by performing pitch analyses of the plurality of subframes before performing an adaptive codebook search for a first subframe, and finding a value most approximate to the pitch period using said correlation values; and
  
  determining a lag search range using at least one of said correlation values and a value calculated using said correlation values.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
III Holdings 12, LLC (Intellectual Ventures LLC)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Morii, Toshiyuki, Yasunaga, Kazutoshi
Primary Examiner(s)
ABEBE, DANIEL DEMELASH

Application Number

US11/095,605
Publication Number

US 20050197833A1
Time in Patent Office

1,159 Days
Field of Search

704/219, 704/222, 704/223
US Class Current

704/219
CPC Class Codes

G10L 19/083   the excitation function bei...

G10L 19/09   Long term prediction, i.e. ...

G10L 19/16   Vocoder architecture

Apparatus and method for speech coding

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Apparatus and method for speech coding

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links