Method for encoding speech wherein pitch periods are changed based upon input speech signal

US 6,427,135 B1
Filed: 10/27/2000
Issued: 07/30/2002
Est. Priority Date: 03/17/1997
Status: Expired due to Fees

Abstract

A method for encoding speech wherein an input speech signal is separated by a component separator into a first component mainly constituted by speech and a second component mainly constituted by a background noise at each predetermined unit of time, a bit allocation selector selects bit allocation for each component based on the first and second components from among a plurality of predetermined candidates for bit allocation, a speech encoder and a noise encoder encode the first and second components from the component separator based on the bit allocation according to predetermined different methods for encoding, and a multiplexer multiplexes encoded data of the first and second components and information on the bit allocation and outputs them as transmitted encoded data.
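
As a rough illustration of the bit allocation selection described in the abstract, the following Python sketch picks one of several predetermined candidates from the energies of the two separated components. The candidate table, the energy-share rule, the handling of silent frames, and the names `CANDIDATES`, `energy`, and `select_bit_allocation` are assumptions made for illustration only; the abstract does not specify them.

```python
# Hypothetical candidate allocations: (bits for speech encoder, bits for noise encoder).
# The values are illustrative; the abstract only says the candidates are predetermined.
CANDIDATES = [(80, 0), (60, 20), (40, 40), (0, 80)]


def energy(samples):
    """Sum of squared samples for one unit of time."""
    return sum(x * x for x in samples)


def select_bit_allocation(speech_component, noise_component, candidates=CANDIDATES):
    """Return the index of the candidate whose speech-bit share is closest
    to the speech share of the total energy (an assumed selection rule)."""
    e_speech = energy(speech_component)
    e_noise = energy(noise_component)
    total = e_speech + e_noise
    # Assumption: a completely silent unit is treated as evenly balanced.
    share = 0.5 if total == 0.0 else e_speech / total

    def mismatch(index):
        speech_bits, noise_bits = candidates[index]
        return abs(speech_bits / (speech_bits + noise_bits) - share)

    return min(range(len(candidates)), key=mismatch)
```

The returned index stands in for the "information on the bit allocation" that the multiplexer would transmit alongside the two encoded components.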

53 Citations

10 Claims

1. A method for encoding speech comprising the steps of:
- obtaining first pitch periods of an input speech signal;
  
  changing the pitch periods according to condition of the input speech signal to obtain second pitch periods;
  
  determining encoding sections corresponding to said second pitch periods, respectively;
  
  generating an excitation signal by which distortion of a synthesized speech signal is minimized for each of said encoding sections, the synthesized speech signal being generated by subjecting the excitation signal to synthesis filtering; and
  
  outputting at least information representing said changed pitch periods and information on said synthesized speech signal as encoded data.
- Dependent claims (2):
  - 2. The method for encoding speech according to claim 1, further comprising the step of concatenating said second pitch periods which are at least partially adjacent to each other to obtain concatenated pitch periods,
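
The first three steps of claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the claim does not say which condition of the input speech signal drives the change, so the voiced/unvoiced flag, the fixed fallback length, and the function names `change_pitch_periods` and `encoding_sections` are hypothetical.

```python
def change_pitch_periods(first_periods, is_voiced, fallback=40):
    """Derive second pitch periods from first pitch periods.

    Assumed rule: keep the measured period where the signal is voiced,
    otherwise substitute a fixed fallback length (in samples).
    """
    return [p if voiced else fallback for p, voiced in zip(first_periods, is_voiced)]


def encoding_sections(pitch_periods, start=0):
    """Lay out one encoding section per pitch period, end to end.

    Returns (begin, end) sample indices with end exclusive.
    """
    sections = []
    position = start
    for period in pitch_periods:
        if period <= 0:
            raise ValueError("pitch periods must be positive")
        sections.append((position, position + period))
        position += period
    return sections


second = change_pitch_periods([50, 52, 90], [True, True, False])
sections = encoding_sections(second)
```

Each resulting section is the span over which the excitation signal would then be searched.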

3. A method for encoding speech comprising the steps of:
- obtaining synthesis filter characteristic information representing the transfer characteristics of a synthesis filter which receives an excitation signal and generates a synthesized speech signal;
  
  obtaining first pitch periods of an input speech signal;
  
  changing the pitch periods according to condition of the input speech signal to obtain second pitch periods;
  
  determining encoding sections corresponding to said second pitch periods, respectively;
  
  generating said excitation signal by which distortion of said synthesized speech signal is minimized for each of said encoding sections; and
  
  outputting at least said synthesis filter characteristic information, information representing said second pitch periods and information representing said excitation signal as encoded data.
- Dependent claims (4):
  - 4. The method for encoding speech according to claim 3, further comprising the step of concatenating said second pitch periods which are at least partially adjacent to each other to obtain concatenated pitch periods,

5. A method for encoding speech comprises:
- setting a plurality of pitch marks in each frame of an input speech signal, each of the pitch marks indicating a position in the frame at which a pitch wave form is to be put;
  
  obtaining a plurality of pitch periods corresponding pitch marks, respectively, the pitch periods being changed according to condition of the input speech signal;
  
  generating an excitation signal by which distortion of a synthesized speech signal is minimized, for each of said pitch periods, the synthesized speech signal being generated by subjecting the excitation signal to synthesis filtering; and
  
  outputting at least information representing said pitch periods and information on said synthesized speech signal as encoded data.
- Dependent claims (6, 7, 8):
  - 6. A method according to claim 5, wherein the step of generating an excitation signal includes putting pitch waveforms on the pitch marks and applying a gain thereto to generate the excitation signal.
  - 7. A method according to claim 5, wherein the step of generating an excitation signal includes calculating an error between the synthesized speech signal and the input speech signal, weighting the error with a perceptual weighting method, and selecting an excitation signal for which distortion of the input speech signal is minimum.
  - 8. A method according to claim 6, which includes generating the pitch waveforms by sorting a plurality of template pitch waveforms in a codebook in advance and selecting the optimum pitch waveforms from the template pitch waveforms through closed loop search.
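
Claims 5 through 8 describe placing pitch waveforms on pitch marks, applying a gain, and choosing the template by closed-loop search. The Python sketch below shows one generic way such a search can work. It is an illustration under assumptions, not the patented procedure: the all-pole filter form, the plain squared error (the perceptual weighting of claim 7 is omitted), the least-squares gain, and all function names are hypothetical.

```python
def synthesize(excitation, lpc):
    """All-pole synthesis filter: s[n] = e[n] - sum_k lpc[k] * s[n-1-k]."""
    out = []
    for n, e in enumerate(excitation):
        acc = e
        for k, a in enumerate(lpc):
            if n - 1 - k >= 0:
                acc -= a * out[n - 1 - k]
        out.append(acc)
    return out


def place_waveforms(template, pitch_marks, length):
    """Build an excitation by adding the template at every pitch mark."""
    excitation = [0.0] * length
    for mark in pitch_marks:
        for i, value in enumerate(template):
            if 0 <= mark + i < length:
                excitation[mark + i] += value
    return excitation


def closed_loop_search(target, codebook, pitch_marks, lpc):
    """Try every template, synthesize it, fit a gain, keep the lowest error.

    Returns (template index, gain, squared error), or None if no template
    produces a non-zero synthesized signal.
    """
    best = None
    for index, template in enumerate(codebook):
        synth = synthesize(place_waveforms(template, pitch_marks, len(target)), lpc)
        synth_energy = sum(y * y for y in synth)
        if synth_energy == 0.0:
            continue
        # Least-squares gain for this template.
        gain = sum(t * y for t, y in zip(target, synth)) / synth_energy
        error = sum((t - gain * y) ** 2 for t, y in zip(target, synth))
        if best is None or error < best[2]:
            best = (index, gain, error)
    return best


# Usage: build a target from template 1 with gain 2.0 and recover both.
codebook = [[1.0, 0.0, 0.0], [1.0, -0.5, 0.25]]
marks, lpc = [0, 8], [-0.9]
reference = synthesize(place_waveforms(codebook[1], marks, 16), lpc)
target = [2.0 * y for y in reference]
result = closed_loop_search(target, codebook, marks, lpc)
```

The search is "closed loop" in the sense that every candidate is passed through the synthesis filter before its error against the target is measured.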

9. A method for encoding speech comprising the steps of:
- obtaining local pitch periods representing time lengths of one-pitch waveforms of an input speech signal from said input speech signal;
  
  determining encoding sections based on said local pitch periods;
  
  generating a synthesized speech signal for which distortion from said input speech signal is minimized in each of said encoding sections;
  
  outputting at least information representing said local pitch periods and information on said synthesized speech signal as encoded data; and
  
  concatenating said local pitch periods which are at least partially adjacent to each other to obtain local concatenated pitch periods, said step of generating a synthesized speech signal comprising the steps of determining encoding sections based on said local pitch periods and said local concatenated pitch periods and generating a synthesized speech signal for which distortion from said input speech signal is minimized in each of said encoding sections, said step of outputting encoded data comprising the step of outputting at least information representing said local pitch periods, information representing said local concatenated pitch periods and information on said synthesized speech signal as encoded data.

10. A method for encoding speech comprising the steps of:
- obtaining synthesis filter characteristic information representing the transfer characteristics of a synthesis filter which receives the input of an excitation signal and generates a synthesized speech signal and obtaining local pitch periods representing time lengths of one-pitch waveforms of an input speech signal from said input speech signal;
  
  determining encoding sections based on said local pitch periods;
  
  generating said excitation signal for which distortion of said synthesized speech signal is minimized in each of said encoding sections;
  
  outputting at least said synthesis filter characteristic information, information representing said local pitch periods and information representing said excitation signal as encoded data;
  
  concatenating said local pitch periods which are at least partially adjacent to each other to obtain local concatenated pitch periods, said step of generating an excitation signal comprising the steps of determining encoding sections based on said local pitch periods and said local concatenated pitch periods and generating said excitation signal for which distortion of said synthesized speech signal is minimized in each of said encoding sections, said step of outputting encoded data comprising the step of outputting at least said synthesis filter characteristic information, information representing said local pitch periods, information representing said local concatenated pitch periods and information representing said excitation signal as encoded data.
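
The concatenation step recited in claims 2, 4, 9 and 10 joins pitch periods that are adjacent to each other. The claims do not state which adjacent periods are joined, so the Python sketch below assumes a simple greedy rule with a maximum section length; both the rule and the name `concatenate_adjacent` are hypothetical.

```python
def concatenate_adjacent(periods, max_len):
    """Greedily merge neighbouring pitch periods (in samples).

    Assumed rule: a period is merged into the preceding concatenated period
    as long as the combined length does not exceed max_len.
    """
    merged = []
    for period in periods:
        if merged and merged[-1] + period <= max_len:
            merged[-1] += period
        else:
            merged.append(period)
    return merged


local_periods = [20, 22, 60, 25, 24]
concatenated = concatenate_adjacent(local_periods, max_len=64)
```

Under this rule the total duration is preserved, so encoding sections built from the concatenated periods cover the same samples as those built from the local periods.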

Current Assignee: Kabushiki Kaisha Toshiba (Toshiba Corporation)
Original Assignee: Kabushiki Kaisha Toshiba (Toshiba Corporation)
Inventors: Oshikiri, Masahiro; Miseki, Kimio; Akamine, Masami; Amada, Tadashi
Primary Examiner(s): Chawan, Vijay B
Application Number: US09/696,962
Time in Patent Office: 641 Days
Field of Search: 704/258, 704/207, 704/200, 704/1, 704/229, 704/205, 704/208, 704/222, 704/219, 704/230, 704/206, 704/500, 704/501, 704/203, 704/211, 704/214, 704/220
US Class Current: 704/258
CPC Class Codes:
G10L 19/012 Comfort noise or silence co...
G10L 19/02 using spectral analysis, e....
