Method for prosody generation by unit selection from an imitation speech database

US 20030028376A1
Filed: 07/31/2001
Published: 02/06/2003
Est. Priority Date: 07/31/2001
Status: Active Grant

First Claim

Patent Images

1. A computer implemented method for prosody generation, comprising the steps of:

preparing an imitation speech database using recordings of natural human speech;

converting text to synthesized speech using a rule based speech synthesizer;

selecting prosody units from said imitation speech database to match said synthesized speech; and

concatenating said selected prosody units and generating a final prosody.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method is provided for prosody generation by unit selection from an imitation speech database. A rule based method of text to speech conversion is used to produce a set of intonation events by selecting syllables on which there would be either a pitch peak or dip (or a combination), and produces the parameters to generate a pitch curve of the event. The synthetic pitch curve shape generated by the rule based method is then utilized to select the best matching units from an imitation speech database of a speaker'"'"'s prosody, which are then concatenated to reduce the final prosody.

24 Citations

View as Search Results

12 Claims

1. A computer implemented method for prosody generation, comprising the steps of:
- preparing an imitation speech database using recordings of natural human speech;
  
  converting text to synthesized speech using a rule based speech synthesizer;
  
  selecting prosody units from said imitation speech database to match said synthesized speech; and
  
  concatenating said selected prosody units and generating a final prosody.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The method according to claim 1, wherein said rule based computer synthesizer uses a tone sequence prosody model.
  - 3. The method according to claim 1, wherein said step of selecting prosody units from said imitation speech prosody database includes a cost function algorithm using distortion and concatenation costs.
  - 4. The method according to claim 1, wherein said step of selecting prosody units from said imitation speech prosody database includes associating each syllable in said synthesized speech with an event including pitch events.
  - 5. The method according to claim 1, wherein said step of concatenating said selected prosody units and generating a final prosody includes an F0 smoothing function performed at concatenation points between selected prosody units.

6. A computer implemented method for prosody generation, comprising the steps of:
- preparing an imitation speech prosody database including;
  
  converting training text to synthesized speech using a rule based computer synthesizer;
  
  recording human speech imitating said synthesized speech;
  
  time aligning said recorded human speech with said synthesized speech and extracting features from said recorded speech for syllables in which intonation events occur and generating said imitation speech prosody database; and
  
  generating speech prosody from text including;
  
  converting text to synthesized speech using a rule based synthesizer;
  
  selecting prosody units from said imitation speech prosody database to match said synthesized speech; and
  
  concatenating said selected prosody units and generating a final prosody.
- View Dependent Claims (7, 8, 9, 10, 11)
- - 7. The method according to claim 6, wherein said rule based synthesizer uses a tone sequence prosody model.
  - 8. The method according to claim 6, wherein said step of selecting prosody units from said imitation speech prosody database includes a cost function algorithm using distortion and concatenation costs.
  - 9. The method according to claim 6, wherein said step of selecting prosody units from said imitation speech prosody database includes associating each syllable in said synthesized speech with an event including pitch events.
  - 10. The method according to claim 6, wherein said step of concatenating said selected prosody units and generating a final prosody includes an F0 smoothing function performed at concatenation points between selected prosody units.
  - 11. The method according to claim 6, wherein said step of time aligning said recorded human speech with said synthesized speech is performed using a dynamic time warp aligner.

12. A speech generation processor for processing input text to speech, comprising:
- an imitation speech database including prosodic units from imitation speech;
  
  a rule based synthesizer module for generating synthesized speech curves for input text;
  
  an imitation speech prosody selection module for selecting prosodic units from said imitation speech database with said synthesized speech curves and concatenating said selected prosodic units together for speech generation; and
  
  an audible device for receiving a speech generation signal from said imitation speech prosody selection module and generating audible speech.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Panasonic Intellectual Property Corporation of America (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Meron, Joram

Granted Patent

US 6,829,581 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/258
CPC Class Codes

G10L 13/04 Details of speech synthesis...

G10L 13/06 Elementary speech units use...

Method for prosody generation by unit selection from an imitation speech database

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

24 Citations

12 Claims

Specification

Solutions

Use Cases

Quick Links

Method for prosody generation by unit selection from an imitation speech database

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

24 Citations

12 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links