Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same

US 7,792,673 B2
Filed: 11/07/2006
Issued: 09/07/2010
Est. Priority Date: 11/08/2005
Status: Expired due to Fees

First Claim

Patent Images

1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:

defining at least two friendliness levels;

storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels;

extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F₀of the sentence, with respect to the recorded speech data; and

generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics,wherein the prosodic model includes information comprises an “

opening”

speech act and sentence type, a “

request-information”

speech act and sentence type, a “

give-information”

speech act and sentence type, a “

request-action”

speech act and sentence type, and a “

closing”

speech act and sentence type.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An apparatus and method for adjusting the friendliness of a synthesized speech and thus generating synthesized speech of various styles in a speech synthesis system are provided. The method includes the steps of defining at least two friendliness levels; storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels; extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F₀of the sentence, with respect to the recorded speech data; and generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics.

11 Citations

View as Search Results

8 Claims

1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:
- defining at least two friendliness levels;
  
  storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels;
  
  extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F₀of the sentence, with respect to the recorded speech data; and
  
  generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics,wherein the prosodic model includes information comprises an “
  
  opening”
  
  speech act and sentence type, a “
  
  request-information”
  
  speech act and sentence type, a “
  
  give-information”
  
  speech act and sentence type, a “
  
  request-action”
  
  speech act and sentence type, and a “
  
  closing”
  
  speech act and sentence type.
- View Dependent Claims (2, 3, 4)
- - 2. The method according to claim 1, wherein the “
    - request-action”
      
      speech act and sentence type is classified into a “
      
      wh-question” and
      
      a “
      
      yes-no question”
      
      .
  - 3. The method according to claim 1 wherein the prosodic model further comprises a “
    - propose-action”
      
      speech act and sentence type, a “
      
      expressive”
      
      speech act and sentence type, a “
      
      commit”
      
      speech act and sentence type, a “
      
      call”
      
      speech act and sentence type, a “
      
      acknowledge”
      
      speech act and sentence type, a “
      
      statement”
      
      speech act and sentence type, a “
      
      command”
      
      speech act and sentence type, a “
      
      proposition”
      
      speech act and sentence type, and a “
      
      exclamation”
      
      speech act and sentence type.
  - 4. The method according to claim 1, wherein the prosodic characteristic includes the characteristics of the average F₀value of the sentence and the sentence-final intonation type for each of the friendliness levels.

5. A speech synthesis method for adjusting a speech style, comprising the steps of:
- (a) receiving a sentence with a marked friendliness level;
  
  (b) selecting a prosodic model based on the marked friendliness level of the sentence; and
  
  (c) generating a synthesized speech of the sentence with the marked friendliness level by obtaining speech segments from a synthesis unit database on the basis of the selected prosodic model, the synthesis unit database storing speech segments for each friendliness level wherein the selected prosodic model includes information of speech act and sentence type that comprises an “
  
  opening”
  
  speech act and sentence type, a “
  
  request-information”
  
  speech act and sentence type, a “
  
  give-information”
  
  speech act and sentence type, a “
  
  request-action”
  
  speech act and sentence type, and a “
  
  closing”
  
  speech act and sentence type.
- View Dependent Claims (6, 7)
- - 6. The speech synthesis method according to claim 5, wherein the synthesis unit database stores sentence data and the corresponding speech segments recorded according to each friendliness level, the sentence data including information of speech act, a sentence type, or a sentence final verbal-ending or a combination thereof according to each friendliness level.
  - 7. The speech synthesis method according to claim 5, wherein the step (c) includes the steps of:
    - (c1) extracting the speech segments from the synthesis unit database using prosodic information of the sentence based on the selected prosodic model; and
      
      (c2) synthesizing the extracted speech segments.

8. A speech synthesis apparatus for adjusting a speech style, comprising:
- a prosodic model storage for storing prosodic models for each friendliness level, the prosodic models including sentential information and the corresponding prosodic characteristics for each friendliness level wherein the prosodic model includes an “
  
  opening”
  
  speech act and sentence type, a “
  
  request-information”
  
  speech act and sentence type, a “
  
  give-information”
  
  speech act and sentence type, a “
  
  request-action”
  
  speech act and sentence type, and a “
  
  closing”
  
  speech act and sentence type;
  
  a synthesis unit database for storing speech segments of each friendliness level; and
  
  a speech generator for selecting the prosodic model based on a marked friendliness level of an input sentence and obtaining the speech segments from the synthesis unit database on the basis of the selected prosodic model to generate a synthesized speech of the input sentence.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Electronics and Telecommunications Research Institute
Original Assignee
Electronics and Telecommunications Research Institute
Inventors
Kim, Sang Hun, Oh, Seung Shin, Lee, Young Jik
Primary Examiner(s)
Sked; Matthew J

Application Number

US11/593,852
Publication Number

US 20070106514A1
Time in Patent Office

1,400 Days
Field of Search

None
US Class Current

704/266
CPC Class Codes

G10L 13/033 Voice editing, e.g. manipul...

G10L 13/04 Details of speech synthesis...

Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

11 Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

11 Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links