Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same
First Claim
1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:
- defining at least two friendliness levels;
storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels;
extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and
generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics,wherein the prosodic model includes information comprises an “
opening”
speech act and sentence type, a “
request-information”
speech act and sentence type, a “
give-information”
speech act and sentence type, a “
request-action”
speech act and sentence type, and a “
closing”
speech act and sentence type.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for adjusting the friendliness of a synthesized speech and thus generating synthesized speech of various styles in a speech synthesis system are provided. The method includes the steps of defining at least two friendliness levels; storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels; extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics.
11 Citations
8 Claims
-
1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:
-
defining at least two friendliness levels; storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels; extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics, wherein the prosodic model includes information comprises an “
opening”
speech act and sentence type, a “
request-information”
speech act and sentence type, a “
give-information”
speech act and sentence type, a “
request-action”
speech act and sentence type, and a “
closing”
speech act and sentence type. - View Dependent Claims (2, 3, 4)
-
-
5. A speech synthesis method for adjusting a speech style, comprising the steps of:
-
(a) receiving a sentence with a marked friendliness level; (b) selecting a prosodic model based on the marked friendliness level of the sentence; and (c) generating a synthesized speech of the sentence with the marked friendliness level by obtaining speech segments from a synthesis unit database on the basis of the selected prosodic model, the synthesis unit database storing speech segments for each friendliness level wherein the selected prosodic model includes information of speech act and sentence type that comprises an “
opening”
speech act and sentence type, a “
request-information”
speech act and sentence type, a “
give-information”
speech act and sentence type, a “
request-action”
speech act and sentence type, and a “
closing”
speech act and sentence type. - View Dependent Claims (6, 7)
-
-
8. A speech synthesis apparatus for adjusting a speech style, comprising:
-
a prosodic model storage for storing prosodic models for each friendliness level, the prosodic models including sentential information and the corresponding prosodic characteristics for each friendliness level wherein the prosodic model includes an “
opening”
speech act and sentence type, a “
request-information”
speech act and sentence type, a “
give-information”
speech act and sentence type, a “
request-action”
speech act and sentence type, and a “
closing”
speech act and sentence type;a synthesis unit database for storing speech segments of each friendliness level; and a speech generator for selecting the prosodic model based on a marked friendliness level of an input sentence and obtaining the speech segments from the synthesis unit database on the basis of the selected prosodic model to generate a synthesized speech of the input sentence.
-
Specification