Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same
First Claim
1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:
- defining at least two friendliness levels;
storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels;
extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and
generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for adjusting the friendliness of a synthesized speech and thus generating synthesized speech of various styles in a speech synthesis system are provided. The method includes the steps of defining at least two friendliness levels; storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels; extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics.
15 Citations
8 Claims
-
1. A method of generating a prosodic model for controlling a speech style, comprising the steps of:
-
defining at least two friendliness levels;
storing recorded speech data of sentences, the sentences being made up according to each of the friendliness levels;
extracting at least one of prosodic characteristics for each of the friendliness levels from the recorded speech data, said prosodic characteristics including at least one of a sentence-final intonation type, boundary intonation types of intonation phrases in the sentence, and an average value of F0 of the sentence, with respect to the recorded speech data; and
generating a prosodic model for each of the friendliness levels by statistically modeling the at least one of the prosodic characteristics. - View Dependent Claims (2, 3, 4)
-
-
5. A speech synthesis method for adjusting a speech style, comprising the steps of:
-
(a) receiving a sentence with a marked friendliness level;
(b) selecting a prosodic model based on the marked friendliness level of the sentence; and
(c) generating a synthesized speech of the sentence with the marked friendliness level by obtaining speech segments from a synthesis unit database on the basis of the selected prosodic model, the synthesis unit database storing speech segments for each friendliness level. - View Dependent Claims (6, 7)
-
-
8. A speech synthesis apparatus for adjusting a speech style, comprising:
-
a prosodic model storage for storing prosodic models for each friendliness level, the prosodic models including sentential information and the corresponding prosodic characteristics for each friendliness level;
a synthesis unit database for storing speech segments of each friendliness level; and
a speech generator for selecting the prosodic model based on a marked friendliness level of an input sentence and obtaining the speech segments from the synthesis unit database on the basis of the selected prosodic model to generate a synthesized speech of the input sentence.
-
Specification