EMOTIONAL-SPEECH SYNTHESIZING DEVICE, METHOD OF OPERATING THE SAME AND MOBILE TERMINAL INCLUDING THE SAME
Abstract
Provided is an emotional-speech synthesizing device including: a sentence recognition unit that recognizes a sentence that is input; a word emotion determination unit that calculates a probability vector of a pre-defined emotion for each word that makes up the recognized sentence and estimates the emotion and a rhythm based on the probability vector; and an emotional-speech synthesizing unit. The emotional-speech synthesizing unit calculates, in stages, degrees of similarity in the emotion and the rhythm between adjacent words based on context information on the recognized sentence, applies a weight to a phoneme candidate corresponding to each word based on the degrees of similarity and the probability vector, selects the phoneme candidate that has a minimum target pitch, a minimum duration time, and a minimum distance value of a target pitch contour, and thus synthesizes an emotional speech that corresponds to the recognized sentence in optimal units.
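The word emotion determination step can be illustrated with a minimal sketch. The emotion label set, the toy lexicon, and the function names below are all hypothetical and do not come from the patent; the sketch only assumes that each word maps to raw per-emotion scores, which are normalized into a probability vector, and that the estimated emotion is the most probable label.

```python
# Hypothetical emotion label set; the patent does not list specific labels here.
EMOTIONS = ("neutral", "joy", "sadness", "anger")

# Hypothetical lexicon of raw per-emotion scores, in the order of EMOTIONS.
LEXICON = {
    "great": (1.0, 6.0, 0.5, 0.5),
    "lost": (1.0, 0.5, 5.0, 1.5),
}


def emotion_probability_vector(word):
    """Normalize a word's raw scores so they sum to 1 (unknown words count as neutral)."""
    scores = LEXICON.get(word.lower(), (1.0, 0.0, 0.0, 0.0))
    total = sum(scores)
    return tuple(score / total for score in scores)


def estimate_emotion(prob):
    """Return the label with the highest probability."""
    best = max(range(len(EMOTIONS)), key=lambda i: prob[i])
    return EMOTIONS[best]


if __name__ == "__main__":
    for word in ("great", "lost", "table"):
        vector = emotion_probability_vector(word)
        print(word, vector, estimate_emotion(vector))
```

A real system would presumably derive the scores from a trained model or an annotated corpus rather than a hand-written table; the table is used here only to keep the sketch self-contained.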
20 Claims
1. An emotional-speech synthesizing unit that is configured to:
- calculate in stages degrees of similarity in the emotion and the rhythm between the adjacent words based on context information on the recognized sentence,
- apply weight to a phoneme candidate corresponding to the each word based on the degrees of the similarity and the probability vector,
- select the phoneme candidate that has a minimum target pitch, minimum duration time, a minimum distance value of a target pitch contour, and
- synthesize an emotional speech that corresponds to the recognized sentence in optimal units.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A method of operating an emotional-speech synthesizing device, comprising:
- recognizing a sentence that is input;
- calculating probability vector of an emotion that is pre-defined for each word that makes up the recognized sentence;
- estimating the emotion and a rhythm based on the calculated probability vector;
- calculating in stages degrees of similarity in the emotion and the rhythm between the adjacent words based on context information on the recognized sentence and applying weight to a phoneme candidate corresponding to the each word based on the degrees of the similarity and the probability vector; and
- selecting the phoneme candidate that has a minimum target pitch, minimum duration time, a minimum distance value of a target pitch contour, and thus synthesizing an emotional speech that corresponds to the recognized sentence in optimal units.
Dependent claims: 10, 11, 12
13. A mobile terminal comprising:
- an input unit that is configured in such a manner that a control command for outputting an emotional speech is input to the input unit;
- a controller that is configured to:
  - recognize at least one sentence that is input, based on the control command,
  - calculate probability vector of an emotion that is pre-defined for each word that makes up the recognized sentence,
  - estimate the emotion and a rhythm based on the probability vector,
  - calculate in stages degrees of similarity in the emotion and the rhythm between the adjacent words based on context information on the recognized sentence,
  - apply weight to a phoneme candidate corresponding to the each word based on the degrees of the similarity and the probability vector,
  - select the phoneme candidate that has a minimum target pitch, minimum duration time, a minimum distance value of a target pitch contour, and
  - synthesize an emotional speech that corresponds to the recognized sentence in optimal units, and
- a sound output unit that is configured to output the emotional speech that is synthesized by the controller.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
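The weighting and selection steps recited in the method of claim 9 can be sketched roughly as follows. This is one possible reading, not the patented implementation: it assumes that each phoneme candidate carries a hypothetical emotion tag and three precomputed distances (target pitch, duration, pitch contour), that similarity between adjacent words is the cosine similarity of their emotion probability vectors, and that the weight divides the summed distance to give a cost. Rhythm similarity is omitted, and every name (`Candidate`, `candidate_weight`, `select_candidate`, `plan_units`) is invented for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    emotion: int          # index into the emotion probability vector (hypothetical tag)
    pitch_dist: float     # distance from the target pitch
    duration_dist: float  # distance from the target duration
    contour_dist: float   # distance from the target pitch contour


def cosine_similarity(p, q):
    """Cosine similarity of two emotion probability vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def candidate_weight(cand, prob, prev_prob=None):
    """Blend the word's own probability with the previous word's, scaled by similarity."""
    if prev_prob is None:
        return prob[cand.emotion]
    sim = cosine_similarity(prob, prev_prob)
    return (prob[cand.emotion] + sim * prev_prob[cand.emotion]) / (1.0 + sim)


def select_candidate(candidates, prob, prev_prob=None):
    """Pick the candidate with the lowest weighted distance cost."""
    def cost(cand):
        distance = cand.pitch_dist + cand.duration_dist + cand.contour_dist
        # A higher weight lowers the cost; the small constant avoids division by zero.
        return distance / (candidate_weight(cand, prob, prev_prob) + 1e-6)
    return min(candidates, key=cost)


def plan_units(word_probs, word_candidates):
    """Walk the sentence word by word and return the chosen candidate names."""
    chosen, prev = [], None
    for prob, cands in zip(word_probs, word_candidates):
        chosen.append(select_candidate(cands, prob, prev).name)
        prev = prob
    return chosen


if __name__ == "__main__":
    probs = [(0.1, 0.9), (0.2, 0.8)]  # (neutral, joy) for two adjacent words
    cands = [
        [Candidate("a_neutral", 0, 0.1, 0.1, 0.1), Candidate("a_joy", 1, 0.3, 0.3, 0.3)],
        [Candidate("b_neutral", 0, 0.1, 0.1, 0.1), Candidate("b_joy", 1, 0.2, 0.2, 0.2)],
    ]
    print(plan_units(probs, cands))
```

In this toy input the joy-tagged candidates win even though their raw distances are larger, because the emotion weight outweighs the acoustic mismatch. Whether the claimed method trades these factors off in this way is not stated in the claims.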
Specification