Customizing the speaking style of a speech synthesizer based on semantic analysis
First Claim
Patent Images
1. A method for generating synthesized speech, comprising:
- receiving input text;
determining a topic for the input text;
determining a speaking style based on the identified topic, where the speaking style correlates to prosodic parameters; and
converting the input text to audible speech using the prosodic parameters.
4 Assignments
0 Petitions
Accused Products
Abstract
A method is provided for customizing the speaking style of a speech synthesizer. The method includes: receiving input text; determining semantic information for the input text; determining a speaking style for rendering the input text based on the semantic information; and customizing the audible speech output of the speech synthesizer based on the identified speaking style.
32 Citations
12 Claims
-
1. A method for generating synthesized speech, comprising:
-
receiving input text;
determining a topic for the input text;
determining a speaking style based on the identified topic, where the speaking style correlates to prosodic parameters; and
converting the input text to audible speech using the prosodic parameters. - View Dependent Claims (2, 3, 4)
-
-
5. A method for customizing the speaking style of a text-to-speech synthesizer system, comprising:
-
receiving input text;
determining semantic information for the input text;
determining a speaking style for rendering the input text based on the semantic information; and
customizing an output parameter of a multimedia user interface of the text-to-speech synthesizer system based on the speaking style, where the text-to-speech synthesizer system is operable to render audible speech which correlates to the input text. - View Dependent Claims (6, 7, 8, 9, 10, 11)
-
-
12. A text-to-speech synthesizer system, comprising a text analyzer receptive of input text and operable to determine semantic information for the input text;
-
a style selector adapted to receive semantic information from the text analyzer and operable to determine a speaking style for rending the input text based on the semantic information, where the selected speaking style correlates to one or more prosodic attributes;
a phonetic analyzer adapted to receive input text from the text analyzer and operable to convert the input text into corresponding phoneme data;
a prosodic analyzer adapted to receive phoneme data from the phonetic analyzer and the prosodic attributes from the style selector, the prosodic analyzer further operable to apply the prosodic attributes to the phoneme data to form a prosodic representation of the phoneme data; and
a speech synthesizer adapted to receive the prosodic representation of the phoneme data from the prosodic analyzer and operable to generate audible speech.
-
Specification