Personalized text-to-speech services
First Claim
1. A method comprising:
- receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion;
selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates;
accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message;
generating variable speech corresponding to the variable text portion of the textual message; and
merging the pre-recorded speech and the variable speech in an order defined by the speech template.
6 Assignments
0 Petitions
Accused Products
Abstract
A personalized text-to-speech (pTTS) system provides a method for converting text data to speech data utilizing a pTTS template representing the voice characteristics of an individual. A memory stores executable program code that converts text data to speech data. Text data represents a textual message directed to a system user and speech data represents a spoken form of text data having the characteristics of an individual'"'"'s voice. A processor executes the program code, and a storage device stores a pTTS template and may store speech data. The pTTS system can be used to provide various services that provide immediate spoken presentation of the speech data converted from text data and/or combine stored speech data with generated speech data for spoken presentation.
14 Citations
19 Claims
-
1. A method comprising:
-
receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion; selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates; accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message; generating variable speech corresponding to the variable text portion of the textual message; and merging the pre-recorded speech and the variable speech in an order defined by the speech template. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion; selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates; accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message; generating variable speech corresponding to the variable text portion of the textual message; and merging the pre-recorded speech and the variable speech in an order defined by the speech template. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computer-readable device having instructions stored, which, when executed by a computing device, cause the computing device to perform operations comprising:
-
receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion; selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates; accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message; generating variable speech corresponding to the variable text portion of the textual message; and merging the pre-recorded speech and the variable speech in an order defined by the speech template. - View Dependent Claims (18, 19)
-
Specification