Method and system for intuitive text-to-speech synthesis customization
First Claim
Patent Images
1. A system for tuning the text-to-speech conversion process, the system comprising:
- a text-to-speech engine, said text-to-speech engine receiving at least one text-input and converting said text-input into a processed representation, said processed representation including at least one speech feature associated with at least one segment of said representation; and
a visual editing interface, said visual editing interface displaying said processed representation using at least one graphical indicator on an output device, wherein said segment is displayed on said output device using said graphical indicator corresponding to said speech feature.
2 Assignments
0 Petitions
Accused Products
Abstract
A system for tuning the text-to-speech conversion process having a text-to-speech engine that converts the input text into a processed text form which includes speech features. A visual editing interface displaying the processed text form using graphical indicators on an output device to allow a user to edit the text and graphical indicators to modify the speech features of the text input.
33 Citations
28 Claims
-
1. A system for tuning the text-to-speech conversion process, the system comprising:
-
a text-to-speech engine, said text-to-speech engine receiving at least one text-input and converting said text-input into a processed representation, said processed representation including at least one speech feature associated with at least one segment of said representation; and
a visual editing interface, said visual editing interface displaying said processed representation using at least one graphical indicator on an output device, wherein said segment is displayed on said output device using said graphical indicator corresponding to said speech feature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for providing a text-to-speech interface, the system comprising:
-
a visual interface connected to a text-to-speech engine; and
at least one communication channel connecting said visual interface to said text-to-speech engine, said text-to-speech engine communicating with said visual interface over said communication channel by sending and receiving at least one data segment in a format. - View Dependent Claims (19, 20, 21)
-
-
22. A method for visual tuning text-to-speech conversion process, the method comprising:
-
converting an input-text to a processed representation using a text-to-speech engine, said processed representation including at least one speech feature of said input-text;
displaying said processed representation on a visual editing interface connected to said text-to-speech engine, said speech feature of said processed representation being displayed in a corresponding graphical form; and
providing an editing function in said visual editing interface to a user for modifying said speech feature in said graphical form. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
Specification