Method and apparatus for providing speech output for speech-enabled applications
First Claim
1. A method for providing, from a synthesis system, a speech output for a speech-enabled application, the method comprising:
receiving from the speech-enabled application, at the synthesis system, a text input comprising a text transcription of a desired speech output;
selecting, using at least one computer system implementing the synthesis system, at least one audio recording provided by a developer of the speech-enabled application who is not a developer of the synthesis system, the at least one audio recording corresponding to at least a first portion of the text input; and
providing for the speech-enabled application, from the synthesis system, a speech output comprising the at least one audio recording.
Abstract
Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired speech output. The synthesis system selects one or more audio recordings corresponding to one or more portions of the text input. In one aspect, the synthesis system selects from audio recordings provided by a developer of the speech-enabled application. In another aspect, the synthesis system selects an audio recording of a speaker speaking a plurality of words. The synthesis system forms a speech output including the one or more selected audio recordings and provides the speech output for the speech-enabled application.
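The selection step described above can be illustrated with a minimal sketch. It is not the patented implementation: the greedy longest-match strategy, the `Segment` type, the `normalize` and `select_recordings` helpers, and the audio file names are all hypothetical, chosen only to show how developer-provided recordings might be matched to portions of a text input. Portions with no matching recording are simply flagged as unmatched, since the text above does not say how they are handled.

```python
import re
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Segment:
    """One piece of the assembled speech output (hypothetical type)."""
    text: str       # portion of the text input covered by this segment
    source: str     # "recording" or "unmatched"
    audio_ref: str  # hypothetical recording file name; "" if unmatched


def normalize(text: str) -> str:
    """Lowercase and drop punctuation so lookups ignore surface differences."""
    return re.sub(r"[^\w\s]", "", text).lower().strip()


def select_recordings(text_input: str, recordings: Dict[str, str]) -> List[Segment]:
    """Match developer recordings to portions of the text input.

    `recordings` maps a normalized transcription to an audio file name.
    Assumption: greedy longest-match over whitespace-separated words.
    """
    words = text_input.split()
    segments: List[Segment] = []
    i = 0
    while i < len(words):
        matched_end = None
        # Try the longest span starting at word i first, then shrink it.
        for j in range(len(words), i, -1):
            key = normalize(" ".join(words[i:j]))
            if key in recordings:
                matched_end = j
                segments.append(
                    Segment(" ".join(words[i:j]), "recording", recordings[key])
                )
                break
        if matched_end is not None:
            i = matched_end
            continue
        # No recording covers this word: merge it into a run of unmatched text.
        if segments and segments[-1].source == "unmatched":
            previous = segments[-1]
            segments[-1] = Segment(previous.text + " " + words[i], "unmatched", "")
        else:
            segments.append(Segment(words[i], "unmatched", ""))
        i += 1
    return segments


if __name__ == "__main__":
    # Hypothetical recordings supplied by the application developer.
    developer_recordings = {
        "your flight is scheduled to depart at": "depart_prompt.wav",
        "thank you": "thanks.wav",
    }
    request = "Your flight is scheduled to depart at 9:30. Thank you!"
    for segment in select_recordings(request, developer_recordings):
        print(segment)
```

Longest-match is used here so that a recording of a whole phrase is preferred over recordings of its individual words; a real system could rank candidates differently.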
30 Claims
1. A method for providing, from a synthesis system, a speech output for a speech-enabled application, the method comprising:
receiving from the speech-enabled application, at the synthesis system, a text input comprising a text transcription of a desired speech output;
selecting, using at least one computer system implementing the synthesis system, at least one audio recording provided by a developer of the speech-enabled application who is not a developer of the synthesis system, the at least one audio recording corresponding to at least a first portion of the text input; and
providing for the speech-enabled application, from the synthesis system, a speech output comprising the at least one audio recording.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. Apparatus comprising at least one processor configured to:
receive from a speech-enabled application, at a synthesis system, a text input comprising a text transcription of a desired speech output;
select, via the synthesis system, at least one audio recording provided by a developer of the speech-enabled application who is not a developer of the synthesis system, the at least one audio recording corresponding to at least a first portion of the text input; and
provide for the speech-enabled application, from the synthesis system, a speech output comprising the at least one audio recording.
Dependent claims: 18, 19, 20, 21, 22, 23
24. At least one non-transitory computer-readable storage medium encoded with a plurality of computer-executable instructions that, when executed, perform a method for providing a speech output for a speech-enabled application from a synthesis system, the method comprising:
receiving from the speech-enabled application, at the synthesis system, a text input comprising a text transcription of a desired speech output;
selecting, via the synthesis system, at least one audio recording provided by a developer of the speech-enabled application who is not a developer of the synthesis system, the at least one audio recording corresponding to at least a first portion of the text input; and
providing for the speech-enabled application, from the synthesis system, a speech output comprising the at least one audio recording.
Dependent claims: 25, 26, 27, 28, 29, 30
Specification