METHOD AND APPARATUS FOR PROVIDING SPEECH OUTPUT FOR SPEECH-ENABLED APPLICATIONS
7 Assignments
0 Petitions
Accused Products
Abstract
Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired speech output. The synthesis system selects one or more audio recordings corresponding to one or more portions of the text input. In one aspect, the synthesis system selects from audio recordings provided by a developer of the speech-enabled application. In another aspect, the synthesis system selects an audio recording of a speaker speaking a plurality of words. The synthesis system forms a speech output including the one or more selected audio recordings and provides the speech output for the speech-enabled application.
-
Citations
41 Claims
-
1-30. -30. (canceled)
-
31. A method for providing a speech output for a speech-enabled application, the method comprising:
-
receiving from the speech-enabled application a text input comprising a text transcription of a desired speech output; selecting, using at least one computer system, an audio recording of a speaker speaking a plurality of words, the audio recording corresponding to at least a first portion of the text input; and providing for the speech-enabled application a speech output comprising the audio recording. - View Dependent Claims (32, 33, 34, 35)
-
-
36. A method for providing a speech output for a speech-enabled application, the method comprising:
-
receiving at least one input specifying a desired speech output; selecting, using at least one computer system, at least one audio recording corresponding to at least a first portion of the desired speech output, the at least one audio recording being selected based at least in part on at least one constraint regarding a desired contrastive stress pattern in the desired speech output, the at least one constraint being indicated by metadata associated with the at least one audio recording; and providing for the speech-enabled application a speech output comprising the at least one audio recording.
-
-
37. At least one non-transitory computer-readable storage medium encoded with a plurality of computer-executable instructions that, when executed, perform a method for providing a speech output for a speech-enabled application, the method comprising:
-
receiving from the speech-enabled application a text input comprising a text transcription of a desired speech output; selecting an audio recording of a speaker speaking a plurality of words, the audio recording corresponding to at least a first portion of the text input; and providing for the speech-enabled application a speech output comprising the audio recording. - View Dependent Claims (38, 39, 40, 41)
-
Specification