System and method for distributed text-to-speech synthesis and intelligibility
First Claim
1. A system for distributed text-to-speech synthesis comprising:
a guest device configured for transmitting text input in the form of a text string;
a host device configured to receive the text string and process the text string by converting the text string to an audio index representation of an audio file associated with the text string, the host device comprising:
a text analyzer configurable to process the text string to produce phonetic information and linguistic information;
a prosody analyzer configurable to generate prosodic information based on at least the phonetic information and linguistic information,
wherein the converting at the host device being based on at least the phonetic information and prosodic information, and includes identifying audio units from a first audio unit synthesis inventory on the host device,
wherein the guest device comprises:
a second audio unit synthesis inventory where audio units are selected from and selection of audio units from the second audio unit synthesis inventory being based on the audio index representation sent from the host device; and
a unit-concatenative module for concatenating the selected audio units.
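The guest-side elements of the claim (a second audio unit inventory and a unit-concatenative module) can be illustrated with a minimal Python sketch. It is not taken from the patent: the inventory contents, the use of integer indices, and the names `GUEST_INVENTORY`, `select_units`, `concatenate`, and `synthesize_on_guest` are all assumptions made for illustration, and short lists of integers stand in for recorded audio samples.

```python
from typing import Dict, List

# Hypothetical guest-side inventory: unit index -> audio samples.
# A real inventory would hold recorded speech units; these are toy values.
GUEST_INVENTORY: Dict[int, List[int]] = {
    0: [0, 0],        # stand-in for a short silence
    1: [10, 20, 30],  # stand-in for one speech unit
    2: [40, 50],      # stand-in for another speech unit
}


def select_units(audio_index: List[int],
                 inventory: Dict[int, List[int]]) -> List[List[int]]:
    """Select units from the guest inventory using the indices sent by the host."""
    # An index missing from the inventory raises KeyError in this sketch.
    return [inventory[i] for i in audio_index]


def concatenate(units: List[List[int]]) -> List[int]:
    """Join the selected units end to end into one sample sequence."""
    samples: List[int] = []
    for unit in units:
        samples.extend(unit)
    return samples


def synthesize_on_guest(audio_index: List[int]) -> List[int]:
    """Guest-side step: index representation in, audio samples out."""
    return concatenate(select_units(audio_index, GUEST_INVENTORY))
```

In this sketch the guest receives only a list of small integers, so no audio data crosses the link. It works only if the host and guest inventories agree on what each index means.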
Abstract
A method and system for distributed text-to-speech synthesis and intelligibility, and more particularly for distributed text-to-speech synthesis on handheld portable computing devices, which can be used, for example, to generate intelligible audio prompts that help a user interact with a user interface of the handheld portable computing device. The distributed text-to-speech system 70 receives a text string from a guest device and comprises a text analyzer 72, a prosody analyzer 74, a database 14 to which the text analyzer and prosody analyzer refer, and a speech synthesizer 80. Elements of the speech synthesizer 80 are resident on both the host device and the guest device. An audio index representation of the audio file associated with the text string is produced at the host device and transmitted to the guest device, where it is used to produce the audio file.
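The host-side flow described in the abstract (text analyzer, then prosody analyzer, then identification of audio units by index) can be sketched in Python as follows. This is a toy illustration under stated assumptions, not the patented implementation: the two-word `LEXICON`, the three tone labels, the question-mark rule, and every function name are hypothetical, and a real system would rely on far richer phonetic, linguistic, and prosodic models.

```python
from itertools import product
from typing import Dict, List, Tuple

# Hypothetical pronunciation lexicon (word -> phonemes); an assumption for illustration.
LEXICON: Dict[str, List[str]] = {
    "play": ["P", "L", "EY"],
    "song": ["S", "AO", "NG"],
}
TONES = ["flat", "fall", "rise"]  # hypothetical prosodic labels


def build_host_inventory() -> Dict[Tuple[str, str], int]:
    """Assign one index to each (phoneme, tone) unit in the toy host inventory."""
    phones = sorted({p for phonemes in LEXICON.values() for p in phonemes})
    return {unit: i for i, unit in enumerate(product(phones, TONES))}


def text_analyzer(text: str) -> Tuple[List[List[str]], Dict[str, bool]]:
    """Produce phonetic information (phonemes per word) and linguistic information."""
    is_question = text.strip().endswith("?")
    words = [w.strip("?.!,").lower() for w in text.split()]
    # Words missing from the toy lexicon raise KeyError in this sketch.
    return [LEXICON[w] for w in words], {"is_question": is_question}


def prosody_analyzer(phonemes: List[List[str]],
                     linguistic: Dict[str, bool]) -> List[Tuple[str, str]]:
    """Attach a tone to each phoneme: the last word rises or falls, the rest stay flat."""
    final_tone = "rise" if linguistic["is_question"] else "fall"
    tagged: List[Tuple[str, str]] = []
    for position, word in enumerate(phonemes):
        tone = final_tone if position == len(phonemes) - 1 else "flat"
        tagged.extend((p, tone) for p in word)
    return tagged


def text_to_audio_index(text: str) -> List[int]:
    """Host-side conversion: text string in, list of unit indices out."""
    inventory = build_host_inventory()
    phonemes, linguistic = text_analyzer(text)
    return [inventory[unit] for unit in prosody_analyzer(phonemes, linguistic)]
```

Calling `text_to_audio_index("Play song?")` returns one integer per phoneme. That list of integers, rather than any audio, is what the host would send back to the guest device in this sketch.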
3 Claims
1. A system for distributed text-to-speech synthesis comprising:
a guest device configured for transmitting text input in the form of a text string;
a host device configured to receive the text string and process the text string by converting the text string to an audio index representation of an audio file associated with the text string, the host device comprising:
a text analyzer configurable to process the text string to produce phonetic information and linguistic information;
a prosody analyzer configurable to generate prosodic information based on at least the phonetic information and linguistic information,
wherein the converting at the host device being based on at least the phonetic information and prosodic information, and includes identifying audio units from a first audio unit synthesis inventory on the host device,
wherein the guest device comprises:
a second audio unit synthesis inventory where audio units are selected from and selection of audio units from the second audio unit synthesis inventory being based on the audio index representation sent from the host device; and
a unit-concatenative module for concatenating the selected audio units.
Specification