SYSTEMS AND METHODS FOR SELECTIVE TEXT TO SPEECH SYNTHESIS
First Claim
1. A method for selectively synthesizing speech based on a text string, the method comprising:
parsing through the text string and selecting a first subset of text for which to synthesize speech, and a second subset of text for which not to synthesize speech; and
with respect to only the first subset of text, determining a first set of phonemes in a native language of the text string and converting the first set of phonemes into a second set of phonemes in a target language, the second set of phonemes dictating how to render speech based on the first subset of text.
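The two steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything named in it is an assumption made for illustration: the rule that parenthesized or bracketed text forms the second (unspoken) subset, the toy lexicon `NATIVE_LEXICON`, the phoneme symbols, and the `PHONEME_MAP` table from native-language to target-language phonemes.

```python
import re

# Hypothetical selection rule: text in parentheses or brackets is NOT spoken.
SKIP_PATTERN = re.compile(r"\([^)]*\)|\[[^\]]*\]")

# Hypothetical toy lexicon: word -> phonemes in the native language of the text.
NATIVE_LEXICON = {
    "la": ["l", "a"],
    "vie": ["v", "i"],
    "en": ["A~"],
    "rose": ["R", "o", "z"],
}

# Hypothetical table: native phoneme -> closest phoneme(s) in the target language.
PHONEME_MAP = {
    "a": ["aa"],
    "i": ["iy"],
    "o": ["ow"],
    "A~": ["aa", "n"],
    "R": ["r"],
}


def select_subsets(text):
    """Split text into a spoken subset and a list of skipped fragments."""
    skipped = SKIP_PATTERN.findall(text)
    spoken = " ".join(SKIP_PATTERN.sub(" ", text).split())
    return spoken, skipped


def native_phonemes(spoken):
    """Look up native-language phonemes; words missing from the toy lexicon are ignored."""
    phonemes = []
    for word in re.findall(r"[a-z']+", spoken.lower()):
        phonemes.extend(NATIVE_LEXICON.get(word, []))
    return phonemes


def to_target(phonemes):
    """Convert native phonemes to target phonemes; unmapped symbols pass through."""
    target = []
    for phoneme in phonemes:
        target.extend(PHONEME_MAP.get(phoneme, [phoneme]))
    return target


def selective_phonemes(text):
    """Apply phoneme conversion to the spoken subset only."""
    spoken, _skipped = select_subsets(text)
    return to_target(native_phonemes(spoken))


if __name__ == "__main__":
    print(select_subsets("La Vie en Rose (Remastered) [Live]"))
    print(selective_phonemes("La Vie en Rose (Remastered) [Live]"))
```

In this sketch the skipped fragments never reach the phoneme stages, which mirrors the claim's "with respect to only the first subset of text" limitation. A real system would replace the toy tables with a pronunciation lexicon and a language-pair mapping.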
Abstract
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
2 Claims
Specification