SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS
First Claim
1. A method for determining a native language of a text string associated with a media asset, the method comprising:
- undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence, each of which correspond to a probability of occurrence of the text string in a particular language, wherein the probability of occurrence of the text string in the particular language is based partly on a type of text string associated with the media asset; and
determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence.
1 Assignment
0 Petitions
Accused Products
Abstract
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
-
Citations
16 Claims
-
1. A method for determining a native language of a text string associated with a media asset, the method comprising:
-
undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence, each of which correspond to a probability of occurrence of the text string in a particular language, wherein the probability of occurrence of the text string in the particular language is based partly on a type of text string associated with the media asset; and determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16)
-
-
11. The method of claim 11 wherein the probability of occurrence of the word in the particular language is based partly on the type of text string associated with the media asset.
Specification