Systems and methods of detecting language and natural language strings for text to speech synthesis
First Claim
1. A method for determining a native language of a text string associated with metadata of a media asset, the method comprising:
- at an electronic device comprising a processor and memory storing instructions for execution by the processor;
undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence of the text string, where each of the probabilities of occurrence correspond to a probability of occurrence of the text string in a particular language of a plurality of languages, wherein, for each language, the one or more N-gram analyses are based on a first set of probabilities of occurrence of words if the text string corresponds to a first type of metadata field associated with the media asset, and are based on a second set of probabilities of occurrence of words if the text string corresponds to a second type of metadata field associated with the media asset; and
determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence.
1 Assignment
0 Petitions
Accused Products
Abstract
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
562 Citations
20 Claims
-
1. A method for determining a native language of a text string associated with metadata of a media asset, the method comprising:
at an electronic device comprising a processor and memory storing instructions for execution by the processor; undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence of the text string, where each of the probabilities of occurrence correspond to a probability of occurrence of the text string in a particular language of a plurality of languages, wherein, for each language, the one or more N-gram analyses are based on a first set of probabilities of occurrence of words if the text string corresponds to a first type of metadata field associated with the media asset, and are based on a second set of probabilities of occurrence of words if the text string corresponds to a second type of metadata field associated with the media asset; and determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
Specification