SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS

US 20100082329A1
Filed: 09/29/2008
Published: 04/01/2010
Est. Priority Date: 09/29/2008
Status: Active Grant

First Claim

Patent Images

1. A method for determining a native language of a text string associated with a media asset, the method comprising:

undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence, each of which correspond to a probability of occurrence of the text string in a particular language, wherein the probability of occurrence of the text string in the particular language is based partly on a type of text string associated with the media asset; and

determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

Citations

16 Claims

1. A method for determining a native language of a text string associated with a media asset, the method comprising:
- undergoing one or more N-gram analyses at a word level to determine a plurality of probabilities of occurrence, each of which correspond to a probability of occurrence of the text string in a particular language, wherein the probability of occurrence of the text string in the particular language is based partly on a type of text string associated with the media asset; and
  
  determining that the native language of the text string is a language that is associated with the highest probability of occurrence out of the plurality of probabilities of occurrence.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16)
- - 2. The method of claim 1 wherein the one or more N-gram analyses at a word level comprises:
    - for each group of a number N of words in the text string, retrieving a plurality of probabilities, each of which corresponds to a particular language and represents the probability of occurrence of that group of N words in that particular language; and
      
      for each language, calculating a total sum of the retrieved probabilities.
  - 3. The method of claim 2 wherein determining the native language of the text string comprises determining that the native language is a language having the highest calculated total sum.
  - 4. The method of claim 1 wherein the one or more N-gram analyses at a word level comprises a unigram analysis wherein, for each word in the text string, a plurality of probabilities are retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that word in that particular language.
  - 5. The method of claim 1 wherein the one or more N-gram analyses at a word level comprises a bigram analysis wherein, for each group of two adjacent words in the text string, a plurality of probabilities are retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that group of words in that particular language.
  - 6. The method of claim 1 wherein the one or more N-gram analyses at a word level comprises a trigram analysis wherein, for each group of three adjacent words in the text string, a plurality of probabilities is retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that group of words in that particular language.
  - 7. The method of claim 1 wherein the one or more N-gram analyses at a word level comprises any combination of a unigram analysis, a bigram analysis and a trigram analysis, wherein total probability sums are calculated under each such analysis and are weighted differently.
  - 8. The method of claim 1 further comprising separating the text string into distinct words.
  - 9. The method of claim 1 further comprising determining whether each word in the text string is in vocabulary by consulting a table that includes a list of words that are known in all known languages.
  - 10. The method of claim 9 wherein, for each word that is not in vocabulary, undergoing one or more N-gram analyses at a character level to determine a plurality of probabilities of occurrence, each of which corresponding to a probability of occurrence of the word in a particular language.
  - 12. The method of claim 10 wherein the one or more N-gram analyses at a character level comprises:
    - for each group of a number N of characters in the word that is not in vocabulary, retrieving a plurality of probabilities, each of which corresponds to a particular language and represents the probability of occurrence of that group of N characters in that particular language; and
      
      for each language, calculating a total sum of the retrieved probabilities.
  - 13. The method of claim 10 wherein the one or more N-gram analyses at a character level comprises a unigram analysis wherein, for each character in the word that is not in vocabulary, a plurality of probabilities are retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that character in that particular language.
  - 14. The method of claim 10 wherein the one or more N-gram analyses at a character level comprises a bigram analysis wherein, for each group of two adjacent characters in the word that is not in vocabulary, a plurality of probabilities are retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that group of characters in that particular language.
  - 15. The method of claim 10 wherein the one or more N-gram analyses at a character level comprises a trigram analysis wherein, for each group of three adjacent characters in the word that is not in vocabulary, a plurality of probabilities are retrieved, each of which corresponds to a particular language and represents the probability of occurrence of that group of characters in that particular language.
  - 16. The method of claim 10 wherein the one or more N-gram analyses at a character level comprises any combination of a unigram analysis, a bigram analysis and a trigram analysis, wherein total probability sums are calculated under each such analysis and are weighted differently.

11. The method of claim 11 wherein the probability of occurrence of the word in the particular language is based partly on the type of text string associated with the media asset.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Apple Inc.
Original Assignee
Apple Inc.
Inventors
Henton, Caroline, Lenzo, Kevin, Naik, Devang, Silverman, Kim

Granted Patent

US 8,583,418 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/8
CPC Class Codes

G10L 13/08 Text analysis or generation...

G10L 15/005 Language recognition

SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links