Systems and Methods for Determining the Language to Use for Speech Generated by a Text to Speech Engine

US 20130166278A1
Filed: 02/15/2013
Published: 06/27/2013
Est. Priority Date: 03/09/2009
Status: Active Grant

First Claim

Patent Images

1. A method for synthesizing speech content based on a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:

identifying a respective language associated with each respective one of the plurality of text strings;

distinguishing at least two different identified languages; and

using one or more rules to select a single language for generating the speech content for the plurality of text strings.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets, where each text string can be associated with a native string language (e.g., the language of the string). When several text strings are associated with at least two distinct languages, a series of rules can be applied to the strings to identify a single voice language to use for synthesizing the speech content from the text strings. In some embodiments, a prioritization scheme can be applied to the text strings to identify the more important text strings. The rules can include, for example, selecting a voice language based on the prioritization scheme, a default language associated with an electronic device, the ability of a voice language to speak text in a different language, or any other suitable rule.

Citations

16 Claims

1. A method for synthesizing speech content based on a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
- identifying a respective language associated with each respective one of the plurality of text strings;
  
  distinguishing at least two different identified languages; and
  
  using one or more rules to select a single language for generating the speech content for the plurality of text strings.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1, further comprising assigning a priority to each of the plurality of text strings, including at assigning at least high priority text string and at least one low priority text string.
  - 3. The method of claim 2, further comprising:
    - identifying a default language associated with an electronic device providing the speech content;
      
      determining whether the identified languages are speakable in the default language; and
      
      in accordance with a determination that the identified languages are speakable in the default language, generating the speech content using the default language.
  - 4. The method of claim 3, wherein determining further comprises:
    - determining whether a minimum amount of speech content to be generated in the default language from a particular text string in a language other than the default language will be understandable.
  - 5. The method of claim 2, further comprising:
    - determining whether the identified language of a low priority text string is speakable in the language of a high priority text string; and
      
      in accordance with a determination that the identified language of the low priority text string is speakable in the language of the high priority text string, generating the speech content using the language of the high priority text string.
  - 6. The method of claim 2, further comprising:
    - determining whether the identified language of a low priority text string is speakable in the identified language of a high priority text string; and
      
      in accordance with a determination that the identified language of the low priority text string is not speakable in the language of the high priority text string, generating the speech content using a default language associated with an electronic device providing the speech content.
  - 7. The method of claim 6, further comprising:
    - determining whether the low priority text string is speakable in the default language; and
      
      in accordance with a determination that the low priority text string is not speakable in the default language, generating the speech content using the language of the high priority text string.
  - 8. The method of claim 7, further comprising:
    - determining whether generating the speech content associated with the low priority text string using the identified language of the high priority text string will result in no audio output; and
      
      in accordance with a determination that generating the speech content associated with the low priority text string using the identified language of the high priority text string will result in no audio output, generating the speech content using an arbitrary language.

9. A method for generating speech content for a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
- identifying a plurality of text strings;
  
  assigning a respective rank to each respective one of the plurality of text strings;
  
  detecting that a language of a lower rank text string and a higher rank text string are different;
  
  determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
  
  in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.
- View Dependent Claims (10, 11, 12)
- - 10. The method of claim 9, wherein:
    - the low priority text string comprises an artist name; and
      
      the high priority text string comprises at least one of a track name and an album name.
  - 11. The method of claim 9, further comprising:
    - identifying a default language associated with a personal electronic device providing the speech content;
      
      determining whether the language of the lower rank text string and the language of the higher rank text string are both speakable in the default language; and
      
      in accordance with a determination that the language of the lower rank text string and the language of the higher rank text string are both speakable in the default language, generating speech content for the lower rank text string and the higher rank text string using the default language.
  - 12. The method of claim 11, wherein:
    - the language of at least one of the lower rank text string and the higher rank text string is the default language.

13. Computer readable media for synthesizing speech content based on a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
- identifying a respective language associated with each respective one of the plurality of text strings;
  
  distinguishing at least two different identified languages; and
  
  using one or more rules to select a single language for generating the speech content for the plurality of text strings.

14. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
- identifying a respective language associated with each respective one of the plurality of text strings;
  
  distinguishing at least two different identified languages; and
  
  using one or more rules to select a single language for generating the speech content for the plurality of text strings.

15. Computer readable media for generating speech content for a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
- identifying a plurality of text strings;
  
  assigning a respective rank to each respective one of the plurality of text strings;
  
  detecting that a language of a lower rank text string and a higher rank text string are different;
  
  determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
  
  in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.

16. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
- identifying a plurality of text strings;
  
  assigning a respective rank to each respective one of the plurality of text strings;
  
  detecting that a language of a lower rank text string and a higher rank text string are different;
  
  determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
  
  in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Apple Inc.
Original Assignee
Apple Inc.
Inventors
James, Bryan, Herman, Kenneth, Rogers, Matthew L.

Granted Patent

US 8,751,238 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/8
CPC Class Codes

G10L 13/033 Voice editing, e.g. manipul...

G10L 15/005 Language recognition

Systems and Methods for Determining the Language to Use for Speech Generated by a Text to Speech Engine

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

Systems and Methods for Determining the Language to Use for Speech Generated by a Text to Speech Engine

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links