Systems and Methods for Determining the Language to Use for Speech Generated by a Text to Speech Engine
Abstract
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets, where each text string can be associated with a native string language (e.g., the language of the string). When several text strings are associated with at least two distinct languages, a series of rules can be applied to the strings to identify a single voice language to use for synthesizing the speech content from the text strings. In some embodiments, a prioritization scheme can be applied to the text strings to identify the more important text strings. The rules can include, for example, selecting a voice language based on the prioritization scheme, a default language associated with an electronic device, the ability of a voice language to speak text in a different language, or any other suitable rule.
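The rule cascade described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The `TextString` type, the `SPEAKABLE` table, the rule order, and the convention that a lower `priority` number means a more important string are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextString:
    text: str
    language: str  # native string language, e.g. "en"
    priority: int  # assumed convention: 1 is the most important string


# Hypothetical table: voice language -> text languages it can speak acceptably.
SPEAKABLE = {
    "en": {"en"},
    "fr": {"fr", "en"},
    "de": {"de", "en"},
}


def can_speak(voice_language, text_language):
    # A voice not listed in the table is assumed to speak only its own language.
    return text_language in SPEAKABLE.get(voice_language, {voice_language})


def select_voice_language(strings, device_default):
    """Pick one voice language for a non-empty list of text strings."""
    languages = {s.language for s in strings}
    if len(languages) == 1:
        return next(iter(languages))

    # Rule 1 (assumed order): use the language of the most important string
    # if that voice can speak every other language present.
    top_language = min(strings, key=lambda s: s.priority).language
    if all(can_speak(top_language, lang) for lang in languages):
        return top_language

    # Rule 2: otherwise use the device default if it can speak every string.
    if all(can_speak(device_default, lang) for lang in languages):
        return device_default

    # Rule 3: fall back to the language of the most important string.
    return top_language
```

For example, with a French title at priority 1 and an English album name at priority 2, the sketch selects `"fr"`, because the hypothetical French voice is listed as able to speak English text.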
16 Claims
1. A method for synthesizing speech content based on a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
identifying a respective language associated with each respective one of the plurality of text strings;
distinguishing at least two different identified languages; and
using one or more rules to select a single language for generating the speech content for the plurality of text strings.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A method for generating speech content for a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
identifying a plurality of text strings;
assigning a respective rank to each respective one of the plurality of text strings;
detecting that a language of a lower rank text string and a higher rank text string are different;
determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.
Dependent claims: 10, 11, 12.
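The steps of claim 9 can be read as a small decision procedure, sketched below under stated assumptions. `RankedString`, `is_speakable`, `voice_for_pair`, and the speakability table are hypothetical names, and rank 1 is assumed to be the highest rank. The claim does not say what happens when the lower rank language is not speakable, so the sketch returns `None` in that case rather than guessing.

```python
from typing import NamedTuple, Optional


class RankedString(NamedTuple):
    text: str
    language: str
    rank: int  # assumed convention: 1 is the highest rank


def is_speakable(text_language, voice_language, table):
    # table maps a voice language to the other text languages it can speak.
    return (text_language == voice_language
            or text_language in table.get(voice_language, set()))


def voice_for_pair(a, b, table) -> Optional[str]:
    """Return the voice language for two ranked strings, or None if undecided."""
    higher, lower = sorted((a, b), key=lambda s: s.rank)

    # No conflict to resolve when both strings share a language.
    if lower.language == higher.language:
        return higher.language

    # The languages differ: speak both strings in the higher rank language
    # only if the lower rank language is speakable in it.
    if is_speakable(lower.language, higher.language, table):
        return higher.language

    # Not covered by the claim language; left to other rules.
    return None
```

A caller would then generate speech for both strings with the returned voice language, or fall through to some other rule when the result is `None`.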
13. Computer readable media for synthesizing speech content based on a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
identifying a respective language associated with each respective one of the plurality of text strings;
distinguishing at least two different identified languages; and
using one or more rules to select a single language for generating the speech content for the plurality of text strings.
14. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
identifying a respective language associated with each respective one of the plurality of text strings;
distinguishing at least two different identified languages; and
using one or more rules to select a single language for generating the speech content for the plurality of text strings.
15. Computer readable media for generating speech content for a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
identifying a plurality of text strings;
assigning a respective rank to each respective one of the plurality of text strings;
detecting that a language of a lower rank text string and a higher rank text string are different;
determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.
16. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
identifying a plurality of text strings;
assigning a respective rank to each respective one of the plurality of text strings;
detecting that a language of a lower rank text string and a higher rank text string are different;
determining whether the language of the lower rank text string is speakable in the language of the higher rank text string; and
in accordance with a determination that the language of the lower rank text string is speakable in the language of the higher rank text string, generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string.
Specification