Systems and methods for determining the language to use for speech generated by a text to speech engine
First Claim
1. A method for synthesizing speech content based on a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
- identifying languages associated with each of the plurality of strings;
distinguishing at least two different identified languages; and
applying a series of rules to the plurality of text strings to select a single language for generating the speech content for the plurality of text strings.
1 Assignment
0 Petitions
Accused Products
Abstract
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets, where each text string can be associated with a native string language (e.g., the language of the string). When several text strings are associated with at least two distinct languages, a series of rules can be applied to the strings to identify a single voice language to use for synthesizing the speech content from the text strings. In some embodiments, a prioritization scheme can be applied to the text strings to identify the more important text strings. The rules can include, for example, selecting a voice language based on the prioritization scheme, a default language associated with an electronic device, the ability of a voice language to speak text in a different language, or any other suitable rule.
646 Citations
56 Claims
-
1. A method for synthesizing speech content based on a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
-
identifying languages associated with each of the plurality of strings; distinguishing at least two different identified languages; and applying a series of rules to the plurality of text strings to select a single language for generating the speech content for the plurality of text strings. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
-
identifying languages associated with each of the plurality of strings; distinguishing at least two different identified languages; and applying a series of rules to the plurality of text strings to select a single language for generating the speech content for the plurality of text strings. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. Computer readable media for synthesizing speech content based on a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
-
identifying languages associated with each of the plurality of strings; distinguishing at least two different identified languages; and applying a series of rules to the plurality of text strings to select a single language for generating the speech content for the plurality of text strings. - View Dependent Claims (18, 19, 20)
-
-
21. A host device operative to provide speech content to an electronic device, comprising:
-
communications circuitry operative to receive; a set of text strings for which speech content is requested; and a default language associated with the electronic device; and a processor operative to; identify a title text string from the received set of text strings, wherein the title text string is associated with a title text string language; identify an artist text string from the received set of text strings, wherein the artist text string is associated with an artist text string language; determine that at least two of the title text string language, album text string language, and default language are different; and select one of the title text string language, album text string language, and default language for generating speech content for the title text string and album text string. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28)
-
-
29. A method for providing speech content, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
-
receiving a set of text strings for which speech content is requested; receiving a default language associated with the electronic device; identifying a title text string from the received set of text strings, wherein the title text string is associated with a title text string language; identifying an artist text string from the received set of text strings, wherein the artist text string is associated with an artist text string language; determining that at least two of the title text string language, album text string language, and default language are different; and selecting one of the title text string language, album text string language, and default language for generating speech content for the title text string and album text string. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36)
-
-
37. The Computer readable media for providing speech content, the computer readable media comprising computer readable instructions recorded thereon for:
-
receiving a set of text strings for which speech content is requested; receiving a default language associated with the electronic device; identify a title text string from the received set of text strings, wherein the title text string is associated with a title text string language; identify an artist text string from the received set of text strings, wherein the artist text string is associated with an artist text string language; determine that at least two of the title text string language, album text string language, and default language are different; and select one of the title text string language, album text string language, and default language for generating speech content for the title text string and album text string. - View Dependent Claims (38, 39, 40, 41, 42, 43, 44)
-
-
45. A method for generating speech content for a plurality of text strings, the method implemented by at least one computing device having at least one processor and at least one program stored in memory, the method comprising:
-
identifying a plurality of text strings; assigning a rank to each of the plurality of text strings; detecting that a language of a lower rank text string and a higher rank text string are different; determining that the language of the lower rank text string is speakable in the language of the higher rank text string; and generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string. - View Dependent Claims (46, 47, 48)
-
-
49. An electronic device having at least one processor and memory storing at least one program for execution by the at least one processor, the at least one program including instructions for:
-
identifying a plurality of text strings; assigning a rank to each of the plurality of text strings; detecting that a language of a lower rank text string and a higher rank text string are different; determining that the language of the lower rank text string is speakable in the language of the higher rank text string; and generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string. - View Dependent Claims (50, 51, 52)
-
-
53. The Computer readable media for generating speech content for a plurality of text strings, the computer readable media comprising computer readable instructions recorded thereon for:
-
identifying a plurality of text strings; assigning a rank to each of the plurality of text strings; detecting that a language of a lower rank text string and a higher rank text string are different; determining that the language of the lower rank text string is speakable in the language of the higher rank text string; and generating speech content for at least the lower rank text string and the higher rank text string using the language of the higher rank text string. - View Dependent Claims (54, 55, 56)
-
Specification