System and method for intelligent language switching in automated text-to-speech systems
First Claim
1. A method comprising:
- receiving text having a first part of the text and a second part of the text, wherein the text is associated with one language;
identifying a recipient of speech to be generated from the text;
identifying a location of the recipient of the speech;
when the location comprises a first location;
selecting, via a processor, a first language for the first part of the text and a second language for the second part of the text;
generating, via the processor, first speech from the text, wherein the first speech comprises a first portion corresponding to the first part of the text and a second portion corresponding to the second part of the text, the first portion in the first language and the second portion in the second language; and
communicating the first speech to the recipient; and
when the location comprises a second location that differs from the first location;
generating second speech from the text wherein the second speech comprises the first portion and the second portion both being in a same language; and
communicating the second speech to the recipient.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer-readable storage media for providing for intelligent switching of languages and/or pronunciations in a text-to-speech system. As the system receives text, the text is analyzed to identify portions which should have speech constructed using a pronunciation distinct from the remaining portions of the text. The text-to-speech system uses multiple pronunciation dictionaries to generate and produce speech corresponding to the text, where the identified portions of the text are in a different language or have a different accent from the remainder of the text. Having generated speech corresponding to the text in multiple languages, accents, or dialects, the system combines the portions, then communicates the speech to the text recipient.
44 Citations
18 Claims
-
1. A method comprising:
-
receiving text having a first part of the text and a second part of the text, wherein the text is associated with one language; identifying a recipient of speech to be generated from the text; identifying a location of the recipient of the speech; when the location comprises a first location; selecting, via a processor, a first language for the first part of the text and a second language for the second part of the text; generating, via the processor, first speech from the text, wherein the first speech comprises a first portion corresponding to the first part of the text and a second portion corresponding to the second part of the text, the first portion in the first language and the second portion in the second language; and communicating the first speech to the recipient; and when the location comprises a second location that differs from the first location; generating second speech from the text wherein the second speech comprises the first portion and the second portion both being in a same language; and communicating the second speech to the recipient. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; receiving text having a first part of the text and a second part of the text, wherein the text is associated with one language; identifying a recipient of speech to be generated from the text; identifying a location of the recipient of the speech; when the location comprises a first location; selecting a first language for the first part of the text and a second language for the second part of the text; generating first speech from the text, wherein the first speech comprises a first portion corresponding to the first part of the text and a second portion corresponding to the second part of the text, the first portion in the first language and the second portion in the second language; and communicating the first speech to the recipient; and when the location comprises a second location that differs from the first location; generating second speech from the text wherein the second speech comprises the first portion and the second portion both being in a same language; and communicating the second speech to the recipient. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
-
receiving text having a first part of the text and a second part of the text, wherein the text is associated with one language; identifying a recipient of speech to be generated from the text; identifying a location of the recipient of the speech; when the location comprises a first location; selecting a first language for the first part of the text and a second language for the second part of the text; generating first speech from the text, wherein the first speech comprises a first portion corresponding to the first part of the text and a second portion corresponding to the second part of the text, the first portion in the first language and the second portion in the second language; and communicating the first speech to the recipient; and when the location comprises a second location that differs from the first location; generating second speech from the text wherein the second speech comprises the first portion and the second portion both being in a same language; and communicating the second speech to the recipient. - View Dependent Claims (17, 18)
-
Specification