Pronunciation correction of text-to-speech systems between different spoken languages
Abstract
Pronunciation correction for text-to-speech (TTS) systems and speech recognition (SR) systems between different languages is provided. If a word requiring pronunciation by a target-language TTS or SR system is from the same language as the target language but is not found in a lexicon of words from the target language, a letter-to-speech (LTS) rule set of the target language is used to generate a letter-to-speech output for the word for use by the TTS or SR system configured according to the target language. If the word is from a different language than the target language, the phonemes comprising the word according to its native language are mapped to phonemes of the target language. The phoneme mapping is used by the TTS or SR system configured according to the target language for generating or recognizing an audible form of the word according to the target language.
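The decision flow described in the abstract can be sketched as a small dispatch function. This is an illustrative sketch only, not the patented implementation: the function name `choose_pronunciation`, the locale strings, the toy lexicon, and the three stand-in helpers are all assumptions introduced here.

```python
from typing import Callable, Dict, List

Phonemes = List[str]


def choose_pronunciation(
    word: str,
    word_locale: str,
    target_locale: str,
    target_lexicon: Dict[str, Phonemes],
    apply_target_lts: Callable[[str], Phonemes],
    native_phonemes: Callable[[str], Phonemes],
    map_to_target: Callable[[Phonemes], Phonemes],
) -> Phonemes:
    """Pick a phoneme sequence for the target-language TTS or SR system."""
    key = word.lower()
    # Case 1: the word is in the target-language lexicon.
    if key in target_lexicon:
        return target_lexicon[key]
    # Case 2: same language, but not in the lexicon: fall back to LTS rules.
    if word_locale == target_locale:
        return apply_target_lts(word)
    # Case 3: different language: map native phonemes to target phonemes.
    return map_to_target(native_phonemes(word))


# Toy stand-ins (hypothetical data, for illustration only).
LEXICON = {"hello": ["HH", "AH", "L", "OW"]}


def toy_lts(word: str) -> Phonemes:
    # Placeholder rule set: one symbol per letter.
    return [ch.upper() for ch in word]


def toy_native(word: str) -> Phonemes:
    # Placeholder native-language lexicon lookup.
    return {"bach": ["b", "a", "x"]}[word.lower()]


def toy_map(phonemes: Phonemes) -> Phonemes:
    # Placeholder phoneme table from the incoming to the target language.
    table = {"b": "B", "a": "AA", "x": "K"}
    return [table[p] for p in phonemes]
```

Passing the helpers in as callables keeps the branching logic separate from any particular lexicon, rule set, or phoneme table.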
20 Claims
1. A method of correcting pronunciation generation of a language pronunciation system, comprising:
receiving a word according to an incoming language requiring electronic pronunciation according to a target language;
determining whether the word requiring electronic pronunciation is a word of the target language;
if the word requiring electronic pronunciation is not a word of the target language, retrieving language locale for the word;
determining whether a language locale for the word matches a language locale for a pronunciation system responsible for converting the word to speech or recognizing a spoken form of the word;
if a language locale for the word does not match a language locale for a pronunciation system responsible for converting the word to speech or for recognizing an audible form of the word, mapping phonemes comprising the word according to the incoming language to corresponding phonemes associated with the target language; and
passing an output of the mapping of phonemes comprising the word according to the incoming language to corresponding phonemes associated with the target language to the pronunciation system for converting the word to speech or for recognizing an audible form of the word.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
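The phoneme-mapping step recited in claim 1 could be illustrated with a table lookup keyed by the incoming and target locales. This is a minimal sketch under stated assumptions: the table `PHONEME_MAPS`, its entries, and the choice to raise an error for an unmapped phoneme are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

# Hypothetical mapping table: (incoming locale, target locale) -> phoneme map.
# Each incoming phoneme maps to one or more target phonemes.
PHONEME_MAPS: Dict[Tuple[str, str], Dict[str, List[str]]] = {
    ("de-DE", "en-US"): {
        "b": ["B"],
        "a": ["AA"],
        "x": ["K"],        # no direct equivalent assumed; nearest target phoneme
        "ts": ["T", "S"],  # one incoming phoneme split into two target phonemes
    },
}


def map_phonemes(
    phonemes: List[str], incoming_locale: str, target_locale: str
) -> List[str]:
    """Map incoming-language phonemes to target-language phonemes."""
    table = PHONEME_MAPS[(incoming_locale, target_locale)]
    mapped: List[str] = []
    for phoneme in phonemes:
        if phoneme not in table:
            # Assumption: fail loudly rather than guess a substitute.
            raise KeyError(f"no mapping for phoneme {phoneme!r}")
        mapped.extend(table[phoneme])
    return mapped
```

The returned list stands in for the output that would be passed to the target-language pronunciation system.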
9. A computer readable medium containing computer executable instructions which when executed by a computer perform a method of correcting pronunciation generation of a language pronunciation system, comprising:
receiving a word according to an incoming language requiring electronic pronunciation according to a target language;
determining whether the word requiring electronic pronunciation is a word of the target language;
if the word requiring electronic pronunciation is not a word of the target language, retrieving language locale for the word;
determining whether a language locale for the word matches a language locale for a pronunciation system responsible for converting the word to speech or recognizing a spoken form of the word;
if a language locale for the word matches a language locale for a pronunciation system responsible for converting the word to speech or for recognizing an audible form of the word, applying a letter-to-speech (LTS) rules system associated with the target language to the word for generating an audible form of the word according to the LTS rules system; and
passing an output of the application of the LTS rules associated with the target language to the word to the pronunciation system for converting the word to speech or for recognizing an audible form of the word.
Dependent claims: 10, 11, 12
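As a rough illustration of applying letter-to-speech rules when the locales match, the sketch below uses a greedy longest-match over letter groups. The claim does not specify how the LTS rules system works internally, so the matching strategy, the rule table `LTS_RULES`, and the phoneme symbols are all assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical rule table: letter group -> target-language phonemes.
LTS_RULES: Dict[str, List[str]] = {
    "sh": ["SH"],
    "ee": ["IY"],
    "s": ["S"],
    "h": ["HH"],
    "e": ["EH"],
    "i": ["IH"],
    "p": ["P"],
}


def apply_lts(word: str, rules: Dict[str, List[str]] = LTS_RULES) -> List[str]:
    """Convert letters to phonemes, preferring the longest matching rule."""
    text = word.lower()
    max_len = max(len(group) for group in rules)
    phonemes: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest letter group first, then shorter ones.
        for size in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + size]
            if chunk in rules:
                phonemes.extend(rules[chunk])
                i += size
                break
        else:
            # Assumption: letters with no rule are skipped silently.
            i += 1
    return phonemes
```

Trying multi-letter groups first lets "sh" win over the separate rules for "s" and "h".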
13. A computer readable medium containing computer executable instructions which when executed by a computer perform a method of correcting pronunciation generation of a language pronunciation system, comprising:
receiving a word according to an incoming language requiring electronic pronunciation according to a target language;
determining whether the word requiring electronic pronunciation is a word of the target language;
if the word requiring electronic pronunciation is not a word of the target language, retrieving language locale for the word;
determining whether a language locale for the word matches a language locale for a pronunciation system responsible for converting the word to speech or recognizing a spoken form of the word;
if a language locale for the word does not match a language locale for a pronunciation system responsible for converting the word to speech or for recognizing an audible form of the word, mapping phonemes comprising the word according to the incoming language to corresponding phonemes associated with the target language; and
passing an output of the mapping of phonemes comprising the word according to the incoming language to corresponding phonemes associated with the target language to the pronunciation system for converting the word to speech or for recognizing an audible form of the word.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
Specification