Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
First Claim
1. A method for generating base forms for non-native language in a speech-based system trained for processing a native language, the method comprising:
receiving input textual data containing both native language and non-native language words;
identifying the native language and non-native language words within the textual data;
generating a native phonetic transcription of the native language word using phonetic units of the native language;
generating a non-native phonetic transcription of the non-native language word using phonetic units of the non-native language;
generating a native pronunciation of the non-native language word using phonetic units of the native language by mapping the phonetic units of the non-native phonetic transcription to acoustically similar phonetic units of the native language; and
storing the input textual data with the corresponding native phonetic transcription of the native language word and the mapped native pronunciation of the non-native language word in a native phonetic lexicon.
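The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the dictionaries `NATIVE_G2P`, `NON_NATIVE_G2P`, and `PHONE_MAP`, the phone symbols, and the function names are all hypothetical, and table lookups stand in for whatever language identification, transcription, and acoustic-similarity mapping a real system would use.

```python
# Hypothetical toy data; none of these entries or symbols come from the patent.
# Native (base) language transcriptions, using native phonetic units.
NATIVE_G2P = {
    "call": ["K", "AO", "L"],
    "mister": ["M", "IH", "S", "T", "ER"],
}
# Non-native (foreign) transcriptions, using non-native phonetic units.
NON_NATIVE_G2P = {
    "schmidt": ["S", "m", "I", "t"],
}
# Assumed mapping from each non-native unit to acoustically similar native units.
PHONE_MAP = {
    "S": ["SH"],
    "m": ["M"],
    "I": ["IH"],
    "t": ["T"],
}


def identify_language(word):
    """Label a word as native or non-native (here: a simple table lookup)."""
    key = word.lower()
    if key in NATIVE_G2P:
        return "native"
    if key in NON_NATIVE_G2P:
        return "non-native"
    raise KeyError(f"no transcription available for {word!r}")


def map_to_native(non_native_phones):
    """Rewrite a non-native transcription using native phonetic units."""
    native_phones = []
    for phone in non_native_phones:
        native_phones.extend(PHONE_MAP[phone])
    return native_phones


def build_lexicon(text):
    """Build a native phonetic lexicon from mixed-language input text."""
    lexicon = {}
    for word in text.split():
        key = word.lower()
        if identify_language(word) == "native":
            # Native word: transcribe directly with native units.
            lexicon[key] = NATIVE_G2P[key]
        else:
            # Non-native word: transcribe with non-native units, then map.
            lexicon[key] = map_to_native(NON_NATIVE_G2P[key])
    return lexicon


lexicon = build_lexicon("call mister Schmidt")
```

In this sketch every lexicon entry ends up expressed in native phonetic units, so a recognizer trained only on the native language could consume it without changes to its acoustic models.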
Abstract
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon includes native phonetic transcriptions (base forms) for non-native (foreign) words which are automatically derived from non-native phonetic transcriptions of the non-native words.
Specification