DETERMINING TEXT TO SPEECH PRONUNCIATION BASED ON AN UTTERANCE FROM A USER
First Claim
Patent Images
1. A speech-based system comprising:
- at least one storage device that stores;
an input text comprising at least one word of a first language;
information indicative of a first pronunciation of the at least one word of the first language;
information indicative of a second pronunciation of the at least one word of the first language;
an automatic speech recognition (ASR) system configured to;
receive at least one utterance from a user, the utterance comprising the at least one word of the first language; and
determine whether the at least one utterance from the user used the first pronunciation or the second pronunciation of the at least one word of the first language; and
a text to speech (TTS) system configured to generate an audio speech output comprising the at least one word of the first language of the input text, and to pronounce the at least one word in the audio speech output using the pronunciation the ASR system determined the user used in the at least one utterance.
3 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon includes native phonetic transcriptions (base forms) for non-native (foreign) words which are automatically derived from non-native phonetic transcriptions of the non-native words.
250 Citations
20 Claims
-
1. A speech-based system comprising:
-
at least one storage device that stores; an input text comprising at least one word of a first language; information indicative of a first pronunciation of the at least one word of the first language; information indicative of a second pronunciation of the at least one word of the first language; an automatic speech recognition (ASR) system configured to; receive at least one utterance from a user, the utterance comprising the at least one word of the first language; and determine whether the at least one utterance from the user used the first pronunciation or the second pronunciation of the at least one word of the first language; and a text to speech (TTS) system configured to generate an audio speech output comprising the at least one word of the first language of the input text, and to pronounce the at least one word in the audio speech output using the pronunciation the ASR system determined the user used in the at least one utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising acts, performed by at least one processor, of:
-
receiving at least one utterance from a user, the utterance comprising at least one word of the first language, wherein the at least one word has a first pronunciation and a second pronunciation; determining whether the utterance from the user used the first pronunciation or the second pronunciation of the at least one word of the first language; and generating an audio speech output that comprises the at least one word of the first language of the input text and that pronounces the at least one word using the pronunciation the user was determined to have used in the at least one utterance. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. At least one program storage device having encoded thereon executable program code that, when executed by at least one processor, performs a method comprising acts of:
-
receiving at least one utterance from a user, the utterance comprising at least one word of the first language, wherein the at least one word has a first pronunciation and a second pronunciation; determining whether the utterance from the user used the first pronunciation or the second pronunciation of the at least one word of the first language; and generating an audio speech output that comprises the at least one word of the first language of the input text and that pronounces the at least one word using the pronunciation the user was determined to have used in the at least one utterance. - View Dependent Claims (17, 18, 19, 20)
-
Specification