Method and system for identifying and correcting accent-induced speech recognition difficulties
First Claim
1. A method comprising:
generating a first speech recognition result for a first speech input in a first language provided by a speaker; and
outputting the first speech recognition result;
wherein generating the first speech recognition result comprises:
identifying a sequence of phonemes corresponding to the first speech input using a native acoustic model specific to a second language different from the first language, wherein the second language is the speaker's native language; and
matching the sequence of phonemes to one or more speech segments and/or words using a lexicon model specific to the first language and not to the second language.
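The matching step recited in the claim can be illustrated with a minimal sketch. It assumes the phoneme sequence has already been produced by the second-language acoustic model and is available as a list of strings. The lexicon contents, the phoneme labels, and the function name `match_phonemes` are hypothetical and are not taken from the patent; the dynamic-programming segmentation is one possible way to perform the match, not necessarily the claimed one.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical first-language (English) lexicon, keyed by the phoneme labels
# that a second-language acoustic model might emit for accented speech.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("z", "i", "s"): "this",
    ("i", "z"): "is",
    ("g", "u", "d"): "good",
}


def match_phonemes(
    phonemes: List[str], lexicon: Dict[Tuple[str, ...], str]
) -> Optional[List[str]]:
    """Segment a phoneme sequence into lexicon words.

    Returns the word list, or None when no complete segmentation exists.
    """
    n = len(phonemes)
    max_len = max((len(key) for key in lexicon), default=0)
    # best[i] holds a word list covering phonemes[:i], or None if unreachable.
    best: List[Optional[List[str]]] = [None] * (n + 1)
    best[0] = []
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            prefix = best[start]
            word = lexicon.get(tuple(phonemes[start:end]))
            if prefix is not None and word is not None:
                best[end] = prefix + [word]
                break
    return best[n]


words = match_phonemes(["z", "i", "s", "i", "z", "g", "u", "d"], LEXICON)
```

In this toy setup the accented input is segmented into first-language words even though the phoneme labels come from a different language's sound inventory, which is the pairing the claim describes.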
2 Assignments
0 Petitions
Abstract
A system for use in speech recognition includes an acoustic module accessing a plurality of distinct-language acoustic models, each based upon a different language; a lexicon module accessing at least one lexicon model; and a speech recognition output module. The speech recognition output module generates a first speech recognition output using a first model combination that combines one of the plurality of distinct-language acoustic models with the at least one lexicon model. In response to a threshold determination, the speech recognition output module generates a second speech recognition output using a second model combination that combines a different one of the plurality of distinct-language acoustic models with the at least one distinct-language lexicon model.
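The model-combination logic in the abstract can be sketched as follows. The abstract does not say what quantity the threshold determination is applied to, so this sketch assumes it is a confidence score returned by the acoustic model, with the second combination used when that score falls below the threshold. The `Recognizer` class, the type aliases, the default threshold of 0.6, and the stub models are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces: an acoustic model maps audio to (phonemes, confidence);
# a lexicon model maps a phoneme sequence to recognized text.
AcousticModel = Callable[[bytes], Tuple[List[str], float]]
LexiconModel = Callable[[List[str]], str]


@dataclass
class Recognizer:
    acoustic_models: Dict[str, AcousticModel]  # one per distinct language
    lexicon: LexiconModel
    threshold: float = 0.6  # assumed value; not specified in the abstract

    def recognize(self, audio: bytes, first: str, second: str) -> Tuple[str, str]:
        """Return (text, language of the acoustic model that was used)."""
        # First model combination: acoustic model `first` plus the lexicon.
        phonemes, confidence = self.acoustic_models[first](audio)
        used = first
        # Assumed threshold determination: low confidence triggers the
        # second combination, which swaps in a different acoustic model.
        if confidence < self.threshold:
            phonemes, _ = self.acoustic_models[second](audio)
            used = second
        return self.lexicon(phonemes), used


# Stub models standing in for trained ones.
stub_models: Dict[str, AcousticModel] = {
    "en": lambda audio: (["z", "i", "s"], 0.3),
    "de": lambda audio: (["th", "i", "s"], 0.9),
}
recognizer = Recognizer(stub_models, lexicon=lambda p: "-".join(p))
text, used = recognizer.recognize(b"", first="en", second="de")
```

Only the acoustic model changes between the two combinations here; the lexicon model stays fixed, matching the abstract's description of pairing different acoustic models with the lexicon.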
399 Citations
23 Claims
1. A method comprising:
generating a first speech recognition result for a first speech input in a first language provided by a speaker; and
outputting the first speech recognition result;
wherein generating the first speech recognition result comprises:
identifying a sequence of phonemes corresponding to the first speech input using a native acoustic model specific to a second language different from the first language, wherein the second language is the speaker's native language; and
matching the sequence of phonemes to one or more speech segments and/or words using a lexicon model specific to the first language and not to the second language.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system comprising a combination of hardware and software that implements:
an audio capture module configured to capture a first speech input in a first language from a speaker; and
a speech recognition module configured to generate a first speech recognition result for the first speech input, and output the first speech recognition result;
wherein generating the first speech recognition result comprises:
identifying a sequence of phonemes corresponding to the first speech input using an acoustic model specific to a second language different from the first language, wherein the acoustic model is trained on speech of a training speaker, wherein the second language is the training speaker's native language; and
matching the sequence of phonemes to one or more speech segments and/or words using a lexicon model specific to the first language and not to the second language.
Dependent claims: 10, 11, 12, 13, 14, 15
16. An article of manufacture comprising a computer-readable storage medium storing computer instructions for:
generating a first speech recognition result for a first speech input in a first language; and
outputting the first speech recognition result;
wherein generating the first speech recognition result comprises:
identifying a sequence of phonemes corresponding to the first speech input using an acoustic model specific to a second language different from the first language, wherein the acoustic model is trained on speech of a training speaker, wherein the second language is the training speaker's native language; and
matching the sequence of phonemes to one or more speech segments and/or words using a lexicon model specific to the first language and not to the second language.
Dependent claims: 17, 18, 19, 20, 21, 22, 23
Specification