Speech recognition error identification method and system
First Claim
1. A method for testing and improving the performance of a speech recognition engine, comprising:
- identifying one or more words, phrases or utterances for recognition by a speech recognition engine;
passing the one or more identified words, phrases or utterances to a text-to-speech conversion module;
passing an audio pronunciation of each of the identified one or more words, phrases or utterances from the text-to-speech conversion module to the speech recognition engine;
creating a recognized word, phrase or utterance for each audio pronunciation passed to the speech recognition engine; and
analyzing each recognized word, phrase or utterance to determine how closely each recognized word, phrase or utterance approximates the respective audio pronunciation from which each recognized word, phrase or utterance is derived.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems are provided for testing and improving the performance of a speech recognition system. Words, phrases or utterances are assembled for recognition by one or more speech recognition engines. At a text-to-speech application, an audio pronunciation of each word, phrase or utterance is created. Each audio pronunciation is passed to one or more speech recognition engines. The speech recognition engine analyzes the audio pronunciations and derives one or more words, phrases or utterances from the audio pronunciations. A confidence score is assigned to each of the one or more words, phrases or utterances derived from the audio pronunciations. If the confidence score for any derived word, phrase or utterance is below an acceptable threshold, the results of the speech recognition engine for the word, phrase or utterance are passed to a developer to allow the developer to take corrective action with respect to the speech recognition engine.
-
Citations
24 Claims
-
1. A method for testing and improving the performance of a speech recognition engine, comprising:
-
identifying one or more words, phrases or utterances for recognition by a speech recognition engine;
passing the one or more identified words, phrases or utterances to a text-to-speech conversion module;
passing an audio pronunciation of each of the identified one or more words, phrases or utterances from the text-to-speech conversion module to the speech recognition engine;
creating a recognized word, phrase or utterance for each audio pronunciation passed to the speech recognition engine; and
analyzing each recognized word, phrase or utterance to determine how closely each recognized word, phrase or utterance approximates the respective audio pronunciation from which each recognized word, phrase or utterance is derived. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for testing and improving the performance of a speech recognition engine, comprising:
-
a text-to-speech conversion module operative to receive one or more identified words, phrases or utterances;
to create and to pass an audio pronunciation of each of the identified one or more words, phrases or utterances the speech recognition engine;
the speech recognition engine operative to create a recognized word, phrase or utterance for each audio pronunciation; and
to analyze each recognized word, phrase or utterance to determine how closely each recognized word, phrase or utterance approximates the respective audio pronunciation from which each recognized word, phrase or utterance is derived. - View Dependent Claims (19, 20, 21)
-
-
22. A method for testing and improving the performance of a speech recognition engine, comprising:
-
identifying one or more words, phrases or utterances for recognition by a speech recognition engine;
creating and passing an audio pronunciation of each of the identified one or more words, phrases or utterances from a text-to-speech conversion module to the speech recognition engine;
deriving a recognized word, phrase or utterance for each audio pronunciation passed to the speech recognition engine;
assigning a confidence score to each recognized word, phrase or utterance based on the speech recognition engine'"'"'s confidence in each recognized word, phrase or utterance based on prior training of the speech recognition engine to recognize similar or same words, phrases or utterances as the each recognized word, phrase or utterance; and
if the confidence score is less than an acceptable threshold, modifying the speech recognition engine to recognize the word, phrase or utterance from which the recognized word, phrase or utterance is derived with higher accuracy. - View Dependent Claims (23, 24)
-
Specification