Method and system for improved speech recognition by degrading utterance pronunciations
First Claim
1. In a speech recognition engine executing on a computer, a method of improving overall system recognition performance comprising steps of:
- identifying a confusable pair of words, comprising a target word and a substitute word, in a candidate set of incorrectly recognized words by;
(a) identifying a first word as being a word that is rarely misrecognized and is frequently returned as confusions for other words;
(b) identifying a second word as being a word for which the first word is frequently substituted;
(c) determining the first word to be the substitute word in the confusable pair; and
(d) determining the second word to be the target word in the confusable pair; and
adjusting a pronunciation of the substitute word to a worse pronunciation of the substitute word, wherein the adjusted pronunciation of the substitute word results in less accurate detection of the substitute word by the speech recognition engine.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system or method can include a speech input device and a processor coupled to the speech input device. The processor can be programmed to identify a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.
-
Citations
9 Claims
-
1. In a speech recognition engine executing on a computer, a method of improving overall system recognition performance comprising steps of:
-
identifying a confusable pair of words, comprising a target word and a substitute word, in a candidate set of incorrectly recognized words by; (a) identifying a first word as being a word that is rarely misrecognized and is frequently returned as confusions for other words; (b) identifying a second word as being a word for which the first word is frequently substituted; (c) determining the first word to be the substitute word in the confusable pair; and (d) determining the second word to be the target word in the confusable pair; and adjusting a pronunciation of the substitute word to a worse pronunciation of the substitute word, wherein the adjusted pronunciation of the substitute word results in less accurate detection of the substitute word by the speech recognition engine. - View Dependent Claims (2, 3)
-
-
4. A speech recognition computer-implemented system, comprising:
-
a speech input device; a processor coupled to the speech input device, wherein the processor is programmed to; identify a confusable pair of words, comprising a target word and a substitute word, in a candidate set of incorrectly recognized words by; (a) identifying a first word as being a word that is rarely misrecognized and is frequently returned as confusions for other words; (b) identifying a second word as being a word for which the first word is frequently substituted; (c) determining the first word to be the substitute word in the confusable pair; and (d) determining the second word to be the target word in the confusable pair; and adjust a pronunciation of the substitute word to a worse pronunciation of the substitute word, wherein the adjusted pronunciation of the substitute word results in less accurate detection of the substitute word by the speech recognition engine. - View Dependent Claims (5, 6)
-
-
7. A non-transitory machine-readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
-
identifying a confusable pair of words, comprising a target word and a substitute word, in a candidate set of incorrectly recognized words by; (a) identifying a first word as being a word that is rarely misrecognized and is frequently returned as confusions for other words; (b) identifying a second word as being a word for which the first word is frequently substituted; (c) determining the first word to be the substitute word in the confusable pair; and (d) determining the second word to be the target word in the confusable pair; and adjusting a pronunciation of the substitute word to a worse pronunciation of the substitute word, wherein the adjusted pronunciation of the substitute word results in less accurate detection of the substitute word by the speech recognition engine. - View Dependent Claims (8, 9)
-
Specification