Error correction in speech recognition
First Claim
Patent Images
1. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
- performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing a first alternative transcript to the recognized utterance to be corrected;
producing a first recognition correction based on the comparison;
comparing a second alternative transcript to the recognized utterance to be corrected;
producing a second recognition correction based on the second comparison; and
replacing a portion of the recognition result with one of the first recognition correction and the second recognition correction;
wherein a duration of the first recognition correction differs from a duration of the second recognition correction, and the portion of the recognition result replaced includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced.
11 Assignments
0 Petitions
Accused Products
Abstract
New techniques and systems may be implemented to improve error correction in speech recognition. These new techniques and systems may be implemented to correct errors in speech recognition systems may be used in a standard desktop environment, in a mobile environment, or in any other type of environment that can receive and/or present recognized speech.
367 Citations
53 Claims
-
1. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing a first alternative transcript to the recognized utterance to be corrected;
producing a first recognition correction based on the comparison;
comparing a second alternative transcript to the recognized utterance to be corrected;
producing a second recognition correction based on the second comparison; and
replacing a portion of the recognition result with one of the first recognition correction and the second recognition correction;
wherein a duration of the first recognition correction differs from a duration of the second recognition correction, and the portion of the recognition result replaced includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 17, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
15. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing an alternative transcript to the recognized utterance to be corrected, the comparing comprising;
selecting from the alternative transcript a test word that begins at a time that is nearest a time at which the selected word begins; and
searching in time, relative to the selected word, through the recognized utterance, and searching in time, relative to the test word, through the alternative transcript until finding a word common to the recognized utterance and the alternative transcript;
producing a recognition correction based on the comparison; and
replacing a portion of the recognition result with the recognition correction.
-
-
16. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected; and
comparing an alternative transcript to the recognized utterance to be corrected, the comparing including;
selecting from the alternative transcript a test word that is not identical to the selected word and that occurs at a time that is nearest a time at which the selected word occurs, and searching in time, through the recognized utterance relative to the selected word, and through the alternative transcript relative to the test word, until finding a word common to the recognized utterance and the alternative transcript.
-
-
18. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
receiving a text document formed by recognizing speech utterances using a vocabulary;
receiving a general confusability matrix having one or more values each indicating a likelihood of confusion between a first phoneme and a second phoneme;
receiving corrected text that corresponds to misrecognized text from the text document;
generating a sequence of phonemes for the corrected text;
aligning the generated sequence of phonemes with phonemes of the misrecognized text;
adjusting one or more values of the general confusability matrix based on the alignment to form a specific confusability matrix; and
searching the text document for additional instances of the corrected text using the specific confusability matrix.
-
-
33. A computer-implemented method for speech recognition, the method comprising:
-
receiving dictated text;
generating recognized speech based on the received dictated text, the generating comprising determining acoustic models for the dictated text that best match acoustic data for the dictated text;
receiving an edited text of the recognized speech, the edited text indicating a replacement for a portion of the dictated text;
determining an acoustic model for the edited text;
determining whether to adapt acoustic models for the edited text based on the acoustic model for the edited text and the acoustic model for the dictated text portion. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 47, 48, 49, 50, 51, 52, 53)
-
-
46. A computer-implemented method of speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of the recognition result;
receiving a correction of the recognition result;
performing speech recognition on the correction using a constraint grammar that permits spelling and pronunciation in parallel; and
identifying whether the correction comprises a spelling or a pronunciation using the constraint grammar.
-
Specification