Error correction in speech recognition by correcting text around selected area
First Claim
1. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
- performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving from a user a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing a first alternative transcript to the recognized utterance to be corrected;
producing a first recognition correction based on the comparison;
comparing a second alternative transcript to the recognized utterance to be corrected;
producing a second recognition correction based on the second comparison; and
replacing a portion of the recognition result with one of the first recognition correction and the second recognition correction;
wherein a duration of the first recognition correction differs from a duration of the second recognition correction, and the portion of the recognition result replaced includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced and includes at least one word other than the user-selected word.
11 Assignments
0 Petitions
Accused Products
Abstract
Correcting incorrect text associated with recognition errors in computer-implemented speech recognition includes receiving a selection of a word from a recognized utterance. The selection indicates a bound of a portion of the recognized utterance to be corrected. A first recognition correction is produced based on a comparison between a first alternative transcript and the recognized utterance. A second recognition correction is produced based on a comparison between a second alternative transcript and the recognized utterance. The duration of the first recognition correction differs from the duration of the second recognition correction. A portion of the recognition result that is replaced with one of the first recognition correction and the second recognition correction. includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced.
-
Citations
32 Claims
-
1. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving from a user a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing a first alternative transcript to the recognized utterance to be corrected;
producing a first recognition correction based on the comparison;
comparing a second alternative transcript to the recognized utterance to be corrected;
producing a second recognition correction based on the second comparison; and
replacing a portion of the recognition result with one of the first recognition correction and the second recognition correction;
wherein a duration of the first recognition correction differs from a duration of the second recognition correction, and the portion of the recognition result replaced includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced and includes at least one word other than the user-selected word. - View Dependent Claims (2, 3)
-
-
4. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing a first alternative transcript to the recognized utterance to be corrected;
producing a first recognition correction based on the comparison;
comparing a second alternative transcript to the recognized utterance to be corrected;
producing a second recognition correction based on the second comparison; and
replacing a portion of the recognition result with one of the first recognition correction and the second recognition correction;
wherein a duration of the first recognition correction differs from a duration of the second recognition correction, and the portion of the recognition result replaced includes at one bound a word indicated by the selection and extends for the duration of the one of the first recognition correction and the second recognition correction with which the portion is replaced, and wherein comparing an alternative transcript to the recognized utterance comprises;
selecting from the alternative transcript a test word that is not identical to the selected word and that begins at a time that is nearest a time at which the selected word begins; and
searching in time, through the recognized utterance relative to the selected word, and through the alternative transcript relative to the test word, until finding a word common to the recognized utterance and the alternative transcript. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected;
comparing an alternative transcript to the recognized utterance to be corrected, the comparing comprising;
selecting from the alternative transcript a test word that begins at a time that is nearest a time at which the selected word begins; and
searching in time, relative to the selected word, through the recognized utterance, and searching in time, relative to the test word, through the alternative transcript until finding a word common to the recognized utterance and the alternative transcript;
producing a recognition correction based on the comparison; and
replacing a portion of the recognition result with the recognition correction.
-
-
16. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
performing speech recognition on an utterance to produce a recognition result for the utterance;
receiving a selection of a word from the recognized utterance, the selection indicating a bound of a portion of the recognized utterance to be corrected; and
comparing an alternative transcript to the recognized utterance to be corrected, the comparing including;
selecting from the alternative transcript a test word that is not identical to the selected word and that occurs at a time that is nearest a time at which the selected word occurs, and searching in time, through the recognized utterance relative to the selected word, and through the alternative transcript relative to the test word, until finding a word common to the recognized utterance and the alternative transcript. - View Dependent Claims (17)
-
-
18. A method of correcting incorrect text associated with recognition errors in computer-implemented speech recognition, the method comprising:
-
receiving a text document formed by recognizing speech utterances using a vocabulary;
receiving a general confusability matrix having one or more values each indicating a likelihood of confusion between a first phoneme and a second phoneme;
receiving corrected text that corresponds to misrecognized text from the text document;
generating a sequence of phonemes for the corrected text;
aligning the generated sequence of phonemes with phonemes of the misrecognized text;
adjusting one or more values of the general confusability matrix based on the alignment to form a specific confusability matrix; and
searching the text document for additional instances of the corrected text using the specific confusability matrix. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
Specification