Word-level correction of speech input
First Claim
1. A computer-implemented method comprising:
- obtaining first and second transcriptions of an utterance from an automated speech recognizer, wherein the second transcription of the utterance represents an alternate recognition result to the first transcription of the utterance, and wherein a portion of the first transcription of the utterance is different than a corresponding portion of the second transcription of the utterance;
providing the first transcription of the utterance for output;
receiving data indicating a single selection of the portion of the first transcription of the utterance; and
in response to receiving the data indicating the single selection, providing the second transcription of the utterance for output.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Citations
19 Claims
-
1. A computer-implemented method comprising:
-
obtaining first and second transcriptions of an utterance from an automated speech recognizer, wherein the second transcription of the utterance represents an alternate recognition result to the first transcription of the utterance, and wherein a portion of the first transcription of the utterance is different than a corresponding portion of the second transcription of the utterance; providing the first transcription of the utterance for output; receiving data indicating a single selection of the portion of the first transcription of the utterance; and in response to receiving the data indicating the single selection, providing the second transcription of the utterance for output. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented system for correcting words in transcribed text, the system comprising:
-
an automated speech recognizer operable to receive speech audio data and in response transcribe the speech audio data in a word lattice; and a computing device comprising; a microphone operable to receive speech audio and generate the speech audio data, a network interface operable to send the speech audio data to the automated speech recognizer and in response receive the word lattice from the automated speech recognizer, a display screen operable to present one or more transcribed words from the word lattice, a user interface operable to receive a user selection of at least one of the transcribed words, and one or more processors and a memory storing instructions that when executed by the processors cause the computing device to perform operations to; provide the user interface that includes (i) an output area for outputting a first transcription of an utterance, and (ii) a control associated with a second transcription of the utterance, wherein the second transcription of the utterance represents an alternate recognition result to the first transcription of the utterance, and wherein a portion of the first transcription of the utterance is different than a corresponding portion of the second transcription of the utterance; present, at the output area, the first transcription of the utterance, wherein the first transcription of the utterance includes one or more words from the word lattice; receive data indicating a selection of the control associated with the second transcription of the utterance; and update the output area to replace the first transcription of the utterance with the second transcription of the utterance. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computer program product, encoded on a non-transitory computer-readable medium, operable to cause one or more processors to perform operations for correcting words in transcribed text, the operations comprising:
-
providing, from a word lattice obtained by an automated speech recognizer, a first transcription of an utterance, the first transcription of the utterance including one or more words; receiving data indicating a single selection of a single word from the one or more words of the first transcription of the utterance; in response to receiving the data indicating the single selection, identifying an alternate word from the word lattice that corresponds to the single word; determining that the alternate word has a highest speech recognizer confidence measure value among all alternate words for the single word that are in the word lattice; selecting, from the word lattice, a second transcription of the utterance that includes the alternate word; and replacing the first transcription of the utterance with the second transcription of the utterance. - View Dependent Claims (15, 16, 17, 18, 19)
-
Specification