Corrective feedback loop for automated speech recognition
First Claim
Patent Images
1. A computer-implemented method comprising:
- under control of a computing device configured with specific computer-executable instructions,generating audio data comprising speech;
transmitting the audio data to a remote computing system including a speech recognition engine;
receiving, from the remote computing system, a plurality of transcription results for a portion of a transcription of the speech, wherein the transcription has been generated from the audio data by the speech recognition engine;
receiving, from the remote computing system, a confidence level for each transcription result of the plurality of transcription results, wherein the confidence level for each transcription result has been generated by the speech recognition engine, and wherein the confidence level for each transcription result of the plurality of transcription results represents a confidence in an accuracy of the transcription result;
determining a ranked order for the plurality of transcription results from the confidence levels of the plurality of transcription results;
presenting the plurality of transcription results for the portion of the transcription in the ranked order, with each transcription result of the plurality of transcription results presented with the confidence level for the transcription result; and
receiving a selection, from the plurality of transcription results, of a first transcription result for the portion of the transcription.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
-
Citations
16 Claims
-
1. A computer-implemented method comprising:
under control of a computing device configured with specific computer-executable instructions, generating audio data comprising speech; transmitting the audio data to a remote computing system including a speech recognition engine; receiving, from the remote computing system, a plurality of transcription results for a portion of a transcription of the speech, wherein the transcription has been generated from the audio data by the speech recognition engine; receiving, from the remote computing system, a confidence level for each transcription result of the plurality of transcription results, wherein the confidence level for each transcription result has been generated by the speech recognition engine, and wherein the confidence level for each transcription result of the plurality of transcription results represents a confidence in an accuracy of the transcription result; determining a ranked order for the plurality of transcription results from the confidence levels of the plurality of transcription results; presenting the plurality of transcription results for the portion of the transcription in the ranked order, with each transcription result of the plurality of transcription results presented with the confidence level for the transcription result; and receiving a selection, from the plurality of transcription results, of a first transcription result for the portion of the transcription. - View Dependent Claims (2, 3)
-
4. A computer-implemented method comprising:
-
generating, at a computing device, audio data comprising speech; transmitting, by the computing device, the audio data to a remote computing system, wherein the remote computing system includes a speech recognition engine; receiving, at the computing device, a transcription of the speech from the remote computing system, wherein the transcription that has been generated from the audio data by the speech recognition engine of the remote computing system, and wherein the transcription includes a portion having a first transcription result and a second transcription result; receiving, at the computing device, a confidence level for the first transcription result and a confidence level for the second transcription result, wherein the confidence level for the first transcription result and the confidence level for the second transcription result have been generated by the speech recognition engine, and wherein the confidence level for the first transcription result is greater than the confidence level for the second transcription result; presenting, at the computing device, the transcription with the first transcription result for the portion; presenting, at the computing device, the second transcription result for the portion for selection as an alternative to the first transcription result for the portion; presenting at the computing device, at least the confidence level for the second transcription result; and receiving, at the computing device, selection of the second transcription result as the alternative to the first transcription result for the portion. - View Dependent Claims (5, 6, 7)
-
-
8. A device comprising:
-
a microphone configured to capture speech; a memory configured to store audio data corresponding to the speech; and a processor in communication with the microphone and the memory, the processor configured to execute specific computer-executable instructions to at least; provide the audio data to a speech recognition system; receive a transcription of the speech from the speech recognition system, wherein the transcription that has been generated from the audio data by the speech recognition system, and wherein the transcription includes a portion having a first transcription result and a second transcription result; receive a confidence level for the first transcription result and a confidence level for the second transcription result, wherein the confidence level for the first transcription result and the confidence level for the second transcription result have been generated by the speech recognition engine, and wherein the confidence level for the first transcription result is greater than the confidence level for the second transcription result; present the transcription with the first transcription result for the portion; present the second transcription result for the portion for selection as an alternative to the first transcription result for the portion; present at least the confidence level for the second transcription result; and receive a selection of the second transcription result as the alternative to the first transcription result for the portion. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computing device comprising:
-
a memory configured to store audio data comprising speech; and a processor in communication with the memory, the processor configured to execute specific computer-executable instructions to at least; transmit the audio data comprising speech to a remote computing system including a speech recognition engine; receive, from the remote computing system, a plurality of transcription results for a portion of a transcription of the speech, wherein the transcription has been generated from the audio data by the speech recognition engine; receive, from the remote computing system, a confidence level for each transcription result of the plurality of transcription results, wherein the confidence level for each transcription result has been generated by the speech recognition engine, and wherein the confidence level for each transcription result of the plurality of transcription results represents a confidence in an accuracy of the transcription result; and determine a ranked order for the plurality of transcription results from the confidence level'"'"'s of the plurality of transcription results; present the plurality of transcription results for the portion of the transcription in the ranked order, with each transcription result of the plurality of transcription results presented with the confidence level for the transcription result; and receive a selection, from the plurality of transcription results, of a first transcription result for the portion of the transcription. - View Dependent Claims (14, 15, 16)
-
Specification