Corrective feedback loop for automated speech recognition
First Claim
Patent Images
1. A system comprising:
- an electronic data store configured to store;
one or more algorithms that, when executed, implement an automatic speech recognition engine; and
an initial language model; and
a computing device in communication with the electronic data store, the computing device configured to;
obtain audio data comprising speech;
generate a transcription of the speech with the initial language model;
generate an identifier associated with at least one of the audio data or the transcription;
transmit the transcription to a client device for presentation to a user;
transmit the identifier to the client device with the transcription;
receive feedback on the transcription from the client device;
receive the identifier from the client device with the feedback on the transcription; and
based at least in part on the feedback, generate an updated language model.
1 Assignment
0 Petitions
Accused Products
Abstract
Audio data that includes speech may be transcribed using a language model. The transcription may be provided to a user. The user may provide feedback on the transcription, and the language model may be updated based at least in part on the feedback. The feedback may include, for example, an affirmation of the transcription; a disapproval of the transcription; a correction to the transcription; a selection of an alternate transcription result; or any other kind of response.
102 Citations
19 Claims
-
1. A system comprising:
-
an electronic data store configured to store; one or more algorithms that, when executed, implement an automatic speech recognition engine; and an initial language model; and a computing device in communication with the electronic data store, the computing device configured to; obtain audio data comprising speech; generate a transcription of the speech with the initial language model; generate an identifier associated with at least one of the audio data or the transcription; transmit the transcription to a client device for presentation to a user; transmit the identifier to the client device with the transcription; receive feedback on the transcription from the client device; receive the identifier from the client device with the feedback on the transcription; and based at least in part on the feedback, generate an updated language model. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A non-transitory computer-readable medium having stored thereon a computer-executable component configured to execute in one or more processors of a computing device, the computer-executable component being further configured to:
-
receive first audio data comprising first speech; transcribe the first speech with a first language model to generate a first transcription; generate an identifier associated with at least one of the audio data or the first transcription; provide the first transcription to a first client device; provide the identifier to the first client device with the first transcription; receive feedback on the first transcription from the first client device; receive the identifier from the first client device with the feedback on the first transcription; and based at least in part on the feedback on the first transcription, update the first language model. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A computer-implemented method comprising:
under control of one or more computing devices configured with specific computer-executable instructions, receiving audio data comprising speech; generating speech recognition results from the speech using a first language model; generating an identifier associated with at least one of the audio data or the speech recognition results; providing the speech recognition results to a first client device; providing the identifier to the first client device with the speech recognition results; receiving feedback on the speech recognition results from the first client device; receiving the identifier from the first client device with the feedback on the speech recognition results; and updating the first language model based at least in part on the feedback. - View Dependent Claims (15, 16, 17, 18, 19)
Specification