Corrective feedback loop for automated speech recognition
First Claim
1. A method for facilitating updating of a language model, the method comprising:
- as implemented by a client device configured with specific computer-executable instructions,receiving an audio message comprising speech of a user;
communicating the audio message to a first remote server;
receiving, from the first remote server,a transcription generated by an automatic speech recognition engine from the audio message; and
an alternative result matrix generated by the automatic speech recognition engine from the audio message;
receiving an affirmation of the transcription from the user;
storing the transcription with an identifier corresponding to the audio message; and
communicating the identifier and the transcription to a second remote server.
6 Assignments
0 Petitions
Accused Products
Abstract
A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.
92 Citations
26 Claims
-
1. A method for facilitating updating of a language model, the method comprising:
as implemented by a client device configured with specific computer-executable instructions, receiving an audio message comprising speech of a user; communicating the audio message to a first remote server; receiving, from the first remote server, a transcription generated by an automatic speech recognition engine from the audio message; and an alternative result matrix generated by the automatic speech recognition engine from the audio message; receiving an affirmation of the transcription from the user; storing the transcription with an identifier corresponding to the audio message; and communicating the identifier and the transcription to a second remote server. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A non-transitory computer-readable medium having a computer-executable component configured for execution by one or more processors of a client device the computer-executable component being further configured to:
-
receive an audio message comprising speech of a user; communicate the audio message to a first remote server; receive, from the first remote server, a transcription of the audio message generated by an automatic speech recognition engine; receive an affirmation of the transcription from the user; store the transcription with an identifier corresponding to the audio message; and communicate the transcription and the identifier to a second remote server, wherein the transcription and the identifier are communicated to the second remote server in response to at least one of a user instruction, an API call, or a next contact between the client device and the second remote server. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method for facilitating the updating of a language model, the method comprising:
-
receiving, at a client device, an audio message corresponding to speech of a user; communicating, to a first remote server, the audio message; receiving, at the client device, a result and alternative result matrix transcribed, at the first remote server using an automatic speech recognition engine, from the audio message; receiving, at the client device from the user, a manual correction of the result; storing, at the client device, the corrected result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A method for facilitating the updating of a language model, the method comprising:
-
receiving, at a client device, a first audio message corresponding to speech of a user; communicating, to a first remote server, the first audio message; receiving, at the client device, a first result, transcribed at the first remote server using an automatic speech recognition engine, from the first audio message; receiving, at the client device from the user, a disapproval of the first result; receiving, at the client device, a second audio message corresponding to speech of the user; communicating, to the first remote server, the second audio message; receiving, at the client device, a second result, transcribed at the first remote server using the automatic speech recognition engine, from the second audio message; receiving, at the client device from the user, an affirmation of the second result; storing, at the client device, the second result in association with an identifier corresponding to the second audio message; and communicating, to a second remote server, the stored second result together with the identifier. - View Dependent Claims (21)
-
-
22. A system comprising:
-
an electronic data store configured to store; one or more algorithms that, when executed, implement an automatic speech recognition engine; and a language model; and a computing device in communication with the electronic data store, the computing device configured to; receive an audio message from a client device, the audio message comprising speech; based at least in part on the language model, generate a transcription of the audio message with the automatic speech recognition engine; based at least in part on the language model, generate one or more alternate results for the audio message with the automatic speech recognition engine; transmit the transcription and the one or more alternate results to the client device; receive a response and an identifier of the audio message from the client device; and based at least in part on the response, update the language model to generate an updated language model. - View Dependent Claims (23, 24, 25, 26)
-
Specification