Timely speech recognition
First Claim
1. A computer-implemented method, the computer-implemented method comprising:
- under control of a computing device configured with specific computer-executable instructions, receiving a first portion of audio data;
generating, with an automatic speech recognition engine, first transcribed text corresponding to the first portion of the audio data;
determining a confidence level for transcription accuracy of the first transcribed text;
transmitting the first transcribed text to a first device for presentation on the first device;
transmitting the confidence level to the first device, the confidence level associated with a cue for presentation on the first device, wherein the cue indicates the confidence level for transcription accuracy of the first transcribed text, and wherein the cue is distinct from the first transcribed text;
substantially while the first transcribed text is being presented on the first device, receiving a second portion of the audio data; and
generating, with the automatic speech recognition engine, second transcribed text corresponding to the first portion of the audio data and the second portion of the audio data; and
transmitting the second transcribed text to the first device for presentation on the first device.
Abstract
An automatic speech recognition engine may generate text or tokens that correspond to audio data. For example, the automatic speech recognition engine may generate first text or first speech tokens corresponding to a first portion of audio data. The automatic speech recognition engine may further generate second text or second speech tokens that correspond to a first portion of the audio data and a second portion of the audio data. The text or speech tokens generated by the automatic speech recognition engine may be provided to a device for presentation thereon. In some embodiments, the automatic speech recognition engine generates the second text or second speech tokens substantially while the first text or first speech tokens are presented on the device.
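The incremental behavior summarized in the abstract can be illustrated with a minimal sketch. It assumes a hypothetical `recognize` callable standing in for the automatic speech recognition engine; the names `incremental_transcripts` and `toy_recognizer` are illustrative only and do not come from the patent. Each new transcript is generated from all audio received so far, so the second text corresponds to the first portion and the second portion together.

```python
from typing import Callable, Iterable, Iterator

# Hypothetical stand-in for an ASR engine: any callable mapping audio bytes to text.
Recognizer = Callable[[bytes], str]


def incremental_transcripts(
    portions: Iterable[bytes], recognize: Recognizer
) -> Iterator[str]:
    """Yield one transcript per received portion, each covering all audio so far."""
    received = bytearray()
    for portion in portions:
        received.extend(portion)
        # Recognition is re-run over the first portion plus every later portion,
        # so earlier text can be revised once more context is available.
        yield recognize(bytes(received))


def toy_recognizer(audio: bytes) -> str:
    # Toy assumption for demonstration: the "audio" is simply UTF-8 text.
    return audio.decode("utf-8").strip()


if __name__ == "__main__":
    for text in incremental_transcripts([b"recognize spee", b"ch today"], toy_recognizer):
        print(text)
```

In this toy run the first transcript ends in a partial word, and the second transcript replaces it once the remaining audio arrives. A real engine would of course operate on audio samples rather than text bytes.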
18 Claims
1. A computer-implemented method, the computer-implemented method comprising:
under control of a computing device configured with specific computer-executable instructions, receiving a first portion of audio data;
generating, with an automatic speech recognition engine, first transcribed text corresponding to the first portion of the audio data;
determining a confidence level for transcription accuracy of the first transcribed text;
transmitting the first transcribed text to a first device for presentation on the first device;
transmitting the confidence level to the first device, the confidence level associated with a cue for presentation on the first device, wherein the cue indicates the confidence level for transcription accuracy of the first transcribed text, and wherein the cue is distinct from the first transcribed text;
substantially while the first transcribed text is being presented on the first device, receiving a second portion of the audio data; and
generating, with the automatic speech recognition engine, second transcribed text corresponding to the first portion of the audio data and the second portion of the audio data; and
transmitting the second transcribed text to the first device for presentation on the first device.
7. A system comprising:
an electronic data store configured to store one or more algorithms that, when executed, implement an automatic speech recognition engine; and
a computing device in communication with the electronic data store, the computing device configured to:
receive a first portion of audio data;
generate, with the automatic speech recognition engine, first transcribed text corresponding to the first portion of the audio data;
determine a first confidence level for transcription accuracy of the first transcribed text;
transmit the first transcribed text to a first device for presentation on the first device;
transmit the first confidence level to the first device, the first confidence level associated with a cue for presentation on the first device, wherein the cue indicates the first confidence level for transcription accuracy of the first transcribed text, and wherein the cue is distinct from the first transcribed text;
substantially while the first transcribed text is presented on the first device, receive a second portion of the audio data; and
generate, with the automatic speech recognition engine, second transcribed text corresponding to the first portion of the audio data and the second portion of the audio data; and
transmit the second transcribed text to the first device for presentation on the first device.
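One way to picture a confidence level that travels with the transcribed text, yet maps to a cue distinct from that text, is sketched below. The 0.0 to 1.0 scale, the thresholds, the color names, and the JSON message shape are all assumptions made for illustration; the claims do not specify any of them.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class TranscriptUpdate:
    text: str
    confidence: float  # assumed scale: 0.0 (no confidence) to 1.0 (full confidence)


def cue_for(confidence: float) -> str:
    """Map a confidence level to a presentation cue (thresholds are assumptions)."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0.0 and 1.0")
    if confidence >= 0.8:
        return "green"
    if confidence >= 0.5:
        return "yellow"
    return "red"


def encode_update(update: TranscriptUpdate) -> str:
    """Serialize the text, its confidence level, and the cue as separate fields."""
    payload = asdict(update)
    # The cue is carried alongside the text, never merged into it.
    payload["cue"] = cue_for(update.confidence)
    return json.dumps(payload)


if __name__ == "__main__":
    print(encode_update(TranscriptUpdate("turn on the lights", 0.9)))
```

Keeping the cue in its own field lets the receiving device choose how to render it, for example as a colored marker next to the text, without altering the transcription itself.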
13. A non-transitory computer-readable storage medium having stored thereon a computer-executable module configured to execute in one or more processors, the computer-executable module being further configured to:
obtain a first portion of audio data;
transmit the first portion of the audio data to a remote computing device;
receive, from the remote computing device, transcribed text corresponding to the first portion of the audio data;
cause presentation of the transcribed text;
receive, from the remote computing device, a first confidence level for transcription accuracy of the first transcribed text, the first confidence level associated with a first cue, wherein the first cue indicates the first confidence level for transcription accuracy of the first transcribed text, and wherein the first cue is distinct from the first transcribed text;
cause presentation of the first cue;
substantially while the first transcribed text is caused to be presented, obtain a second portion of the audio data;
transmit the second portion of the audio data to the remote computing device; and
receive, from the remote computing device, second transcribed text corresponding to the first portion of the audio data and the second portion of the audio data; and
cause presentation of the second transcribed text.
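The device-side steps recited above can be sketched as a simple loop. This is only an illustration under stated assumptions: `FakeRemote` is a hypothetical in-process stand-in for the remote computing device, its confidence formula is invented for the demo, and `present_text` and `present_cue` are hypothetical callbacks for the presentation steps.

```python
from typing import Callable, Iterable, Tuple

# Hypothetical transport: sends one audio portion, returns (text, confidence).
SendPortion = Callable[[bytes], Tuple[str, float]]


class FakeRemote:
    """In-process stand-in for the remote computing device (assumption)."""

    def __init__(self) -> None:
        self._audio = bytearray()

    def send(self, portion: bytes) -> Tuple[str, float]:
        self._audio.extend(portion)
        # Toy assumption: the "audio" is UTF-8 text, and confidence grows
        # with the number of words heard so far, capped at 1.0.
        text = self._audio.decode("utf-8")
        confidence = min(1.0, len(text.split()) / 4)
        return text, confidence


def run_client(
    portions: Iterable[bytes],
    send_portion: SendPortion,
    present_text: Callable[[str], None],
    present_cue: Callable[[float], None],
) -> None:
    for portion in portions:
        # The previously presented text stays on screen while this next
        # portion is obtained and transmitted.
        text, confidence = send_portion(portion)
        present_text(text)       # the text itself
        present_cue(confidence)  # the cue, presented separately from the text


if __name__ == "__main__":
    run_client([b"turn on", b" the lights"], FakeRemote().send, print, print)
```

Passing the presentation steps in as callbacks keeps the loop independent of any particular display, which also makes the sketch easy to exercise without a device.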
Specification