Use of intermediate speech transcription results in editing final speech transcription results
First Claim
1. A computer-implemented method comprising:
- receiving, at a user device, data representing text, the text comprising final speech transcription results and intermediate speech transcription results generated from an audio stream comprising an utterance;
at least temporarily displaying, via the user device, all of the intermediate speech transcription results in a list, wherein each newly-received intermediate transcription result is added to the list as it is received at the user device; and
displaying, via the user device, the final speech transcription results for viewing by a user.
5 Assignments
0 Petitions
Accused Products
Abstract
A communication system includes at least one transmitting device and at least one receiving device, one or more network systems for connecting the transmitting device to the receiving device, and an automatic speech recognition (“ASR”) system, including an ASR engine. A user speaks an utterance into the transmitting device, and the recorded speech audio is sent to the ASR engine. The ASR engine returns intermediate transcription results to the transmitting device, which displays the intermediate transcription results in real-time to the user. The intermediate transcription results are also correlated by utterance fragment to final transcription results and displayed to the user. The user may use the information thus presented to make decisions as to whether to edit the final transcription results or to speak the utterance again, thereby repeating the process. The intermediate transcription results may also be used by the user to edit the final transcription results.
-
Citations
21 Claims
-
1. A computer-implemented method comprising:
-
receiving, at a user device, data representing text, the text comprising final speech transcription results and intermediate speech transcription results generated from an audio stream comprising an utterance; at least temporarily displaying, via the user device, all of the intermediate speech transcription results in a list, wherein each newly-received intermediate transcription result is added to the list as it is received at the user device; and displaying, via the user device, the final speech transcription results for viewing by a user.
-
-
2. A computer-implemented method comprising:
-
receiving, at a user device, data representing text, the text comprising final speech transcription results and intermediate speech transcription results generated from an audio stream comprising an utterance; at least temporarily displaying the intermediate speech transcription results via the user device; and displaying the final speech transcription results via the user device; wherein the intermediate speech transcription results are displayed via the user device substantially while the final speech transcription results are displayed. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable medium comprising a computer-executable component configured to be executed in one or more processors of a user device, the computer-executable component being further configured to:
-
receive speech via the user device; obtain one or more intermediate speech transcription results from the speech; cause the user device to display each of the one or more intermediate speech transcription results as it is obtained; obtain final transcription results from the speech; and upon obtaining the final transcription results from the speech, cause the user device to display concurrently both the one or more intermediate transcription results and the final transcription results. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A system comprising:
-
an electronic data store configured to store instructions, that when executed, implement an automatic speech recognition engine; and a computing device in communication with the electronic data store, the computing device configured to; receive speech; obtain, using the automatic speech recognition engine, one or more intermediate speech transcription results from the speech; display each of the one or more intermediate speech transcription results as it is obtained; obtain, using the automatic speech recognition engine, final transcription results from the speech; and upon obtaining the final transcription results from the speech, display concurrently both the one or more intermediate transcription results and the final transcription results. - View Dependent Claims (17, 18, 19, 20, 21)
-
Specification