System and method for latency reduction for automatic speech recognition using partial multi-pass results
First Claim
Patent Images
1. A method comprising:
- transcribing, via a processor, speech data using a first automatic speech recognition pass, which operates at a first transcription rate, to produce a first transcription data and a first word graph;
displaying a displayed part comprising an indication of a second automatic speech recognition pass which is forthcoming and at least part of the first transcription data corresponding to a portion of the speech data;
after displaying the displayed part, transcribing the speech data using the second automatic speech recognition pass, wherein the second automatic speech recognition pass uses the first word graph to produce second transcription data and a second word graph, and wherein the second automatic speech recognition pass is slower than the first automatic speech recognition pass; and
upon completing the second automatic speech recognition pass, updating the displayed part based at least in part on the second transcription data.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method is provided for reducing latency for automatic speech recognition. In one embodiment, intermediate results produced by multiple search passes are used to update a display of transcribed text.
-
Citations
20 Claims
-
1. A method comprising:
-
transcribing, via a processor, speech data using a first automatic speech recognition pass, which operates at a first transcription rate, to produce a first transcription data and a first word graph; displaying a displayed part comprising an indication of a second automatic speech recognition pass which is forthcoming and at least part of the first transcription data corresponding to a portion of the speech data; after displaying the displayed part, transcribing the speech data using the second automatic speech recognition pass, wherein the second automatic speech recognition pass uses the first word graph to produce second transcription data and a second word graph, and wherein the second automatic speech recognition pass is slower than the first automatic speech recognition pass; and upon completing the second automatic speech recognition pass, updating the displayed part based at least in part on the second transcription data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a processor; and a non-transitory computer-readable memory storing instructions which, when executed by the processor, cause the processor to perform a method comprising; transcribing speech data using a first automatic speech recognition pass, which operates at a first transcription rate, to produce a first transcription data and a first word graph; displaying a displayed part comprising an indication of a second automatic speech recognition pass which is forthcoming and at least part of the first transcription data corresponding to a portion of the speech data after displaying the displayed part, transcribing the speech data using the second automatic speech recognition pass, wherein the second automatic speech recognition pass uses the first word graph, wherein the second automatic speech recognition pass produces a second transcription data and a second word graph, and wherein the second automatic speech recognition pass is slower than the first automatic speech recognition pass; and upon completing the second automatic speech recognition pass, updating the displayed part based at least in part on the second transcription data. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to perform a method comprising:
-
transcribing speech data using a first automatic speech recognition pass, which operates at a first transcription rate, to produce a first transcription data and a first word graph; displaying a displayed part comprising an indication of a second automatic speech recognition pass which is forthcoming and at least part of the first transcription data corresponding to a portion of the speech data after displaying the displayed part, transcribing the speech data using the second automatic speech recognition pass, wherein the second automatic speech recognition pass uses the first word graph, wherein the second automatic speech recognition pass produces a second transcription data and a second word graph, and wherein the second automatic speech recognition pass is slower than the first automatic speech recognition pass; and upon completing the second automatic speech recognition pass, updating the displayed part based at least in part on the second transcription data. - View Dependent Claims (18, 19, 20)
-
Specification