Targeted detection of regions in speech processing data streams
First Claim
Patent Images
1. A method performed by a speech recognition processing component, the method comprising:
- receiving first audio data;
determining, using the first audio data, speech processing results;
determining second data indicating that the speech processing results include a first incorrect portion;
determining third audio data as corresponding to the first incorrect portion, wherein the third audio data includes at least a portion of the first audio data;
after determining third audio data corresponding to the first incorrect position, generating an indicator associated with the third audio data; and
sending the third audio data, the indicator and the first incorrect portion to a speech recognition training component.
0 Assignments
0 Petitions
Accused Products
Abstract
In speech processing systems, a special audio trigger indication is configured to efficiently isolate and mark incorrect speech processing results. The trigger indication may be configured to be easily recognizable by a speech processing device under various ASR and acoustic conditions. Once a speech processing device recognizes the trigger indication, incorrectly processed speech processing results are marked and may be isolated and prioritized for review by training and upgrading processes.
22 Citations
21 Claims
-
1. A method performed by a speech recognition processing component, the method comprising:
-
receiving first audio data; determining, using the first audio data, speech processing results; determining second data indicating that the speech processing results include a first incorrect portion; determining third audio data as corresponding to the first incorrect portion, wherein the third audio data includes at least a portion of the first audio data; after determining third audio data corresponding to the first incorrect position, generating an indicator associated with the third audio data; and sending the third audio data, the indicator and the first incorrect portion to a speech recognition training component. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computing system, comprising:
-
at least one processor; memory including instructions operable to be executed by the at least one processor to perform a set of actions, configuring the computing system to; receive first audio data; determine, using the first audio data, speech processing results; determine second data indicating that the speech processing results include a first incorrect portion; determine third audio data as corresponding to the first incorrect portion, wherein the third audio data includes at least a portion of the first audio data; after determining third audio data corresponding to the first incorrect position, generate an indicator associated with the third audio data; and send the third audio data, the indicator and the first incorrect portion to a speech recognition training component. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage medium storing non-transitory processor-executable instructions for controlling a computing system, comprising:
-
program code to receive first audio data; program code to determine, using the first audio data, speech processing results; program code to determine second data indicating that the speech processing results include a first incorrect portion; program code to determine third audio data as corresponding to the first incorrect portion, wherein the third audio data includes at least a portion of the first audio data; program code to, after determining third audio data corresponding to the first incorrect position, generate an indicator associated with the third audio data; and program code to send the third audio data, the indicator and the first incorrect portion to a speech recognition training component. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification