Speech recognition with parallel recognition tasks
First Claim
1. A computer-implemented method comprising:
receiving, at a computer system, an audio signal;
initiating, by the computer system, a plurality of speech recognition tasks for the audio signal, wherein the speech recognition tasks each use a different one of a plurality of language models;
detecting that a portion of the plurality of speech recognition tasks have completed, wherein a remaining portion of the plurality of speech recognition tasks have not completed;
obtaining recognition results and confidence values for each of the plurality of speech recognition tasks included in the portion, wherein the recognition results identify one or more candidate transcriptions of the audio signal, and the confidence values identify one or more probabilities that the recognition results are correct;
determining, by the computer system, whether at least one of the one or more confidence values is greater than or equal to a threshold confidence value; and
in response to determining that the at least one of the one or more confidence values is greater than or equal to the threshold confidence value and before all of the remaining portion of the plurality of speech recognition tasks have completed, providing a final recognition result for the audio signal based on the recognition results and the one or more confidence values.
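The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `recognize_early`, the `(transcription, confidence)` result tuple, and the recognizer callables (each standing in for a recognition task that uses a different language model) are all hypothetical. Choosing the highest-confidence result as the final result, and falling back to it when no value reaches the threshold, are also assumptions made only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, List, Optional, Tuple

# Hypothetical result shape: (candidate transcription, confidence in [0, 1]).
Result = Tuple[str, float]


def recognize_early(audio: bytes,
                    recognizers: List[Callable[[bytes], Result]],
                    threshold: float) -> Optional[Result]:
    """Run all recognizers in parallel and return once a confidence meets the threshold."""
    if not recognizers:
        return None
    pool = ThreadPoolExecutor(max_workers=len(recognizers))
    # Initiate one recognition task per recognizer for the same audio signal.
    futures = [pool.submit(recognizer, audio) for recognizer in recognizers]
    completed: List[Result] = []
    try:
        # as_completed yields each task as it finishes, so only the
        # completed portion of the tasks is inspected at any point.
        for future in as_completed(futures):
            completed.append(future.result())
            best = max(completed, key=lambda r: r[1])
            if best[1] >= threshold:
                # Provide a final result before the remaining tasks complete.
                return best
        # Assumption: if no confidence reached the threshold, return the best seen.
        return max(completed, key=lambda r: r[1])
    finally:
        # Do not block on unfinished tasks; they keep running in the background.
        pool.shutdown(wait=False)
```

This sketch returns early but does not stop the unfinished tasks; aborting them is a separate concern.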
Abstract
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
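The aborting step described in the abstract can be sketched with Python's `asyncio`, where each SRS is modeled as a coroutine that can be cancelled. This is only an illustration under stated assumptions: `recognize_and_abort`, the coroutine-based SRS stand-ins, and the `(speech, confidence)` tuple are hypothetical names, and selecting the highest-confidence result as the final output is an assumption rather than something the abstract specifies.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional, Tuple

# Hypothetical result shape: (possible speech, confidence in [0, 1]).
Result = Tuple[str, float]


async def recognize_and_abort(
        audio: bytes,
        systems: List[Callable[[bytes], Awaitable[Result]]],
        threshold: float) -> Tuple[Optional[Result], int]:
    """Return (final result, number of aborted tasks)."""
    # Initiate one recognition task per SRS.
    pending = {asyncio.create_task(srs(audio)) for srs in systems}
    results: List[Result] = []
    met = False
    while pending and not met:
        # Wake up as soon as any portion of the tasks has completed.
        done, pending = await asyncio.wait(
            pending, return_when=asyncio.FIRST_COMPLETED)
        results.extend(task.result() for task in done)
        met = any(conf >= threshold for _, conf in results)
    # Abort the tasks of every SRS that has not generated a result yet.
    aborted = len(pending)
    for task in pending:
        task.cancel()
    # Let the cancellations settle; CancelledError is collected, not raised.
    await asyncio.gather(*pending, return_exceptions=True)
    # Assumption: output the highest-confidence result generated so far.
    final = max(results, key=lambda r: r[1]) if results else None
    return final, aborted
```

Cancellation here is cooperative: a task stops only at its next `await`, so a real recognizer would need suitable yield points for the abort to take effect promptly.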
164 Citations
20 Claims
1. A computer-implemented method comprising:
receiving, at a computer system, an audio signal;
initiating, by the computer system, a plurality of speech recognition tasks for the audio signal, wherein the speech recognition tasks each use a different one of a plurality of language models;
detecting that a portion of the plurality of speech recognition tasks have completed, wherein a remaining portion of the plurality of speech recognition tasks have not completed;
obtaining recognition results and confidence values for each of the plurality of speech recognition tasks included in the portion, wherein the recognition results identify one or more candidate transcriptions of the audio signal, and the confidence values identify one or more probabilities that the recognition results are correct;
determining, by the computer system, whether at least one of the one or more confidence values is greater than or equal to a threshold confidence value; and
in response to determining that the at least one of the one or more confidence values is greater than or equal to the threshold confidence value and before all of the remaining portion of the plurality of speech recognition tasks have completed, providing a final recognition result for the audio signal based on the recognition results and the one or more confidence values.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer system comprising:
one or more computing devices;
an interface of the one or more computing devices that is programmed to receive an audio signal; and
a plurality of speech recognition systems that initiate a plurality of speech recognition tasks for the audio signal, wherein the speech recognition tasks each use a different one of a plurality of language models;
a recognition managing module that is programmed to:
detect that a portion of the plurality of speech recognition tasks have completed, wherein a remaining portion of the plurality of speech recognition tasks have not completed,
obtain recognition results and confidence values for each of the plurality of speech recognition tasks included in the portion, wherein the recognition results identify one or more candidate transcriptions of the audio signal, and the confidence values identify one or more probabilities that the recognition results are correct,
determine whether at least one of the one or more confidence values is greater than or equal to a threshold confidence value, and
in response to determining that the at least one of the one or more confidence values is greater than or equal to the threshold confidence value and before all of the remaining portion of the plurality of speech recognition tasks have completed, provide a final recognition result for the audio signal based on the recognition results and the one or more confidence values.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
20. A computer program product embodied in a computer readable storage device storing instructions that, when executed, cause one or more computing devices to perform operations comprising:
receiving an audio signal;
initiating a plurality of speech recognition tasks for the audio signal, wherein the speech recognition tasks each use a different one of a plurality of language models;
detecting that a portion of the plurality of speech recognition tasks have completed, wherein a remaining portion of the plurality of speech recognition tasks have not completed;
obtaining recognition results and confidence values for each of the plurality of speech recognition tasks included in the portion, wherein the recognition results identify one or more candidate transcriptions of the audio signal, and the confidence values identify one or more probabilities that the recognition results are correct;
determining whether at least one of the one or more confidence values is greater than or equal to a threshold confidence value; and
in response to determining that the at least one of the one or more confidence values is greater than or equal to the threshold confidence value and before all of the remaining portion of the plurality of speech recognition tasks have completed, providing a final recognition result for the audio signal based on the recognition results and the one or more confidence values.
Specification