Speech recognition with parallel recognition tasks
First Claim
1. A computer-implemented method comprising:
- receiving, at a computer system, an audio signal;
initiating, by the computer system, a plurality of speech recognition tasks for the audio signal, the plurality of speech recognition tasks running on a plurality of speech recognition systems;
detecting that a portion of the plurality of the speech recognition systems have completed their respective speech recognition tasks which comprise a completed portion of the plurality of speech recognition tasks, wherein a remaining portion of the plurality of the speech recognition systems have not completed their respective speech recognition tasks and are still processing their respective speech recognition tasks;
obtaining recognition results and confidence values for the completed portion of the plurality of speech recognition tasks, wherein the recognition results identify one or more candidate representations of the audio signal and the confidence values identify one or more probabilities that the recognition results are correct;
generating one or more combined confidence values for the recognition results based on the recognition results and the confidence values for the completed portion of the plurality of speech recognition tasks;
determining, by the computer system, whether at least one of the one or more combined confidence values is greater than or equal to a threshold confidence value; and
in response to determining that the at least one of the one or more combined confidence values is greater than or equal to the threshold confidence value and before the remaining portion of the plurality of the speech recognition systems have completed their respective speech recognition tasks, providing a final recognition result for the audio signal based on the recognition results and the one or more combined confidence values.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS'"'"'s). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS'"'"'s that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
60 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, at a computer system, an audio signal; initiating, by the computer system, a plurality of speech recognition tasks for the audio signal, the plurality of speech recognition tasks running on a plurality of speech recognition systems; detecting that a portion of the plurality of the speech recognition systems have completed their respective speech recognition tasks which comprise a completed portion of the plurality of speech recognition tasks, wherein a remaining portion of the plurality of the speech recognition systems have not completed their respective speech recognition tasks and are still processing their respective speech recognition tasks; obtaining recognition results and confidence values for the completed portion of the plurality of speech recognition tasks, wherein the recognition results identify one or more candidate representations of the audio signal and the confidence values identify one or more probabilities that the recognition results are correct; generating one or more combined confidence values for the recognition results based on the recognition results and the confidence values for the completed portion of the plurality of speech recognition tasks; determining, by the computer system, whether at least one of the one or more combined confidence values is greater than or equal to a threshold confidence value; and in response to determining that the at least one of the one or more combined confidence values is greater than or equal to the threshold confidence value and before the remaining portion of the plurality of the speech recognition systems have completed their respective speech recognition tasks, providing a final recognition result for the audio signal based on the recognition results and the one or more combined confidence values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer system comprising:
-
one or more computing devices; an interface of the one or more computing devices that is programmed to receive an audio signal; and a plurality of speech recognition systems that initiate a plurality of speech recognition tasks for the audio signal; a recognition managing module that is programmed to; detect that a portion of the plurality of speech recognition systems have completed their respective speech recognition tasks which comprise a completed portion of the plurality of speech recognition tasks, wherein a remaining portion of the plurality of speech recognition systems have not completed their respective speech recognition tasks and are still processing their respective speech recognition tasks, obtain recognition results and confidence values for the completed portion of the plurality of speech recognition tasks, wherein the recognition results identify one or more candidate representations of the audio signal and the confidence values identify one or more probabilities that the recognition results are correct, generate one or more combined confidence values for the recognition results based on the recognition results and the confidence values for the completed portion of the plurality of speech recognition tasks, determine whether at least one of the one or more combined confidence values is greater than or equal to a threshold confidence value, and in response to determining that the at least one of the one or more combined confidence values is greater than or equal to the threshold confidence value and before the remaining portion of the plurality of speech recognition systems have completed their respective speech recognition tasks, provide through the interface a final recognition result for the audio signal based on the recognition results and the one or more combined confidence values. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer program product embodied in a computer readable storage device storing instructions that, when executed, cause one or more computing devices to perform operations comprising:
-
receiving an audio signal; initiating a plurality of speech recognition tasks for the audio signal, the plurality of speech recognition tasks running on a plurality of speech recognitions systems; detecting that a portion of the plurality of the speech recognition systems have completed their respective speech recognition tasks and have yielded a completed portion of the plurality of speech recognition tasks, wherein a remaining portion of the plurality of the speech recognition systems have not completed their respective speech recognition tasks and are still processing their respective speech recognition tasks; obtaining recognition results and confidence values for the completed portion of the plurality of speech recognition tasks, wherein the recognition results identify one or more candidate representations of the audio signal and the confidence values identify one or more probabilities that the recognition results are correct; generating one or more combined confidence values for the recognition results based on the recognition results and the confidence values for the completed portion of the plurality of speech recognition tasks; determining whether at least one of the one or more combined confidence values is greater than or equal to a threshold confidence value; and in response to determining that the at least one of the one or more combined confidence values is greater than or equal to the threshold confidence value and before the remaining portion of the plurality of speech recognition systems have completed their respective speech recognition tasks, providing a final recognition result for the audio signal based on the recognition results and the one or more combined confidence values.
-
Specification