Speech recognition with parallel recognition tasks
First Claim
1. A computer-implemented method comprising:
receiving an audio signal;
sending, by a computer system, a plurality of signals, the plurality of signals configured to initiate execution of a speech recognition task by each speech recognition system (SRS) from a plurality of SRS's, each speech recognition task including generation of (i) a recognition result specifying possible speech included in the audio signal and (ii) a confidence value indicating a probability that the recognition result generated by that SRS is correct;
in response to detecting that a portion of the plurality of speech recognition tasks have completed by at least one SRS, generating, by the computer system, (i) one or more recognition results for each of the completed speech recognition tasks and (ii) one or more confidence values for each of the one or more recognition results;
determining whether each confidence value from the one or more confidence values for the completed portion of the speech recognition tasks meets a threshold for the at least one SRS;
in response to determining that the one or more confidence values for the completed portion of the speech recognition tasks meet the threshold for the at least one SRS, sending a signal configured to cause at least one other remaining SRS from the plurality of SRS's to abort a speech recognition task that has not completed and that has not generated a confidence value; and
outputting a final recognition result based at least in part on at least one of the generated one or more recognition results from the at least one SRS.
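The flow recited in this claim (start several recognizers in parallel, check the confidence of whichever finish first, abort the rest once the threshold is met) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: `Recognition`, `fake_srs`, and `recognize` are hypothetical names, the recognizers are simulated with timers, and the rule "every completed result must meet its SRS's threshold" is one literal reading of the claim language.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Recognition:
    srs_name: str
    text: str
    confidence: float  # assumed to lie in [0, 1]


async def fake_srs(name: str, delay: float, text: str, confidence: float) -> Recognition:
    """Hypothetical stand-in for one SRS: waits, then returns a canned result."""
    await asyncio.sleep(delay)
    return Recognition(name, text, confidence)


async def recognize(audio: bytes, thresholds: dict[str, float]) -> tuple[Recognition, list[str]]:
    # Start one recognition task per SRS. The audio is ignored by the fakes.
    tasks = {
        asyncio.create_task(fake_srs("fast", 0.01, "call mom", 0.92)): "fast",
        asyncio.create_task(fake_srs("slow", 0.05, "call tom", 0.95)): "slow",
    }
    completed: list[Recognition] = []
    aborted: list[str] = []
    pending = set(tasks)
    while pending:
        done, pending = await asyncio.wait(pending, return_when=asyncio.FIRST_COMPLETED)
        completed.extend(task.result() for task in done)
        # Compare each completed confidence value with that SRS's threshold.
        if all(r.confidence >= thresholds[r.srs_name] for r in completed):
            # Signal the SRS's that have not finished to abort their tasks.
            for task in pending:
                task.cancel()
                aborted.append(tasks[task])
            await asyncio.gather(*pending, return_exceptions=True)
            break
    # Assumed selection rule: output the most confident completed result.
    best = max(completed, key=lambda r: r.confidence)
    return best, aborted
```

With both thresholds at 0.9, `asyncio.run(recognize(b"", {"fast": 0.9, "slow": 0.9}))` returns the "fast" result and reports "slow" as aborted. Raising the "fast" threshold above 0.92 lets both tasks run to completion.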
Abstract
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not completed generating a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
257 Citations
21 Claims
1. A computer-implemented method comprising:
receiving an audio signal;
sending, by a computer system, a plurality of signals, the plurality of signals configured to initiate execution of a speech recognition task by each speech recognition system (SRS) from a plurality of SRS's, each speech recognition task including generation of (i) a recognition result specifying possible speech included in the audio signal and (ii) a confidence value indicating a probability that the recognition result generated by that SRS is correct;
in response to detecting that a portion of the plurality of speech recognition tasks have completed by at least one SRS, generating, by the computer system, (i) one or more recognition results for each of the completed speech recognition tasks and (ii) one or more confidence values for each of the one or more recognition results;
determining whether each confidence value from the one or more confidence values for the completed portion of the speech recognition tasks meets a threshold for the at least one SRS;
in response to determining that the one or more confidence values for the completed portion of the speech recognition tasks meet the threshold for the at least one SRS, sending a signal configured to cause at least one other remaining SRS from the plurality of SRS's to abort a speech recognition task that has not completed and that has not generated a confidence value; and
outputting a final recognition result based at least in part on at least one of the generated one or more recognition results from the at least one SRS.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A computer-implemented method comprising:
receiving an audio signal;
initiating speech recognition tasks by a plurality of speech recognition systems (SRS's), each configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the recognition result;
completing a portion of the speech recognition tasks by at least one SRS comprising generating one or more recognition results and one or more confidence values for the one or more recognition results;
determining whether the one or more confidence values for the completed portion of the speech recognition tasks meets a confidence threshold for the at least one SRS;
pausing a remaining portion of the speech recognition tasks for other remaining SRS's that have not completed and that have not generated a confidence value if the one or more confidence values meets the confidence threshold for the at least one SRS; and
outputting a final recognition result based on at least one of the generated one or more recognition results from the at least one SRS.
Dependent claims: 18, 21
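Claim 17 differs from claim 1 in that the unfinished tasks are paused rather than aborted, so their partial work is retained and could be resumed. A minimal sketch of that distinction, assuming each recognizer can be advanced one unit of work at a time, is given below. The generator-based `stepped_srs` and the round-robin `run_with_pause` are hypothetical illustrations, not anything described in the claim itself, and `run_with_pause` assumes at least one task is supplied.

```python
from typing import Generator, Optional

Result = tuple[str, float]  # (recognized text, confidence assumed in [0, 1])


def stepped_srs(steps: int, text: str, confidence: float) -> Generator[Optional[Result], None, None]:
    """Hypothetical SRS: yields None per unit of work, then yields its result."""
    for _ in range(steps):
        yield None
    yield (text, confidence)


def run_with_pause(tasks: dict[str, Generator], thresholds: dict[str, float]):
    running = dict(tasks)
    results: dict[str, Result] = {}
    paused: dict[str, Generator] = {}
    while running:
        # Advance every running SRS by one step, round-robin.
        for name in list(running):
            outcome = next(running[name])
            if outcome is None:
                continue  # this SRS still has work to do
            results[name] = outcome
            del running[name]
            if outcome[1] >= thresholds[name]:
                # Pause: keep the unfinished generators (and their state)
                # but stop advancing them.
                paused.update(running)
                running.clear()
                break
    # Assumed selection rule: the most confident completed result.
    final = max(results.values(), key=lambda r: r[1])
    return final, paused
```

For example, running a 2-step "fast" recognizer next to a 100-step "slow" one with thresholds of 0.8 returns the fast result and leaves the slow generator in `paused`. Calling `next(paused["slow"])` afterwards continues from where that recognizer stopped, which an aborted task could not do.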
19. A system comprising:
a plurality of speech recognition systems (SRS's) that initiate execution of a plurality of speech recognition tasks to identify possible speech encoded in a received audio signal, each speech recognition task including generation of (i) a recognition result specifying possible speech included in the received audio signal and (ii) a confidence value indicating a probability that the recognition result generated by that SRS is correct;
a recognition managing module to:
receive, for a portion of the plurality of speech recognition tasks that have completed by at least one SRS, one or more recognition results and one or more confidence values associated with the one or more recognition results;
determine whether each confidence value from the one or more confidence values for the completed portion of the speech recognition tasks meets a threshold for the at least one SRS; and
in response to determining that the one or more confidence values for the completed portion of the speech recognition tasks meet the threshold for the at least one SRS, send a signal configured to cause at least one other remaining SRS from the plurality of SRS's to abort a speech recognition task that has not completed and that has not generated a confidence value; and
an interface for transmitting a final recognition result selected based at least in part on the one or more confidence values of the one or more recognition results from the at least one SRS.
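The recognition managing module of claim 19 can be pictured as a small object that receives results, checks them against per-SRS thresholds, and emits abort signals. The sketch below is one possible shape for such a module under stated assumptions: the class name `RecognitionManager`, the `send_abort` callback standing in for the signal channel, and the "highest confidence wins" rule in `final_result` are all hypothetical choices, not details taken from the claim.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class RecognitionManager:
    thresholds: dict[str, float]        # per-SRS confidence thresholds
    send_abort: Callable[[str], None]   # hypothetical signal channel to one SRS
    results: dict[str, tuple[str, float]] = field(default_factory=dict)
    abort_sent: set[str] = field(default_factory=set)

    def receive(self, srs: str, text: str, confidence: float) -> None:
        """Record a completed result and abort the remaining SRS's if warranted."""
        self.results[srs] = (text, confidence)
        meets_threshold = all(
            conf >= self.thresholds[name]
            for name, (_, conf) in self.results.items()
        )
        if not meets_threshold:
            return
        for name in self.thresholds:
            # Only SRS's with no result yet, and not already signalled.
            if name not in self.results and name not in self.abort_sent:
                self.send_abort(name)
                self.abort_sent.add(name)

    def final_result(self) -> str:
        """Assumed selection rule: text of the most confident result so far."""
        text, _ = max(self.results.values(), key=lambda r: r[1])
        return text
```

As a usage example, constructing the manager with thresholds for three recognizers "a", "b", and "c" (all 0.8) and a list's `append` method as `send_abort`, then calling `receive("a", "call mom", 0.85)`, records abort signals for "b" and "c" and makes `final_result()` return "call mom".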
20. A system comprising:
a plurality of speech recognition systems (SRS's) that initiate execution of a plurality of speech recognition tasks to identify possible speech encoded in a received audio signal, each speech recognition task including generation of (i) a recognition result specifying possible speech included in the received audio signal and (ii) a confidence value indicating a probability that the recognition result generated by that SRS is correct;
means for:
receiving, for a portion of the plurality of speech recognition tasks that have completed by at least one SRS, one or more recognition results and one or more confidence values associated with the one or more recognition results;
determine whether each confidence value from the one or more confidence values for the completed portion of the speech recognition tasks meets a threshold for the at least one SRS; and
in response to determining that the one or more confidence values for the completed portion of the speech recognition tasks meet the threshold for the at least one SRS, sending a signal configured to cause at least one other remaining SRS from the plurality of SRS's to abort a speech recognition task that has not completed and that has not generated a confidence value; and
an interface for transmitting a final recognition result selected based at least in part on the one or more confidence values of the one or more recognition results from the at least one SRS, wherein the final recognition result represents possible speech within the received audio signal.
Specification