Dual search acceleration technique for speech recognition
First Claim
Patent Images
1. A speech recognition method, comprising:
- obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and
utilizing the partial results to alter the first speech recognition search process.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition method, system and program product, the method in one embodiment comprising: obtaining input speech data; initiating a first speech recognition search process with at least one hypothesis; initiating a second speech recognition search process with a plurality of hypotheses; obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and utilizing the partial results to alter the first speech recognition search process.
57 Citations
28 Claims
-
1. A speech recognition method, comprising:
-
obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and
utilizing the partial results to alter the first speech recognition search process. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for speech recognition, comprising:
-
a component for obtaining input speech data;
a component for performing a first speech recognition search process with at least one hypothesis;
a component for performing a second speech recognition search process with a plurality of hypotheses;
a component for obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and
a component for utilizing the partial results to alter the first speech recognition search process.
-
-
8. A program product for speech recognition, comprising machine-readable program code for causing, when executed, a machine to perform the following method steps:
-
obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and
utilizing the partial results to alter the first speech recognition search process. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A speech recognition method, comprising:
-
obtaining input speech data;
initiating a priority queue best first speech recognition search process using a pruning threshold on a best first hypothesis selected from a plurality of hypotheses ranked in an order;
initiating a second speech recognition search process substantially simultaneously on a plurality of hypotheses that includes at least one hypothesis that is not evaluated by the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at the point in time and a score for that hypothesis;
adding a hypothesis to the plurality of hypotheses in the priority queue or reordering the hypotheses in the priority queue based on the partial results;
changing the pruning threshold for the priority queue speech recognition process for each point in time that the second process has evaluated; and
if a criteria is met, selecting a new best first hypothesis and initiating the new priority queue best first speech recognition search process using the new best first search hypothesis at the point in time where the partial results were taken.
-
-
15. A speech recognition system, comprising:
-
a component for obtaining input speech data;
a component for initiating a priority queue best first speech recognition search process using a pruning threshold on a best first hypothesis selected from a plurality of hypotheses ranked in an order;
a component for initiating a second speech recognition search process substantially simultaneously on a plurality of hypotheses that includes at least one hypothesis that is not evaluated by the first speech recognition search process;
a component for obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at the point in time and a score for that hypothesis;
a component for adding a hypothesis to the plurality of hypotheses in the priority queue or reordering the hypotheses in the priority queue based on the partial results;
a component for changing the pruning threshold for the priority queue speech recognition process for each point in time that the second process has evaluated; and
a component for, if a criteria is met, selecting a new best first hypothesis and initiating the new priority queue best first speech recognition search process using the new best first search hypothesis at the point in time where the partial results were taken.
-
-
16. A program product for speech recognition, comprising machine-readable program code for, when executed, causing a machine to perform the following method steps:
-
obtaining input speech data;
initiating a priority queue best first speech recognition search process using a pruning threshold on a best first hypothesis selected from a plurality of hypotheses ranked in an order;
initiating a second speech recognition search process substantially simultaneously on a plurality of hypotheses that includes at least one hypothesis that is not evaluated by the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at the point in time and a score for that hypothesis;
adding a hypothesis to the plurality of hypotheses in the priority queue or reordering the hypotheses in the priority queue based on the partial results;
changing the pruning threshold for the priority queue speech recognition process for each point in time that the second process has evaluated; and
if a criteria is met, selecting a new best first hypothesis and initiating the new priority queue best first speech recognition search process using the new best first search hypothesis at the point in time where the partial results were taken.
-
-
17. A speech recognition method, comprising:
-
obtaining input speech data;
initiating a first speech recognition search process using a pruning threshold substantially simultaneously on a first plurality of hypothesis;
initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a parameter for the process is different from that parameter in the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at that point in time and a score for that hypothesis;
determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the priority queue search process has not evaluated and using a new pruning threshold. - View Dependent Claims (18, 19, 20)
-
-
21. A speech recognition system, comprising:
-
a component for obtaining input speech data;
a component for initiating a first speech recognition search process using a pruning threshold substantially simultaneously on a first plurality of hypothesis;
a component for initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a parameter for the process is different from that parameter in the first speech recognition search process;
a component for obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at that point in time and a score for that hypothesis;
a component for determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
a component for, if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the priority queue search process has not evaluated and using a new pruning threshold. - View Dependent Claims (22, 23, 24)
-
-
25. A program product for speech recognition, comprising machine-readable program code for causing, when executed, a machine to perform the following method steps:
-
obtaining input speech data;
initiating a first speech recognition search process using a pruning threshold substantially simultaneously on a first plurality of hypothesis;
initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a parameter for the process is different from that parameter in the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has not evaluated at that point in time and a score for that hypothesis;
determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the priority queue search process has not evaluated and using a new pruning threshold. - View Dependent Claims (26, 27, 28)
-
Specification