Assisted speech recognition by dual search acceleration technique
First Claim
Patent Images
1. A speech recognition method, comprising:
- obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has yet to evaluate at this point in time; and
utilizing the partial results to alter the first speech recognition search process, wherein the first speech recognition search process proceeds from time frame to time frame at a different rate compared to the second speech recognition search process.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition method, system and program product, the method in one embodiment comprising: obtaining input speech data; initiating a first speech recognition search process with at least one hypothesis; initiating a second speech recognition search process with a plurality of hypotheses; obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has not evaluated at this point in time; and utilizing the partial results to alter the first speech recognition search process.
30 Citations
13 Claims
-
1. A speech recognition method, comprising:
-
obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has yet to evaluate at this point in time; and
utilizing the partial results to alter the first speech recognition search process, wherein the first speech recognition search process proceeds from time frame to time frame at a different rate compared to the second speech recognition search process. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A program product for speech recognition, comprising machine-readable program code for causing, when executed, a machine to perform the following method steps:
-
obtaining input speech data;
initiating a first speech recognition search process with at least one hypothesis;
initiating a second speech recognition search process with a plurality of hypotheses;
obtaining partial results from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has yet to evaluate at this point in time; and
utilizing the partial results to alter the first speech recognition search process, wherein the first speech recognition search process proceeds from time frame to time frame at a different rate compared to the second speech recognition search process. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A speech recognition method, comprising:
-
obtaining input speech data;
initiating a first speech recognition search process using a first parameter and a pruning threshold substantially simultaneously on a first plurality of hypotheses;
initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a second parameter for the second speech recognition search process is different from the first parameter for the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the priority queue search process has yet to evaluate at that point in time and a score for that hypothesis;
determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the first speech recognition search process has yet to evaluate and using a new pruning threshold, wherein a second parameter is a number of hypotheses being evaluated by the second speech recognition search process, which is larger than the number of hypotheses being evaluated the first speech recognition search process.
-
-
12. A speech recognition system, comprising:
-
a component for obtaining input speech data;
a component for initiating a first speech recognition search process using a first parameter and a pruning threshold substantially simultaneously on a first plurality of hypotheses;
a component for initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a parameter for the second speech recognition search process is different from the first parameter for the first speech recognition search process;
a component for obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has yet to evaluate at that point in time and a score for that hypothesis;
a component for determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
a component for, if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the first speech recognition search process has yet to evaluate and using a new pruning threshold, wherein a second parameter is a number of hypotheses being evaluated by the second speech recognition search process, which is larger than the number of hypotheses being evaluated by the first speech recognition search process.
-
-
13. A program product for speech recognition, comprising machine-readable program code for causing, when executed, a machine to perform the following method steps:
-
obtaining input speech data;
initiating a first speech recognition search process using a first parameter and a pruning threshold substantially simultaneously on a first plurality of hypotheses;
initiating a second speech recognition search process substantially simultaneously on a second plurality of hypotheses wherein a second parameter for the second speech recognition search process is different from the first parameter for the first speech recognition search process;
obtaining partial results at a point in time from the second speech recognition search process, where the partial results include an evaluation of at least one hypothesis that the first speech recognition search process has yet to evaluate at that point in time and a score for that hypothesis;
determining if the score for the at least one hypothesis evaluated by the second speech recognition search process for a particular time frame is better than a best score for that time frame for any hypothesis evaluated by the first speech recognition search process by more than a predetermined amount; and
if yes, then restarting the first speech recognition search process at that point in time on a plurality of hypotheses that include the at least one hypothesis that the first speech recognition search process has yet to evaluate and using a new pruning threshold, wherein a second parameter is a number of hypotheses being evaluated by the second speech recognition search process, which is larger than the number of hypotheses being evaluated by the first speech recognition search process.
-
Specification