Enhanced stability prediction for incrementally generated speech recognition hypotheses
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting the stability of speech recognition results. In one aspect, a method includes determining a length of time, or a number of occasions, in which a word has remained in an incremental speech recognizer's top hypothesis, and assigning a stability metric to the word based on the length of time or number of occasions.
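As a rough illustration of the "length of time" variant described in the abstract, the following Python sketch measures how long a word has remained in the recognizer's top hypothesis. The function name `word_age_ms`, the `(timestamp_ms, words)` history format, and the choice to match words by position are assumptions made for this example; they are not taken from the patent.

```python
from typing import List, Tuple

# Each history entry is (timestamp in milliseconds, words of the top hypothesis).
# This format is an assumption for illustration only.
History = List[Tuple[int, List[str]]]


def word_age_ms(history: History, position: int) -> int:
    """Return how long the word at `position` in the latest top hypothesis
    has stayed unchanged, looking backwards through the history."""
    if not history:
        raise ValueError("history must contain at least one hypothesis")
    latest_time, latest_words = history[-1]
    word = latest_words[position]
    first_seen = latest_time
    # Walk backwards until the word at this position differs or is absent.
    for timestamp, words in reversed(history):
        if position < len(words) and words[position] == word:
            first_seen = timestamp
        else:
            break
    return latest_time - first_seen


history = [
    (0, ["wreck"]),
    (100, ["recognize"]),
    (200, ["recognize", "peach"]),
    (300, ["recognize", "speech"]),
]
print(word_age_ms(history, 0))  # 200: "recognize" has held since t=100
print(word_age_ms(history, 1))  # 0: "speech" only appeared at t=300
```

A stability metric could then be derived from this age, for example by comparing it against a threshold; the abstract does not fix a particular mapping.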
34 Claims
1. A computer-implemented method comprising:
receiving multiple, partial incremental speech recognition hypotheses that are each output by an incremental speech recognizer as a top partial incremental speech recognition hypothesis at a different point in time;
identifying a segment that occurs in a particular one of the multiple, partial incremental speech recognition hypotheses;
determining a quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment; and
assigning, by the one or more computers, a stability metric to the segment based on the quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment.
Dependent claims: 2, 3, 5, 6, 7, 8, 9, 10, 11, 31, 32
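The counting step recited in claim 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: hypotheses are modeled as lists of words, a segment as a contiguous word sequence, and the helper names (`contains_segment`, `consecutive_count`, `stability_metric`) and the `saturation` parameter are hypothetical. The claim only says the metric is "based on" the quantity, so the linear mapping used here is one arbitrary choice.

```python
from typing import List, Sequence


def contains_segment(hypothesis: Sequence[str], segment: Sequence[str]) -> bool:
    """True if `segment` occurs as a contiguous run of words in `hypothesis`."""
    n = len(segment)
    if n == 0:
        return False
    return any(
        list(hypothesis[i:i + n]) == list(segment)
        for i in range(len(hypothesis) - n + 1)
    )


def consecutive_count(
    hypotheses: List[Sequence[str]], index: int, segment: Sequence[str]
) -> int:
    """Count the hypotheses output immediately after hypotheses[index]
    that still include `segment`, stopping at the first one that does not."""
    if not contains_segment(hypotheses[index], segment):
        raise ValueError("segment does not occur in the chosen hypothesis")
    count = 0
    for later in hypotheses[index + 1:]:
        if not contains_segment(later, segment):
            break
        count += 1
    return count


def stability_metric(count: int, saturation: int = 4) -> float:
    """Hypothetical mapping from the count to a score in [0, 1]."""
    return min(1.0, count / saturation)


hypotheses = [
    ["wreck"],
    ["wreck", "a"],
    ["recognize"],
    ["recognize", "peach"],
    ["recognize", "speech"],
]
count = consecutive_count(hypotheses, 2, ["recognize"])
print(count, stability_metric(count))  # 2 0.5
```

In this toy history, "recognize" first appears in the third hypothesis and survives the next two, so its count is 2, while "wreck" survives only one further hypothesis before being replaced.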
4. (canceled)
12. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, if executed by the one or more computers, to cause the one or more computers to perform operations comprising:
receiving multiple, partial incremental speech recognition hypotheses that are each output by an incremental speech recognizer as a top partial incremental speech recognition hypothesis at a different point in time;
identifying a segment that occurs in a particular one of the multiple, partial incremental speech recognition hypotheses;
determining a quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment; and
assigning, by the one or more computers, a stability metric to the segment based on the quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment.
Dependent claims: 13, 14, 16, 17, 18, 20, 21, 33
15. (canceled)
19. (canceled)
22. A computer-readable storage device storing software comprising instructions executable by one or more computers, which, upon such execution, cause the one or more computers to perform operations comprising:
receiving multiple, partial incremental speech recognition hypotheses that are each output by an incremental speech recognizer as a top partial incremental speech recognition hypothesis at a different point in time;
identifying a segment that occurs in a particular one of the multiple, partial incremental speech recognition hypotheses;
determining a quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment; and
assigning, by the one or more computers, a stability metric to the segment based on the quantity of consecutive, partial incremental speech recognition hypotheses that (i) are output by the incremental speech recognizer as the top partial incremental speech recognition hypotheses at different points in time immediately after the one particular partial incremental speech recognition hypothesis is output, and (ii) include the segment.
Dependent claims: 23, 24, 26, 27, 28, 29, 34
25. (canceled)
30. (canceled)
Specification