Minimum Bayesian risk methods for automatic speech recognition
First Claim
1. A method comprising:
- selecting, by a computing device, n hypothesis-space transcriptions of an utterance from a search graph that includes t>n transcriptions of the utterance, wherein selecting the n hypothesis-space transcriptions comprises determining n best transcriptions of the utterance according to a maximum a posteriori (MAP) technique;
- randomly selecting m evidence-space transcriptions of the utterance from the search graph, wherein t>m;
- for each particular hypothesis-space transcription of the n hypothesis-space transcriptions, calculating an expected word error rate by comparing the particular hypothesis-space transcription to the randomly selected m evidence-space transcriptions;
- based on the expected word error rates, determining a lowest expected word error rate; and
- providing the particular hypothesis-space transcription that is associated with the lowest expected word error rate.
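The selection rule recited in the claim can be written compactly. This is only a sketch: the symbols are not in the claim text and are introduced here for illustration. H denotes the n hypothesis-space transcriptions, E the m randomly selected evidence-space transcriptions, L(W, W') a word-level edit distance, and |W'| the number of words in W'. The unweighted average assumes the evidence-space transcriptions are drawn in proportion to their posterior probability, which the claim does not state.

```latex
\hat{W} \;=\; \operatorname*{arg\,min}_{W \in H} \; \frac{1}{m} \sum_{W' \in E} \frac{L(W, W')}{|W'|}
```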
Abstract
A hypothesis space of a search graph may be determined. The hypothesis space may include n hypothesis-space transcriptions of an utterance, each selected from a search graph that includes t>n transcriptions of the utterance. An evidence space of the search graph may also be determined. The evidence space may include m evidence-space transcriptions of the utterance that are randomly selected from the search graph, where t>m. For each particular hypothesis-space transcription in the hypothesis space, an expected word error rate may be calculated by comparing the particular hypothesis-space transcription to each of the evidence-space transcriptions. Based on the expected word error rates, a lowest expected word error rate may be obtained, and the particular hypothesis-space transcription that is associated with the lowest expected word error rate may be provided.
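The procedure in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the search graph is stood in for by a flat list of (transcription, posterior) pairs, the evidence space is sampled with replacement in proportion to posterior probability, and the word error rate is a word-level Levenshtein distance normalized by the evidence transcription's length. The function names `word_edit_distance`, `expected_wer`, and `mbr_select` are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def word_edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance over words (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def expected_wer(hypothesis: str, evidence: Sequence[str]) -> float:
    """Average word error rate of `hypothesis`, treating each evidence
    transcription as if it were the reference."""
    hyp_words = hypothesis.split()
    total = 0.0
    for ev in evidence:
        ev_words = ev.split()
        total += word_edit_distance(ev_words, hyp_words) / max(len(ev_words), 1)
    return total / len(evidence)


def mbr_select(scored: List[Tuple[str, float]], n: int, m: int,
               rng: random.Random) -> Tuple[str, float]:
    """`scored` stands in for the t transcriptions of a search graph
    (assumption: a flat list of (transcription, posterior) pairs)."""
    t = len(scored)
    if not (0 < n < t and 0 < m < t):
        raise ValueError("need t > n > 0 and t > m > 0")
    # Hypothesis space: the n best transcriptions by posterior (MAP ranking).
    ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
    hypotheses = [text for text, _ in ranked[:n]]
    # Evidence space: m random draws. Assumption: posterior-weighted,
    # with replacement; the source only says "randomly selected".
    texts = [text for text, _ in scored]
    weights = [p for _, p in scored]
    evidence = rng.choices(texts, weights=weights, k=m)
    # Provide the hypothesis with the lowest expected word error rate.
    best = min(hypotheses, key=lambda h: expected_wer(h, evidence))
    return best, expected_wer(best, evidence)


if __name__ == "__main__":
    toy = [("the cat sat", 0.4), ("the cat sad", 0.3),
           ("a cat sat", 0.2), ("the hat sat", 0.1)]
    print(mbr_select(toy, n=2, m=3, rng=random.Random(0)))
```

Restricting the candidates to the n best transcriptions keeps the cost at n times m edit-distance computations instead of comparing all t transcriptions against one another.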
18 Claims
1. A method comprising:
- selecting, by a computing device, n hypothesis-space transcriptions of an utterance from a search graph that includes t>n transcriptions of the utterance, wherein selecting the n hypothesis-space transcriptions comprises determining n best transcriptions of the utterance according to a maximum a posteriori (MAP) technique;
- randomly selecting m evidence-space transcriptions of the utterance from the search graph, wherein t>m;
- for each particular hypothesis-space transcription of the n hypothesis-space transcriptions, calculating an expected word error rate by comparing the particular hypothesis-space transcription to the randomly selected m evidence-space transcriptions;
- based on the expected word error rates, determining a lowest expected word error rate; and
- providing the particular hypothesis-space transcription that is associated with the lowest expected word error rate.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An article of manufacture including a non-transitory computer-readable medium, having stored thereon program instructions that, upon execution by a computing device, cause the computing device to perform operations comprising:
- selecting n hypothesis-space transcriptions of an utterance from a search graph that includes t>n transcriptions of the utterance, wherein selecting the n hypothesis-space transcriptions comprises determining n best transcriptions of the utterance according to a maximum a posteriori (MAP) technique;
- randomly selecting m evidence-space transcriptions of the utterance from the search graph, wherein t>m;
- for each particular hypothesis-space transcription of the selected n hypothesis-space transcriptions, calculating an expected word error rate by comparing the particular hypothesis-space transcription to the randomly selected m evidence-space transcriptions;
- based on the expected word error rates, determining a lowest expected word error rate; and
- providing the particular hypothesis-space transcription that is associated with the lowest expected word error rate.
Dependent claims: 12, 13, 14, 15
16. A computing device comprising:
- at least one processor;
- data storage; and
- program instructions stored in the data storage that, when executed by the processor, cause the computing device to perform operations comprising:
  - selecting n hypothesis-space transcriptions of an utterance from a search graph that includes t>n transcriptions of the utterance, wherein selecting the n hypothesis-space transcriptions comprises determining n best transcriptions of the utterance according to a maximum a posteriori (MAP) technique;
  - randomly selecting m evidence-space transcriptions of the utterance from the search graph, wherein t>m;
  - for each particular hypothesis-space transcription of the selected n hypothesis-space transcriptions, calculating an expected word error rate by comparing the particular hypothesis-space transcription to the randomly selected m evidence-space transcriptions;
  - based on the expected word error rates, determining a lowest expected word error rate; and
  - providing the particular hypothesis-space transcription that is associated with the lowest expected word error rate.
Dependent claims: 17, 18
Specification