Method and apparatus for predicting word error rates from text
First Claim
Patent Images
1. A method of modeling a speech recognition system, the method comprising:
- decoding a speech signal produced from a training text, the training text comprising a sequence of actual speech units to produce a sequence of predicted speech units;
constructing a confusion model based on the sequence of actual speech units and the sequence of predicted speech units; and
decoding a test text using the confusion model and a language model to generate at least one model-predicted sequence of speech units.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of modeling a speech recognition system includes decoding a speech signal produced from a training text to produce a sequence of predicted speech units. The training text comprises a sequence of actual speech units that is used with the sequence of predicted speech units to form a confusion model. In further embodiments, the confusion model is used to decode a text to identify an error rate that would be expected if the speech recognition system decoded speech based on the text.
41 Citations
23 Claims
-
1. A method of modeling a speech recognition system, the method comprising:
-
decoding a speech signal produced from a training text, the training text comprising a sequence of actual speech units to produce a sequence of predicted speech units;
constructing a confusion model based on the sequence of actual speech units and the sequence of predicted speech units; and
decoding a test text using the confusion model and a language model to generate at least one model-predicted sequence of speech units. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
decoding a test text comprising actual sequences of speech units to produce sequences of predicted speech units using a confusion model that provides likelihoods for sequences of predicted speech units given sequences of actual speech units; and
determining an error rate based on a sequence of predicted speech units and the sequence of actual speech units. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
Specification