Method for estimating a confidence measure for a speech recognition system
First Claim
1. A method of estimating a confidence measure for a speech recognition system, the method comprising the steps of:
- receiving an input utterance;
comparing the input utterance with a plurality of predetermined models of possible utterances to provide a plurality of scores indicating a degree of similarity between the input utterance and the plurality of predetermining models;
determining a variance of a predetermined number of the plurality of scores; and
normalizing the variance to provide a confidence measure for a likely recognition result for the in put utterance, wherein the confidence measure (CM) is calculated by;
where CM is the confidence measure, N is a predetermined number of N-Best scores, Si is the i-th best score, and μ
is the mean calculated by;
4 Assignments
0 Petitions
Accused Products
Abstract
A method of estimating a confidence measure for a speech recognition system, involves comparing an input speech signal with a number of predetermined models of possible speech signals. Best scores indicating the degree of similarity between the input speech signal and each of the predetermined models are then used to determine a normalized variance, which is used as the Confidence Measure, in order to determine whether the input speech signal has been correctly recognized, the Confidence Measure is compared to a threshold value. The threshold value is weighted according to the Signal to Noise Ratio of the input speech signal and according to the number of predetermined models used.
-
Citations
10 Claims
-
1. A method of estimating a confidence measure for a speech recognition system, the method comprising the steps of:
-
receiving an input utterance;
comparing the input utterance with a plurality of predetermined models of possible utterances to provide a plurality of scores indicating a degree of similarity between the input utterance and the plurality of predetermining models;
determining a variance of a predetermined number of the plurality of scores; and
normalizing the variance to provide a confidence measure for a likely recognition result for the in put utterance, wherein the confidence measure (CM) is calculated by;
where CM is the confidence measure, N is a predetermined number of N-Best scores, Si is the i-th best score, and μ
is the mean calculated by;
-
-
2. A method of determining whether an input utterance to a speech recognition system is correctly recognized by the system the method comprising the steps of:
-
determining a likely recognition result for an input utterance;
comparing the input utterance with a plurality of predetermined models of possible utterances to provide a plurality of scores indicating a degree of similarity between the input utterance and the plurality of predetermined models;
determining a variance of a predetermined number of the plurality of scores;
normalizing the variance to provide an estimated confidence measure for a likely recognition result for the input utterance;
determining a threshold comprising weighting the threshold depending on the noise level in an input signal containing the input utterance;
comparing the threshold with the confidence measure; and
accepting or rejecting the recognition result according to whether the confidence measure is above or below the threshold.- View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10)
-
8. A method according to claim 2, wherein the step of determining a threshold comprises weighting the threshold depending on the number of predetermined models that the input utterance is compared with.
-
9. A method according to claim 8, wherein the weighting (W) is given by
-
= α + β × - VS / γ where the number of predetermined models (VS) is 2 or more.
-
-
10. A method according to claim 9, wherein
α - =0.6;
β
=1.08; and
γ
=10.0.
- =0.6;
-
Specification