USING WORD CONFIDENCE SCORE, INSERTION AND SUBSTITUTION THRESHOLDS FOR SELECTED WORDS IN SPEECH RECOGNITION
Abstract
A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing are introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution of the WCS differs depending on whether the word was correctly identified and, if not, on the type of error. This difference is used to determine WCS thresholds for insertion and substitution errors. By processing the hypothetical word (HYP), which is the output of the decoder, a modified hypothetical word (mHYP) is determined. In some circumstances, depending on the WCS's value in relation to the insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
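The mHYP decision described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact decision rule, so the following are assumptions made only for illustration: the insertion threshold lies at or below the substitution threshold, a WCS below the insertion threshold yields null, a WCS between the two thresholds yields a substituted word when one is available, and any other WCS leaves HYP unchanged. The names `modify_hypothesis`, `ins_threshold`, `sub_threshold`, and `substitute` are hypothetical and do not come from the patent.

```python
from typing import Optional


def modify_hypothesis(
    hyp: str,
    wcs: float,
    ins_threshold: float,
    sub_threshold: float,
    substitute: Optional[str] = None,
) -> Optional[str]:
    """Return an illustrative mHYP for one hypothetical word (HYP).

    Assumed rule (not taken from the patent text):
      - WCS below the insertion threshold: treat HYP as a likely
        insertion error and return None (the "null" outcome).
      - WCS below the substitution threshold, with a substitute word
        available: return the substitute.
      - Otherwise: keep HYP as it is.
    """
    if ins_threshold > sub_threshold:
        raise ValueError("this sketch assumes ins_threshold <= sub_threshold")
    if wcs < ins_threshold:
        return None
    if wcs < sub_threshold and substitute is not None:
        return substitute
    return hyp


# Example with made-up thresholds and scores.
for score in (0.1, 0.4, 0.9):
    print(score, modify_hypothesis("two", score, 0.3, 0.6, substitute="to"))
```

In practice the thresholds would be derived from the observed WCS distributions for correct words, insertion errors, and substitution errors, as the abstract describes; the fixed numbers here are placeholders.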
24 Citations
20 Claims
1. A method for recognizing speech in acoustic data, comprising:
generating at least one hypothetical word (HYP) in a decoder;
deriving a word confidence score (WCS) for each HYP; and
determining a modified hypothetical word (mHYP) for each HYP based on the HYP and the WCS for each HYP.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14

15. A method for recognizing speech in acoustic data, comprising:
performing a tuning phase, the tuning phase further comprising;
generating a series of hypothetical words (HYP) from a tuning audio data set in a decoder; and
setting values of tunable parameters in the decoder to minimize a weighted total error rate.
Dependent claims: 16, 17

18. A system for recognizing speech in acoustic data, comprising:
means for generating at least one hypothetical word (HYP) based on the acoustic data;
means for determining a word confidence score (WCS) for each HYP; and
evaluating means for outputting a modified hypothetical word (mHYP) for each HYP based on the HYP and the WCS.
Dependent claims: 19, 20
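The tuning phase recited in claim 15 minimizes a weighted total error rate, and the abstract adds that deletion errors are weighted more heavily than substitution and insertion errors. The sketch below shows one way such a rate might be computed and used to compare candidate decoder settings. The specific weights (2.0 for deletions, 1.0 for the others), the normalization by the number of reference words, and the names `weighted_total_error_rate` and `pick_best_setting` are assumptions for illustration; the text above gives no numeric values or formula.

```python
from typing import Dict, Tuple

# Error counts for one decoder setting:
# (deletions, substitutions, insertions, reference_word_count)
Counts = Tuple[int, int, int, int]


def weighted_total_error_rate(
    n_del: int,
    n_sub: int,
    n_ins: int,
    n_ref: int,
    w_del: float = 2.0,  # assumed weight; the source only says deletions weigh more
    w_sub: float = 1.0,  # assumed weight
    w_ins: float = 1.0,  # assumed weight
) -> float:
    """Weighted sum of error counts, normalized by the reference word count."""
    if n_ref <= 0:
        raise ValueError("n_ref must be positive")
    return (w_del * n_del + w_sub * n_sub + w_ins * n_ins) / n_ref


def pick_best_setting(candidates: Dict[str, Counts]) -> str:
    """Return the label of the setting with the lowest weighted error rate."""
    return min(candidates, key=lambda k: weighted_total_error_rate(*candidates[k]))


# Two made-up settings with the same unweighted error count (7 errors each).
settings = {
    "setting_a": (5, 1, 1, 10),  # mostly deletions
    "setting_b": (1, 3, 3, 10),  # mostly substitutions and insertions
}
print(pick_best_setting(settings))
```

With equal unweighted error counts, the heavier deletion weight favors the setting that produces fewer deletions, which matches the stated preference for trading deletions against substitutions and insertions.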
Specification