Confidence calibration in automatic speech recognition systems
First Claim
1. A system comprising:
- one or more processors;
a memory coupled to the one or more processors;
a calibration model, dynamically selected by and implemented on the one or more processors based on a current condition, the calibration model having been trained for a usage scenario that corresponds to the current condition, the calibration model configured to receive a word confidence score and a semantic confidence score from a speech recognition engine, and configured to adjust the word confidence score to provide a calibrated word confidence score for use by an application, and further configured to adjust the semantic confidence score using the calibrated word confidence score to provide a calibrated semantic confidence score for use by the application, the calibration model having been trained for the usage scenario based upon a calibration training set obtained from at least one previous corresponding usage scenario.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is a calibration model for use in a speech recognition system. The calibration model adjusts the confidence scores output by a speech recognition engine to thereby provide an improved calibrated confidence score for use by an application. The calibration model is one that has been trained for a specific usage scenario, e.g., for that application, based upon a calibration training set obtained from a previous similar/corresponding usage scenario or scenarios. Different calibration models may be used with different usage scenarios, e.g., during different conditions. The calibration model may comprise a maximum entropy classifier with distribution constraints, trained with continuous raw confidence scores and multi-valued word tokens, and/or other distributions and extracted features.
-
Citations
20 Claims
-
1. A system comprising:
-
one or more processors; a memory coupled to the one or more processors; a calibration model, dynamically selected by and implemented on the one or more processors based on a current condition, the calibration model having been trained for a usage scenario that corresponds to the current condition, the calibration model configured to receive a word confidence score and a semantic confidence score from a speech recognition engine, and configured to adjust the word confidence score to provide a calibrated word confidence score for use by an application, and further configured to adjust the semantic confidence score using the calibrated word confidence score to provide a calibrated semantic confidence score for use by the application, the calibration model having been trained for the usage scenario based upon a calibration training set obtained from at least one previous corresponding usage scenario. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. In a computing environment, a method comprising:
training a calibration model, implemented on one or more processors, for use in adjusting confidence scores output by a speech recognizer in a usage scenario, including processing a calibration training set corresponding to the usage scenario containing words, confidence scores, and labels indicating whether each word was correctly recognized, extracting features from the calibration training set corresponding to the usage scenario, the features including word and score distribution features, keyword coverage values, and at least one of sub-word units, semantics, or sentences, and using the features and continuous confidence scores to train the calibration model for the usage scenario, wherein the calibration model is dynamically selected based upon a current condition that corresponds to the usage scenario. - View Dependent Claims (15, 16, 17, 18, 19)
-
20. One or more computer storage devices having computer-executable instructions, which in response to execution by a computer, cause the computer to perform steps comprising:
-
dynamically selecting a calibration model based on a current usage scenario, the calibration model having been trained using data obtained from one or more previous usage scenarios corresponding to the current usage scenario; receiving a raw word confidence score and a raw semantic confidence score from a speech recognition engine at the calibration model; adjusting the raw word confidence score using continuous confidence scores to output a calibrated word confidence score for the current usage scenario; and adjusting the raw semantic confidence score using the calibrated word confidence score to output a calibrated semantic confidence score for the current usage scenario.
-
Specification