Multi-pass recognition of spoken dialogue
Abstract
A system and method of multi-pass recognition for conversational spoken dialogue systems includes two speech recognizers: a first recognizer that implements, for example, a statistical language model (SLM) and a second recognizer that implements, for example, a grammar-based model. A word-spotting speech recognizer may be included, as may confidence estimators for each speech recognizer. The system and method provide a multi-pass approach to speech recognition, which reevaluates speech inputs to improve recognition where confidence scores returned from confidence estimators are low.
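As a rough illustration of the fallback logic described above, the following Python sketch runs a first recognizer, checks its confidence score against a threshold, and lets a second recognizer decide the outcome only when that score is too low. It is a minimal sketch under stated assumptions, not the patented implementation: the names `multi_pass_recognize`, `toy_slm`, `toy_grammar`, and `VOCAB`, the text-in/text-out interface, and the confidence formulas are all hypothetical stand-ins for real acoustic decoding and confidence estimation.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical interface: a recognizer maps an utterance to
# (hypothesis, confidence score in [0, 1]). Real recognizers would
# decode audio; plain text keeps the sketch self-contained.
Recognizer = Callable[[str], Tuple[str, float]]


@dataclass
class Result:
    hypothesis: str
    confidence: float
    passes: int  # number of recognition passes that were run


def multi_pass_recognize(utterance: str, first: Recognizer,
                         second: Recognizer,
                         first_threshold: float) -> Result:
    """Run the second pass only if the first confidence score is low."""
    hyp1, conf1 = first(utterance)
    if conf1 >= first_threshold:
        return Result(hyp1, conf1, passes=1)
    # First confidence score is below the threshold: the second
    # hypothesis determines the outcome.
    hyp2, conf2 = second(utterance)
    return Result(hyp2, conf2, passes=2)


# Toy vocabulary for the stand-in "statistical" recognizer (assumption).
VOCAB = {"book", "a", "flight", "to", "boston"}


def toy_slm(utterance: str) -> Tuple[str, float]:
    """Stand-in for an SLM pass: confidence = share of in-vocabulary words."""
    words = utterance.lower().split()
    known = [w for w in words if w in VOCAB]
    confidence = len(known) / len(words) if words else 0.0
    return " ".join(known), confidence


def toy_grammar(utterance: str) -> Tuple[str, float]:
    """Stand-in for a grammar-based pass: matches '... to <city>'."""
    words = utterance.lower().split()
    if "to" in words[:-1]:
        city = words[words.index("to") + 1]
        return f"destination={city}", 0.9  # fixed toy confidence
    return "", 0.0


confident = multi_pass_recognize("book a flight to boston",
                                 toy_slm, toy_grammar, first_threshold=0.8)
fallback = multi_pass_recognize("um gimme something to denver",
                                toy_slm, toy_grammar, first_threshold=0.8)
```

In this sketch the first utterance is accepted after one pass, while the second falls below the assumed threshold of 0.8 and is re-evaluated by the grammar stand-in. A word-spotting recognizer could be chained as a further pass in the same way, though the ordering shown here is only an assumption.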
147 Citations
43 Claims
1. A recognition system, comprising:
a first speech recognizer to implement a first language model with an utterance from a user and to generate a first hypothesis;
a first confidence estimator to indicate a first confidence score based on the first hypothesis, the first confidence estimator being programmed with a first threshold level; and
a second speech recognizer to implement a second language model with the utterance and to generate a second hypothesis, the second hypothesis being determinative of an outcome of the system if the first confidence score is less than the first confidence threshold level.
(Dependent claims: 2-17)
18. A method of recognizing an utterance from a user, comprising:
processing the utterance through a first recognition pass;
generating a first sentence hypothesis by an implementation of a first language model during the first recognition pass;
indicating a first confidence score based upon a perceived accuracy of the first sentence hypothesis;
comparing the first confidence score to a first threshold level; and
processing the utterance through a second recognition pass if the first confidence score is less than the first threshold level.
(Dependent claims: 19-30)
31. A program code storage device, comprising:
a machine-readable storage medium; and
machine-readable program code, stored on the machine-readable storage medium, the machine-readable program code having instructions to:
process an utterance by a user through a first recognition pass,
generate a first sentence hypothesis by an implementation of a first language model during the first recognition pass,
indicate a first confidence score based upon a perceived accuracy of the first sentence hypothesis,
compare the first confidence score to a first threshold level, and
process the utterance through a second recognition pass if the first confidence score is less than the first threshold level.
(Dependent claims: 32-43)
Specification