Tuning reusable software components in a speech application
First Claim
Patent Images
1. A method of tuning reusable dialog components within a speech application comprising:
- detecting speech recognition events generated from a plurality of speech recognitions, the plurality of speech recognitions performed by a speech recognition engine for a reusable dialog component that does not include any speech recognition engine, the reusable dialog component including a field and a confidence threshold that is associated with the field and specifies a minimally acceptable confidence score for any recognition result provided for the field, the field corresponding to a piece of information for which the speech application is configured to prompt a user, the speech recognition events being generated over a plurality of interactive voice response sessions;
re-prompting the user for the piece of information if a confidence score associated with a recognition result is below the confidence threshold, the recognition result being generated from a speech recognition performed by the speech recognition engine on a user utterance uttered in response to the speech application prompting the user for the piece of information, the confidence score being generated by the speech recognition engine, being associated with a speech recognition event generated from the speech recognition performed on the user utterance and indicating a confidence in an accuracy of the recognition result; and
automatically computing a suggested value for the confidence threshold by applying a statistical processing technique to confidence scores associated with a plurality of the speech recognition events, wherein a majority of the confidence scores for the plurality of the speech recognition events fall within a range having a low value and a high value, and wherein automatically computing the suggested value comprises computing a suggested value that is substantially equal to the low value for the range.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of tuning reusable dialog components within a speech application can include detecting speech recognition events generated from a plurality of recognitions performed for a field of a reusable dialog component. The speech recognition events can be generated over a plurality of interactive voice response sessions. The method also can include automatically computing a suggested value for a tuning parameter corresponding to the field of the reusable dialog component according, at least in part, to the speech recognition events.
43 Citations
18 Claims
-
1. A method of tuning reusable dialog components within a speech application comprising:
-
detecting speech recognition events generated from a plurality of speech recognitions, the plurality of speech recognitions performed by a speech recognition engine for a reusable dialog component that does not include any speech recognition engine, the reusable dialog component including a field and a confidence threshold that is associated with the field and specifies a minimally acceptable confidence score for any recognition result provided for the field, the field corresponding to a piece of information for which the speech application is configured to prompt a user, the speech recognition events being generated over a plurality of interactive voice response sessions; re-prompting the user for the piece of information if a confidence score associated with a recognition result is below the confidence threshold, the recognition result being generated from a speech recognition performed by the speech recognition engine on a user utterance uttered in response to the speech application prompting the user for the piece of information, the confidence score being generated by the speech recognition engine, being associated with a speech recognition event generated from the speech recognition performed on the user utterance and indicating a confidence in an accuracy of the recognition result; and automatically computing a suggested value for the confidence threshold by applying a statistical processing technique to confidence scores associated with a plurality of the speech recognition events, wherein a majority of the confidence scores for the plurality of the speech recognition events fall within a range having a low value and a high value, and wherein automatically computing the suggested value comprises computing a suggested value that is substantially equal to the low value for the range. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for tuning reusable dialog components within a speech application comprising:
-
at least one hardware processor that executes; at least one reusable dialog component that includes a field and a confidence threshold that is associated with the field and specifies a minimally acceptable confidence score for any recognition result provided for the field, the field corresponding to a piece of information for which the speech application is configured to prompt a user; a listener configured to detect speech recognition events generated during execution of the reusable dialog component, wherein the speech recognition events have a specific type and are associated with the field of the reusable dialog component, and configured to calculate a suggested value for the confidence threshold by applying a statistical processing technique to confidence scores associated with a plurality of the speech recognition events, wherein each of the plurality of speech recognition events comprises a recognition result generated by a speech recognition engine and a confidence score indicating a confidence in an accuracy of the recognition result, wherein a majority of the confidence scores for the plurality of the speech recognition events fall within a range having a low value and a high value, and wherein the listener is configured to calculate the suggested value by computing a suggested value that is substantially equal to the low value for the range; and a model configured to store the suggested value for the confidence threshold, wherein the speech application is configured to re-prompt the user for the piece of information if a confidence score associated with a recognition result is below the confidence threshold, the recognition result being provided by a speech recognition performed on a user utterance uttered in response to the speech application prompting the user for the piece of information, the confidence score being associated with a speech recognition event generated from the speech recognition performed on the user utterance. - View Dependent Claims (10, 11)
-
-
12. A tangible computer-readable medium, having stored thereon a computer program having a plurality of code sections for tuning reusable dialog components within a speech application, the computer-readable medium comprising:
-
code for detecting speech recognition events generated from a plurality of speech recognitions, the plurality of speech recognitions performed by a speech recognition engine for a reusable dialog component that does not include any speech recognition engine, the reusable dialog component including a field and a confidence threshold that is associated with the field and specifies a minimally acceptable confidence score for any recognition result provided for the field, the field corresponding to a piece of information for which the speech application is configured to prompt a user, the speech recognition events being generated over a plurality of interactive voice response sessions; code for re-prompting the user for the piece of information if a confidence score associated with a recognition result is below the confidence threshold, the recognition result being generated from a speech recognition performed by the speech recognition engine on a user utterance uttered in response to the speech application prompting the user for the piece of information, the confidence score being generated by the speech recognition engine, being associated with a speech recognition event generated from the speech recognition performed on the user utterance and indicating a confidence in an accuracy of the recognition result; and code for automatically computing a suggested value for the confidence threshold by applying a statistical processing technique to confidence scores associated with a plurality of the speech recognition events, wherein a majority of the confidence scores for the plurality of speech recognition events fall within a range having a low value and a high value, and wherein automatically computing the suggested value comprises computing a suggested value that is substantially equal to the low value for the range. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
Specification