Automated speech recognition proxy system for natural language understanding
First Claim
1. A computer-implemented system for processing an interaction, the interaction including an utterance requiring recognition before being usable for further computer-implemented processing, the system comprising:
- an application configured to provide the utterance;
a proxy processing subsystem in communication with the application, the proxy processing subsystem configured to receive the utterance and initiate processing thereof;
a recognition decision engine configured to receive from the proxy processing system an utterance for recognition and ancillary information regarding the utterance for recognition, the recognition decision engine selecting, responsive to the ancillary information, one or more recognizers from a first type of recognizer subsystems and a second type of recognizer subsystems; and
a results decision engine operably coupled with the one or more recognizers and configured to return to the proxy processing subsystem a recognition result, the results decision engine further automatically updating a statistics database responsive to results of processing by the one or more recognizers.
22 Assignments
0 Petitions
Accused Products
Abstract
An interactive response system mixes HSR subsystems with ASR subsystems to facilitate overall capability of voice user interfaces. The system permits imperfect ASR subsystems to nonetheless relieve burden on HSR subsystems. An ASR proxy is used to implement an IVR system, and the proxy decides, based on a set of rules, to route an utterance solely to one ASR, to route it to an HSR in addition to at least one ASR, to route it solely to one or more HSR subsystem, to reroute an utterance originally sent to an ASR over to an HSR, to use HSRs to help tune and train one or more ASRs, and to use multiple ASRs to increase reliability of results.
64 Citations
20 Claims
-
1. A computer-implemented system for processing an interaction, the interaction including an utterance requiring recognition before being usable for further computer-implemented processing, the system comprising:
-
an application configured to provide the utterance; a proxy processing subsystem in communication with the application, the proxy processing subsystem configured to receive the utterance and initiate processing thereof; a recognition decision engine configured to receive from the proxy processing system an utterance for recognition and ancillary information regarding the utterance for recognition, the recognition decision engine selecting, responsive to the ancillary information, one or more recognizers from a first type of recognizer subsystems and a second type of recognizer subsystems; and a results decision engine operably coupled with the one or more recognizers and configured to return to the proxy processing subsystem a recognition result, the results decision engine further automatically updating a statistics database responsive to results of processing by the one or more recognizers. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method for processing an interaction comprising:
-
receiving data representing an utterance from a computer application; using a proxy processor, automatically initiating processing of the data; selecting, using ancillary information regarding the utterance, one or more recognizers from a first type of recognizer subsystems and a second type of recognizer subsystems; returning a recognition result responsive to results of processing by the one or more recognizers; and automatically updating a statistics database responsive to results of processing by the one or more recognizers. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system for processing an interaction, the interaction including an utterance requiring recognition before being usable for further computer-implemented processing, the system comprising non-transitory computer storage media storing programming instructions executable by at least one processor to:
-
receive data representing an utterance from a computer application; using a proxy processor, automatically initiate processing of the data; select, using ancillary information regarding the utterance, one or more recognizers from a first type of recognizer subsystems and a second type of recognizer subsystems; return a recognition result responsive to results of processing by the one or more recognizers; and automatically update a statistics database responsive to results of processing by the one or more recognizers. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification