System and method of providing an automated data-collection in spoken dialog systems
First Claim
1. A method comprising:
- training, via a processor, an automatic speech recognition module, a spoken language understanding module and a dialog management module using task-independent call-types of a previous application;
recognizing a received user utterance in response to a prompt to a user using the automatic speech recognition module to yield a recognized user utterance;
classifying the recognized user utterance using the spoken language understanding module to yield a classification;
if the recognized user utterance is classifiable by the spoken language understanding module with a confidence more than an acceptance threshold, then acting according to a call-type associated with the classification; and
if the recognized user utterance is not classifiable by the spoken language understanding module to a predetermined rejection threshold and if a dialog-wide re-prompt counter is exceeded, then transferring the user to a human, wherein the received user utterance is transcribed, labeled and used for training the spoken dialog system, andwherein at least one of the acceptance threshold and the predetermined rejection threshold are related to an entity referenced in the recognized user utterance.
4 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.
-
Citations
29 Claims
-
1. A method comprising:
-
training, via a processor, an automatic speech recognition module, a spoken language understanding module and a dialog management module using task-independent call-types of a previous application; recognizing a received user utterance in response to a prompt to a user using the automatic speech recognition module to yield a recognized user utterance; classifying the recognized user utterance using the spoken language understanding module to yield a classification; if the recognized user utterance is classifiable by the spoken language understanding module with a confidence more than an acceptance threshold, then acting according to a call-type associated with the classification; and if the recognized user utterance is not classifiable by the spoken language understanding module to a predetermined rejection threshold and if a dialog-wide re-prompt counter is exceeded, then transferring the user to a human, wherein the received user utterance is transcribed, labeled and used for training the spoken dialog system, and wherein at least one of the acceptance threshold and the predetermined rejection threshold are related to an entity referenced in the recognized user utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
a processor; means for controlling the processor to obtain task-independent training data; means for controlling the processor to present a prompt to a user; means for controlling the processor to recognize a received user utterance in response to the prompt using an automatic speech recognition engine to yield a recognized user utterance; means for controlling the processor to classify the recognized user utterance using a spoken language understanding module; means for controlling the processor to prompt the user if the recognized user utterance is classifiable by the spoken language understanding module with a confidence more than a predetermined acceptance threshold; and means for controlling the processor to transfer the user to a human if the recognized user utterance is not classifiable by the spoken language understanding module to a predetermined rejection threshold and if a dialog-wide re-prompt counter is exceeded, wherein the received user utterance is transcribed, labeled and used for training the spoken dialog system, and wherein at least one of the acceptance threshold and the predetermined rejection threshold are related to an entity referenced in the recognized user utterance. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for gathering data for use in training a spoken dialog system, the system comprising:
-
a processor; and a computer-readable storage medium storing instructions for controlling the processor to perform a method comprising; determining task-independent training data from a library formed using data from previous applications; recognizing a received user utterance in response to a prompt to a user using an automatic speech recognition engine to yield a recognized user utterance; classifying the recognized user utterance using a spoken language understanding module; and transferring the user to a human if the recognized user utterance is not classifiable by the spoken language understanding module to a predetermined rejection threshold and if a dialog-wide re-prompt counter is exceeded, wherein the received user utterance is transcribed, labeled and used for training the spoken dialog system, adjusting at least one of the acceptance threshold and the predetermined rejection threshold is based on an entity referenced in the recognized user utterance.
-
-
19. A non-transitory computer-readable medium storing instructions for controlling a computing device to gather data for use in training a spoken dialog system, the instructions comprising:
-
managing a library of training data obtained from previous applications; recognizing a received user utterance in response to a prompt to a user using an automatic speech recognition engine to yield a recognized user utterance; classifying the recognized user utterance using a spoken language understanding module; and if the recognized user utterance is not classifiable by the spoken language understanding module to a predetermined rejection threshold and if a dialog-wide re-prompt counter is exceeded, then transferring the user to a human, wherein the received user utterance is transcribed, labeled and used for training the spoken dialog system, wherein the predetermined rejection threshold is related to an entity referenced in the recognized user utterance. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A method comprising:
-
presenting a prompt to a user in a call; recognizing via a processor an utterance received in response to the prompt using an automatic speech recognition module; classifying the utterance using a generic spoken language understanding module; determining whether the recognition and classification meet a threshold; if at least one of the recognition by the automatic speech recognition module and classification by the spoken language understanding module does not meet the threshold and if a dialog-wide re-prompt counter is exceeded, then transferring the user to a human agent wherein the utterance is transcribed, labeled and used for training the spoken dialog system; and if at least one of the recognition and classification does meet the threshold, then routing the call according to a call-type associated with the classification, wherein the threshold is related to an entity referenced in the utterance. - View Dependent Claims (26, 27, 28, 29)
-
Specification