Transaction system based on a bidirectional speech channel by status graph building and problem detection for a human user
First Claim
1. A transaction system based on machine-based speech recognition, comprising a speech input, speech recognition means fed by said speech input, dialogue control means fed by said speech recognition means, and speech generation means fed by said dialogue control means for outputting question statements and verifier statements from a statement repertoire set, and for executing a human-machine dialogue with said system, until said dialogue control means has recognized a viable transaction formulation comprising a plurality of user-provided slot fillers for specifying said transaction, characterized in that said dialogue control means comprise with respect to a particular transaction:
- status graph building means for building one or more directed and loopless status graphs with nodes that each comprise a respective one of said slot fillers and an associated verifier metric, and which nodes are interrelated through logic combinations among their respective slot fillers, and said graph building means including amending means for amending a node's metric, and discard means for under control of conflict detection or lowering of a particular node's metric, discarding the node in question, including of all nodes derived from the so discarded node, and also one-to-one derival nodes of the discarded node,
- problem detector means for detecting a particular subset of problems associated with a particular statement outputted by said speech generation means regarding a particular slot filler item, as combined with an associated subsequent human speech unsatisfactory recognition result, and thereupon controlling said speech generation means for outputting a subsequent question statement for ascertaining the latter slot filler.
Abstract
A transaction system has machine recognition of speech. It has dialogue control fed by the recognition, and speech generation fed by the dialogue control for outputting question and verifier statements from a repertoire set. A human-machine dialogue is executed until the dialogue control has recognized a viable transaction formulation with a plurality of user-provided slot fillers to specify the transaction. Dialogue control builds a directed and loopless status graph with nodes that each have their own slot filler and associated metric, and are interrelated through logic relations. The building can amend a node's metric and under control of conflict detection or lowering of a particular node's metric, discard the node in question and its filler, including of derived nodes and also of one-to-one derival nodes of the discarded node. A subset of problems is detected for a particular item outputted by the speech generation for ascertaining a particular slot filler, as combined with a subsequent unsatisfactory recognition. Then the speech generation outputs a further question statement for ascertaining the latter slot filler.
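The status graph described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `Node`, `StatusGraph`, `add`, `amend` and `discard`, the numeric threshold of 0.3, and the travel-booking slot names are all assumptions made for illustration. In particular, "one-to-one derival node" is read here as the single node from which a discarded node was derived, which is one possible interpretation of the wording.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    slot: str
    filler: str
    metric: float                                   # verifier metric (assumed numeric)
    parents: list = field(default_factory=list)     # names of nodes this one was derived from
    children: list = field(default_factory=list)    # names of nodes derived from this one


class StatusGraph:
    """Directed, loopless graph: a node may only derive from nodes that already exist."""

    def __init__(self, threshold=0.3):
        self.nodes = {}
        self.threshold = threshold                  # hypothetical cut-off for discarding

    def add(self, name, slot, filler, metric, derived_from=()):
        for parent in derived_from:
            self.nodes[parent].children.append(name)    # KeyError if the parent is unknown
        self.nodes[name] = Node(slot, filler, metric, list(derived_from))

    def amend(self, name, metric):
        """Change a node's metric; a metric lowered below the threshold discards the node."""
        self.nodes[name].metric = metric
        if metric < self.threshold:
            self.discard(name)

    def discard(self, name):
        node = self.nodes.pop(name, None)
        if node is None:                            # already discarded
            return
        for child in list(node.children):           # all nodes derived from this one
            self.discard(child)
        for parent_name in node.parents:
            parent = self.nodes.get(parent_name)
            if parent is None:
                continue
            parent.children.remove(name)
            if len(node.parents) == 1:              # one-to-one derivation (assumed reading)
                self.discard(parent_name)


g = StatusGraph()
g.add("origin", "origin", "Bonn", 0.9)
g.add("destination", "destination", "Berlin", 0.8)
g.add("route", "route", "Bonn-Berlin", 0.8, derived_from=("origin", "destination"))
g.add("date_rel", "date", "tomorrow", 0.7)
g.add("date_abs", "date", "Tuesday", 0.7, derived_from=("date_rel",))

g.amend("date_abs", 0.1)    # discards date_abs and its one-to-one source date_rel
g.amend("origin", 0.1)      # discards origin and the derived route; destination survives
```

In this sketch a conflict would be handled by calling `discard` directly, while a lowered metric goes through `amend`. Storing parent and child names on each node keeps the cascade a simple recursive walk.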
14 Claims
1. A transaction system based on machine-based speech recognition, comprising a speech input, speech recognition means fed by said speech input, dialogue control means fed by said speech recognition means, and speech generation means fed by said dialogue control means for outputting question statements and verifier statements from a statement repertoire set, and for executing a human-machine dialogue with said system, until said dialogue control means has recognized a viable transaction formulation comprising a plurality of user-provided slot fillers for specifying said transaction,
characterized in that said dialogue control means comprise with respect to a particular transaction:
- status graph building means for building one or more directed and loopless status graphs with nodes that each comprise a respective one of said slot fillers and an associated verifier metric, and which nodes are interrelated through logic combinations among their respective slot fillers, and said graph building means including amending means for amending a node's metric, and discard means for under control of conflict detection or lowering of a particular node's metric, discarding the node in question, including of all nodes derived from the so discarded node, and also one-to-one derival nodes of the discarded node,
- problem detector means for detecting a particular subset of problems associated with a particular statement outputted by said speech generation means regarding a particular slot filler item, as combined with an associated subsequent human speech unsatisfactory recognition result, and thereupon controlling said speech generation means for outputting a subsequent question statement for ascertaining the latter slot filler.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method for implementing a human-machine transaction based on machine recognizing of speech, said method comprising the steps of:
receiving and recognizing human speech, controlling a dialog on the basis of said recognized speech, outputting in the form of machine speech various question statements and verifier statements from a statement repertoire set, and executing a human-machine dialog until recognizing a viable transaction formulation comprising a plurality of user-provided slot fillers for specifying said transaction, being characterized in with respect to a particular transaction, comprising the steps of; building one or more directed and loopless status graphs with nodes that each comprise a respective one of said slot fillers and an associated verifier metric, whilst interrelating said nodes by logic combinations among their respective slot fillers, if applicable, amending said status graph as regarding a node's metric and selectively, under control of conflict detection or lowering of a particular node's metric, discarding the node in question, including of all nodes derived from the discarded node, and also one-to-one derival nodes of any discarded node, detecting a particular subset of problems associated with a particular said statement regarding a particular slot filler, as combined with an associated subsequent unsatisfactory human speech recognition result, and thereupon outputting a subsequent question statement for ascertaining the latter slot filler.
Dependent claims: 9, 10, 11, 12, 13, 14
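The problem-detection step of the method can be sketched in Python as follows. The claim does not say which problems belong to the detected subset, so the two categories used here (`no_input`, `low_confidence`), the confidence threshold of 0.5, the `Recognition` type and the repertoire wording are all hypothetical and serve only to show the control flow: a statement about a slot is followed by an unsatisfactory recognition result, which triggers a further question about the same slot.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Recognition:
    text: Optional[str]     # None when nothing usable was recognized
    confidence: float       # assumed recognizer score in [0, 1]


# Hypothetical statement repertoire: one follow-up question per slot and problem.
REPERTOIRE = {
    "date": {
        "no_input": "On which day do you want to travel?",
        "low_confidence": "Sorry, on which day did you say?",
    },
}


def detect_problem(result, min_confidence=0.5):
    """Classify an unsatisfactory recognition result; None means no problem."""
    if result.text is None:
        return "no_input"
    if result.confidence < min_confidence:
        return "low_confidence"
    return None


def follow_up(slot, result, repertoire=REPERTOIRE):
    """Return the next question for the same slot, or None if recognition was fine."""
    problem = detect_problem(result)
    return None if problem is None else repertoire[slot][problem]


question = follow_up("date", Recognition("tuesday", 0.2))
```

Here `question` holds the low-confidence follow-up for the `date` slot; a satisfactory result would return `None` and let the dialogue move on.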
Specification