Semantic re-ranking of NLU results in conversational dialogue applications
Abstract
A human-machine dialog system is described which has multiple computer-implemented dialog components. A user client delivers output prompts to a human user and receives dialog inputs from the human user including speech inputs. An automatic speech recognition (ASR) engine processes the speech inputs to determine corresponding sequences of representative text words. A natural language understanding (NLU) engine processes the text words to determine corresponding NLU-ranked semantic interpretations. A semantic re-ranking module re-ranks the NLU-ranked semantic interpretations based on at least one of dialog context information and world knowledge information. A dialog manager responds to the re-ranked semantic interpretations and generates the output prompts so as to manage a dialog process with the human user.
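The re-ranking step described in the abstract can be illustrated with a minimal sketch. It assumes each NLU hypothesis carries a single numeric score and that the dialog context is reduced to a set of intents the last prompt makes likely. The names `Interpretation`, `rerank`, `expected_intents`, and the additive `bonus` are hypothetical and chosen for illustration; the source does not specify a scoring formula.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interpretation:
    """One NLU hypothesis: an intent label and the score the NLU engine gave it."""
    intent: str
    nlu_score: float


def rerank(nbest, expected_intents, bonus=0.3):
    """Re-rank NLU hypotheses, boosting intents that the dialog context expects.

    The additive bonus is an assumption made for this sketch, not a formula
    taken from the source.
    """
    def combined(hyp):
        return hyp.nlu_score + (bonus if hyp.intent in expected_intents else 0.0)

    return sorted(nbest, key=combined, reverse=True)


# NLU-ranked list for the latest user response (highest NLU score first).
nbest = [
    Interpretation("play_music", 0.55),
    Interpretation("confirm_booking", 0.45),
]

# Hypothetical context: the previous prompt asked the user to confirm a booking.
reranked = rerank(nbest, expected_intents={"confirm_booking"})
print(reranked[0].intent)
```

With an empty set of expected intents, the NLU order is returned unchanged, so in this sketch the re-ranker only overrides the NLU engine when the context supplies evidence.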
17 Claims
1. A method comprising:
receiving, by a computing system, data generated based on spoken user responses to prompts associated with a dialogue, the data comprising a portion corresponding to one or more of the spoken user responses spoken before a subsequently spoken response of the spoken user responses;
generating, by at least one processor of the computing system, a list of natural language understanding (NLU)-ranked semantic interpretations for the subsequently spoken response;
determining, by the computing system and based on the portion corresponding to the one or more of the spoken user responses spoken before the subsequently spoken response, a plurality of key-value pairs corresponding to different possible resolutions for an anaphora in the subsequently spoken response; and
re-ranking, by the computing system and based on the plurality of key-value pairs, the list of NLU-ranked semantic interpretations to identify a semantic interpretation that resolves the anaphora in the subsequently spoken response based on a context of the dialogue determined from the one or more of the spoken user responses spoken before the subsequently spoken response.
Dependent claims: 2, 3, 4, 5, 6
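As an illustration only, the four steps of claim 1 can be sketched as follows. The sketch assumes that earlier user responses have already been reduced to slot dictionaries, that an unresolved anaphor appears in a hypothesis as a sentinel slot value, and that a fixed bonus is added for each anaphor the context can resolve. `Hypothesis`, `ANAPHOR`, `candidate_resolutions`, `rerank_with_context`, and `bonus` are hypothetical names; the claim does not prescribe data structures or a scoring rule.

```python
from dataclasses import dataclass

ANAPHOR = "<anaphor>"  # hypothetical sentinel marking an unresolved reference


@dataclass
class Hypothesis:
    intent: str
    slots: dict
    score: float


def candidate_resolutions(history):
    """Collect key-value pairs from earlier responses as possible referents."""
    pairs = []
    for turn_slots in history:
        pairs.extend(turn_slots.items())
    return pairs


def rerank_with_context(nbest, pairs, bonus=0.5):
    """Fill anaphoric slots from the key-value pairs and re-sort by score.

    The fixed bonus per resolved anaphor is an assumption of this sketch.
    """
    latest = dict(pairs)  # a later turn overrides an earlier one for the same key
    rescored = []
    for hyp in nbest:
        slots = dict(hyp.slots)
        score = hyp.score
        for key, value in hyp.slots.items():
            if value == ANAPHOR and key in latest:
                slots[key] = latest[key]
                score += bonus
        rescored.append(Hypothesis(hyp.intent, slots, score))
    return sorted(rescored, key=lambda h: h.score, reverse=True)


# Earlier responses, e.g. "flights to Boston" followed by "on Friday".
history = [{"city": "Boston"}, {"date": "Friday"}]

# NLU-ranked list for the later response "what's the weather there?".
nbest = [
    Hypothesis("play_song", {"song": ANAPHOR}, 0.6),
    Hypothesis("get_weather", {"city": ANAPHOR}, 0.4),
]

top = rerank_with_context(nbest, candidate_resolutions(history))[0]
print(top.intent, top.slots)
```

Here the NLU engine alone prefers `play_song`, but only the `get_weather` hypothesis has an anaphoric slot that an earlier response can fill, so it moves to the top with `city` resolved to `Boston`.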
7. A system comprising:
at least one processor; and
a memory storing instructions that when executed by the at least one processor cause the system to:
receive data generated based on spoken user responses to prompts associated with a dialogue, the data comprising a portion corresponding to one or more of the spoken user responses spoken before a subsequently spoken response of the spoken user responses;
generate a list of natural language understanding (NLU)-ranked semantic interpretations for the subsequently spoken response;
determine, based on the portion corresponding to the one or more of the spoken user responses spoken before the subsequently spoken response, a plurality of key-value pairs corresponding to different possible resolutions for an anaphora in the subsequently spoken response; and
re-rank, based on the plurality of key-value pairs, the list of NLU-ranked semantic interpretations to identify a semantic interpretation that resolves the anaphora in the subsequently spoken response based on a context of the dialogue determined from the one or more of the spoken user responses spoken before the subsequently spoken response.
Dependent claims: 8, 9, 10, 11, 12
13. One or more non-transitory computer-readable media comprising instructions that when executed by one or more computers cause the one or more computers to:
receive data generated based on spoken user responses to prompts associated with a dialogue, the data comprising a portion corresponding to one or more of the spoken user responses spoken before a subsequently spoken response of the spoken user responses;
generate a list of natural language understanding (NLU)-ranked semantic interpretations for the subsequently spoken response;
determine, based on the portion corresponding to the one or more of the spoken user responses spoken before the subsequently spoken response, a plurality of key-value pairs corresponding to different possible resolutions for an anaphora in the subsequently spoken response; and
re-rank, based on the plurality of key-value pairs, the list of NLU-ranked semantic interpretations to identify a semantic interpretation that resolves the anaphora in the subsequently spoken response based on a context of the dialogue determined from the one or more of the spoken user responses spoken before the subsequently spoken response.
Dependent claims: 14, 15, 16, 17
Specification