Clustering user utterance intents with semantic parsing
First Claim
1. A system for training a spoken language understanding (SLU) classifier, comprising:
- one or more computing devices, said computing devices being in communication with each other via a computer network whenever there is a plurality of computing devices; and
a computer program having program modules executable by the one or more computing devices, the one or more computing devices being directed by the program modules of the computer program to,receive a corpus of user utterances,for each of the user utterances in the corpus,semantically parse the user utterance, andrepresent the result of said semantic parsing as a rooted semantic parse graph,combine the parse graphs representing all of the user utterances in the corpus into a single corpus graph that represents the semantic parses of the entire corpus and comprises a root node that is common to the parse graph representing each of the user utterances in the corpus,cluster the user utterances in the corpus into intent-wise homogeneous groups of user utterances, said clustering comprising finding subgraphs in the corpus graph that represent different groups of user utterances, each of said different groups having a similar user intent, each of the subgraphs being more specific than the root node alone and more general than the full semantic parses of the individual user utterances,use the intent-wise homogeneous groups of user utterances to train the SLU classifier, andoutput the trained SLU classifier.
1 Assignment
0 Petitions
Accused Products
Abstract
A system is provided that trains a spoken language understanding (SLU) classifier. A corpus of user utterances is received. For each of the user utterances in the corpus, the user utterance is semantically parsed, and the result of this semantic parsing is represented as a rooted semantic parse graph. The parse graphs representing all of the user utterances in the corpus are then combined into a single corpus graph that represents the semantic parses of the entire corpus. The user utterances in the corpus are then clustered into intent-wise homogeneous groups of user utterances, where this clustering includes finding subgraphs in the corpus graph that represent different groups of user utterances, and each of these different groups has a similar user intent. The intent-wise homogeneous groups of user utterances are then used to train the SLU classifier, and the trained SLU classifier is output.
25 Citations
20 Claims
-
1. A system for training a spoken language understanding (SLU) classifier, comprising:
-
one or more computing devices, said computing devices being in communication with each other via a computer network whenever there is a plurality of computing devices; and a computer program having program modules executable by the one or more computing devices, the one or more computing devices being directed by the program modules of the computer program to, receive a corpus of user utterances, for each of the user utterances in the corpus, semantically parse the user utterance, and represent the result of said semantic parsing as a rooted semantic parse graph, combine the parse graphs representing all of the user utterances in the corpus into a single corpus graph that represents the semantic parses of the entire corpus and comprises a root node that is common to the parse graph representing each of the user utterances in the corpus, cluster the user utterances in the corpus into intent-wise homogeneous groups of user utterances, said clustering comprising finding subgraphs in the corpus graph that represent different groups of user utterances, each of said different groups having a similar user intent, each of the subgraphs being more specific than the root node alone and more general than the full semantic parses of the individual user utterances, use the intent-wise homogeneous groups of user utterances to train the SLU classifier, and output the trained SLU classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An utterance intent determination system, comprising:
-
one or more computing devices, said computing devices being in communication with each other via a computer network whenever there is a plurality of computing devices; and a computer program having program modules executable by the one or more computing devices, the one or more computing devices being directed by the program modules of the computer program to, receive a corpus of user utterances, for each of the user utterances in the corpus, semantically parse the user utterance, and represent the result of said semantic parsing as a rooted semantic parse graph, combine the parse graphs representing all of the user utterances in the corpus into a single corpus graph that represents the semantic parses of the entire corpus and comprises a root node that is common to the parse graph representing each of the user utterances in the corpus, cluster the user utterances in the corpus into intent-wise homogeneous groups of user utterances, said clustering comprising finding subgraphs in the corpus graph that represent different groups of user utterances, each of said different groups having a similar user intent, each of the subgraphs being more specific than the root node alone and more general than the full semantic parses of the individual user utterances, use the intent-wise homogeneous groups of user utterances to train a spoken language understanding (SLU) classifier, receive a particular utterance input by a user, and use the trained SLU classifier to determine the intent of the user from said particular utterance. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A computer-implemented process for training a spoken language understanding (SLU) classifier, comprising the actions of:
-
using one or more computing devices to perform the following process actions, the computing devices being in communication with each other via a computer network whenever a plurality of computing devices is used; receiving a corpus of user utterances, for each of the user utterances in the corpus, semantically parsing the user utterance, and representing the result of said semantic parsing in a hierarchical structure, combining the hierarchical structures representing all of the user utterances in the corpus into a single hierarchical structure that represents the semantic parses of the entire corpus and comprises a root node that is common to the hierarchical structure representing each of the user utterances in the corpus, clustering the user utterances in the corpus into intent-wise homogeneous groups of user utterances, said clustering comprising finding substructures in the single hierarchical structure that represent different groups of user utterances, each of said different groups having a similar user intent, each of the substructures being more specific than the root node alone and more general than the full semantic parses of the individual user utterances, using the intent-wise homogeneous groups of user utterances to train the SLU classifier, and outputting the trained SLU classifier.
-
Specification