Artificial intelligence system and method for auto-naming customer tree nodes in a data structure
First Claim
1. A computer-implemented method for auto-naming customer behavior tree (CBT) nodes, comprising:
- providing, by a computing device, a hierarchy of nodes at a plurality of levels of the CBT;
generating, by a processor, a first corpus comprising product description of all items in a category and product attributes for each node of a final level of the CBT;
creating, based on the first corpus, a first term-document matrix associated with each word in the first corpus and a frequency that the word appears in the first corpus;
identifying a first group of high-frequency words in the first term-document matrix;
removing the first group of the high-frequency words from the first corpus to obtain a second corpus;
creating a second term-document matrix associated with the second corpus based on each of a set of predefined rules, a value of the second term-document matrix being defined as a data set to represent a number of times each word appears in the second corpus, the set of the predefined rules comprising at least one of an n-gram frequency model, a common themes topic model, an overlapping topic model, a word vector representation model, and a full text approach model;
identifying, based on a data set of the second term-document matrix, a second group of high-frequency words to represent node names such that the second group of the high-frequency words satisfy a predefined frequency cut-off threshold;
selecting, by the processor, a best set of the predefined rules based on an automatic evaluation model;
generating a node name associated with the second group of the high-frequency words by removing a duplicate word in each node, using the best set of the predefined rules and based on a frequency ratio of each word in each node to all the nodes;
incorporating feedback associated with other nodes in the category to generate a predicted name for each node; and
selecting a final name for each node from the predicted name and the generated node name associated with the second group of the high-frequency words.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for auto-naming nodes in a behavior tree are provided. An example method can include: providing a hierarchy of tree nodes by a computing device; generating a first corpus for each node at a final level; creating a first term-document matrix associated with the first corpus; identifying a first group of high-frequency words in the first term-document matrix; removing the first group of the high-frequency words obtain a second corpus; creating a second term-document matrix based on each of a set of predefined rules; identifying a second group of high-frequency words to represent node names; selecting a best set of the predefined rules based on an automatic evaluation model; generating a node name by removing a duplicate word in each node; incorporating feedback to generate a predicted name for each node; and selecting a final name for each node from the predicted name and the generated node name.
15 Citations
20 Claims
-
1. A computer-implemented method for auto-naming customer behavior tree (CBT) nodes, comprising:
-
providing, by a computing device, a hierarchy of nodes at a plurality of levels of the CBT; generating, by a processor, a first corpus comprising product description of all items in a category and product attributes for each node of a final level of the CBT; creating, based on the first corpus, a first term-document matrix associated with each word in the first corpus and a frequency that the word appears in the first corpus; identifying a first group of high-frequency words in the first term-document matrix; removing the first group of the high-frequency words from the first corpus to obtain a second corpus; creating a second term-document matrix associated with the second corpus based on each of a set of predefined rules, a value of the second term-document matrix being defined as a data set to represent a number of times each word appears in the second corpus, the set of the predefined rules comprising at least one of an n-gram frequency model, a common themes topic model, an overlapping topic model, a word vector representation model, and a full text approach model; identifying, based on a data set of the second term-document matrix, a second group of high-frequency words to represent node names such that the second group of the high-frequency words satisfy a predefined frequency cut-off threshold; selecting, by the processor, a best set of the predefined rules based on an automatic evaluation model; generating a node name associated with the second group of the high-frequency words by removing a duplicate word in each node, using the best set of the predefined rules and based on a frequency ratio of each word in each node to all the nodes; incorporating feedback associated with other nodes in the category to generate a predicted name for each node; and selecting a final name for each node from the predicted name and the generated node name associated with the second group of the high-frequency words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for auto-naming customer behavior tree (CBT) nodes, comprising:
-
a processor of a computing device; and a computer program product comprising a non-transitory computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; providing, by a computing device, a hierarchy of nodes at a plurality of levels of the CBT; generating a first corpus comprising product descriptions of all items in a category and product attributes for each node of a final level of the CBT; creating, based on the first corpus, a first term-document matrix associated with each word in the first corpus and a frequency that the word appears in the first corpus; identifying a first group of high-frequency words in the first term-document matrix; removing the first group of the high-frequency words, the first corpus to obtain a second corpus; creating a second term-document matrix associated with the second corpus based on each of a set of predefined rules, a value of the second term-document matrix being defined as a data set to represent a number of times each word appears in the second corpus, the set of predefined rules comprising at least one of an n-gram frequency model, a common themes topic model, an overlapping topic model, a word vector representation model, and a full text approach model; identifying, based on a data set of the second term-document matrix, a second group of high-frequency words to represent node names such that the second group of the high-frequency words satisfy a predefined frequency cut-off threshold; choosing a best set of the predefined rules based on an automatic evaluation model; generating a node name associated with the second group of the high-frequency words by removing a duplicate word in each node, based on a frequency ratio of each word in each node to all the nodes; incorporating feedback associated with the nodes in the category to generate a predicted name for each node; and selecting a final name for each node from the predicted name and the generated node name associated with the second group of the high-frequency words. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification