Evaluation of nodes
First Claim
Patent Images
1. A non-transitory computer-readable medium embodying a program that, when executed by at least one computing device, causes the at least one computing device to at least:
- identify a plurality of items in a node;
identify a plurality of descriptive terms associated with individual ones of the plurality of items in the node;
identify a subset of the plurality of descriptive terms;
calculate a quality score for the node reflecting a homogeneity of the node, the quality score being based at least in part upon a number of the plurality of items in the node having a description including at least one descriptive term among the subset of the plurality of descriptive terms;
determine whether the quality score for the node meets a specified threshold of homogeneity for the node;
determine an item score for at least one of the plurality of items among the node; and
determine whether the item score meets an item score threshold, the item score threshold reflecting a desired level of homogeneity among the plurality of items in the node.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
56 Citations
20 Claims
-
1. A non-transitory computer-readable medium embodying a program that, when executed by at least one computing device, causes the at least one computing device to at least:
-
identify a plurality of items in a node; identify a plurality of descriptive terms associated with individual ones of the plurality of items in the node; identify a subset of the plurality of descriptive terms; calculate a quality score for the node reflecting a homogeneity of the node, the quality score being based at least in part upon a number of the plurality of items in the node having a description including at least one descriptive term among the subset of the plurality of descriptive terms; determine whether the quality score for the node meets a specified threshold of homogeneity for the node; determine an item score for at least one of the plurality of items among the node; and determine whether the item score meets an item score threshold, the item score threshold reflecting a desired level of homogeneity among the plurality of items in the node. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a data store; and at least one computing device in communication with the data store, the at least one computing device being configured to at least; determine a plurality of descriptive terms associated with a plurality of items in a node; identify a number of highest ranked descriptive terms among the plurality of descriptive terms; determine a belongingness score for individual ones of the plurality of items based at least in part upon a number of the plurality of items in the node having a description including at least one of the highest ranked descriptive terms; calculate a quality score for the node based at least in part upon the belongingness score of the individual ones of the plurality of items in the node; determine whether the quality score for the node meets a specified threshold of homogeneity for the node; determine whether the belongingness score for an individual item among the plurality of items for the node meets an item score threshold reflecting a desired level of homogeneity among the plurality of items in the node. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method comprising:
-
identifying, via at least one computing device, a plurality of items in a node; identifying, via the at least one computing device, a plurality of descriptive terms associated with individual ones of the plurality of items in the node; identifying, via the at least one computing device, a subset of the plurality of descriptive terms; determining, via the at least one computing device, which descriptive terms among the subset of the plurality of descriptive terms are associated with the plurality of items in the node; calculating, via the at least one computing device, a quality score for the node reflecting a homogeneity of the node, the quality score being based at least in part upon a number of the plurality of items in the node having a description including at least one descriptive term among the subset of the plurality of descriptive terms determining, via the at least one computing device, whether the quality score for the node meets a specified threshold of homogeneity for the node; and determine whether an item score for at least one of the plurality of items among the node meets an item score threshold, the item score threshold reflecting a desired level of homogeneity among the plurality of items in the node. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification