Taxonomy-based object classification
First Claim
1. A computer-implemented method for classifying an object in a taxonomy, the taxonomy including a plurality of nodes associated with a plurality of classes, the object being external to the taxonomy, the method comprising:
- (A) identifying a plurality of search strings based on a first plurality of class keywords associated with the plurality of nodes, wherein identifying comprises, for at least one particular node N in the plurality of nodes;
(A)(1) traversing a first branch in the taxonomy linking a root node of the taxonomy to node N; and
(A)(2) for each node M in the first branch, selecting a class keyword associated with node M and adding the selected class keyword to a first one of the plurality of search strings associated with node N, whereby the first one of the plurality of search strings contains a second plurality of class keywords;
(B) using a search engine to perform a plurality of searches on the object using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings;
(C) identifying one of the plurality of nodes based on the plurality of search scores; and
(D) classifying the object in a class associated with the identified node.
3 Assignments
0 Petitions
Accused Products
Abstract
Objects, such as documents, are classified according to a taxonomy. The taxonomy includes nodes, corresponding to object classes, arranged in a hierarchy. Class keywords are associated with the nodes. Search strings are formed for the classes by traversing the taxonomic branches and concatenating the keywords associated with the classes. For each object to be classified, a search engine is used to perform searches on the object using the search strings. The searches produce search scores for each search string. Each object is classified by identifying the class(es) corresponding to the highest search score(s) for the object, and classifying the object into the identified class(es).
47 Citations
21 Claims
-
1. A computer-implemented method for classifying an object in a taxonomy, the taxonomy including a plurality of nodes associated with a plurality of classes, the object being external to the taxonomy, the method comprising:
-
(A) identifying a plurality of search strings based on a first plurality of class keywords associated with the plurality of nodes, wherein identifying comprises, for at least one particular node N in the plurality of nodes; (A)(1) traversing a first branch in the taxonomy linking a root node of the taxonomy to node N; and (A)(2) for each node M in the first branch, selecting a class keyword associated with node M and adding the selected class keyword to a first one of the plurality of search strings associated with node N, whereby the first one of the plurality of search strings contains a second plurality of class keywords; (B) using a search engine to perform a plurality of searches on the object using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings; (C) identifying one of the plurality of nodes based on the plurality of search scores; and (D) classifying the object in a class associated with the identified node. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product comprising computer-executable instructions, tangibly stored on a computer readable medium, for execution by a processor to perform a method for classifying an object in a taxonomy, the taxonomy including a plurality of nodes associated with a plurality of classes, the object being external to the taxonomy, the method comprising:
-
(A) identifying a first plurality of search strings based on a plurality of class keywords associated with the plurality of nodes, wherein identifying comprises, for at least one particular node N in the plurality of nodes; (A)(1) traversing a first branch in the taxonomy linking a root node of the taxonomy to node N; and (A)(2) for each node M in the first branch, selecting a class keyword associated with node M and adding the selected class keyword to a first one of the plurality of search strings associated with node N, whereby the first one of the plurality of search strings contains a second plurality of class keywords; (B) performing a plurality of searches on the object using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings; (C) identifying one of the plurality of nodes based on the plurality of search scores; and (D) classifying the object in a class associated with the identified node. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
Specification