Taxonomy-Based Object Classification
Abstract
Objects, such as documents, are classified according to a taxonomy. The taxonomy includes nodes, corresponding to object classes, arranged in a hierarchy. Class keywords are associated with the nodes. Search strings are formed for the classes by traversing the taxonomic branches and concatenating the keywords associated with the classes. For each object to be classified, a search engine is used to perform searches on the object using the search strings. The searches produce search scores for each search string. Each object is classified by identifying the class(es) corresponding to the highest search score(s) for the object, and classifying the object into the identified class(es).
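The flow described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Node` class, the example taxonomy, and `toy_search_score` (a term-matching stand-in for a real search engine) are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Node:
    """A taxonomy node: a class name, its class keywords, and child nodes."""
    name: str
    keywords: List[str]
    children: List["Node"] = field(default_factory=list)


def build_search_strings(root: Node) -> Dict[str, str]:
    """Concatenate keywords along each branch, from the root down to each node."""
    strings: Dict[str, str] = {}

    def walk(node: Node, inherited: List[str]) -> None:
        branch_keywords = inherited + node.keywords
        strings[node.name] = " ".join(branch_keywords)
        for child in node.children:
            walk(child, branch_keywords)

    walk(root, [])
    return strings


def toy_search_score(text: str, search_string: str) -> int:
    """Hypothetical stand-in for a search engine: count query terms found in the text."""
    words = set(text.lower().split())
    return sum(term in words for term in search_string.lower().split())


def classify(text: str, root: Node) -> Tuple[str, Dict[str, int]]:
    """Score the object against every search string and pick the top-scoring class."""
    strings = build_search_strings(root)
    scores = {name: toy_search_score(text, s) for name, s in strings.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical two-level taxonomy used only for illustration.
taxonomy = Node("vehicles", ["vehicle"], [
    Node("cars", ["car", "sedan"]),
    Node("boats", ["boat", "sail"]),
])
best_class, all_scores = classify("the sedan is a car and a vehicle", taxonomy)
print(best_class, all_scores)
```

A production search engine would return relevance scores rather than raw term matches, so the ranking behavior of this sketch should not be read as the patent's scoring method.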
69 Claims
1. A computer-implemented method for classifying an object in a taxonomy, the taxonomy including a plurality of nodes associated with a plurality of classes, the method comprising:
(A) identifying a plurality of search strings based on a plurality of class keywords associated with the plurality of nodes;
(B) using a search engine to perform a plurality of searches on the object using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings;
(C) identifying one of the plurality of nodes based on the plurality of search scores; and
(D) classifying the object in a class associated with the identified node.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11

12. A computer-implemented apparatus for classifying an object in a taxonomy, the taxonomy including a plurality of nodes associated with a plurality of classes, the apparatus comprising:
search string identification means for identifying a plurality of search strings based on a plurality of class keywords associated with the plurality of nodes;
search means for performing a plurality of searches on the object using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings;
node identification means for identifying one of the plurality of nodes based on the plurality of search scores; and
classification means for classifying the object in a class associated with the identified node.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 48

24. A computer-implemented method comprising:
(A) identifying a class in a taxonomy;
(B) identifying a plurality of search strings based on a plurality of class keywords associated with the class;
(C) using a search engine to perform a plurality of searches on a plurality of objects using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings;
(D) identifying a plurality of aggregate search scores corresponding to the plurality of search strings;
(E) identifying one of the plurality of search strings having a maximum one of the plurality of aggregate search scores;
(F) identifying one of the plurality of class keywords corresponding to the identified one of the plurality of search strings; and
(G) selecting the identified one of the plurality of class keywords as a label for the class.
Dependent claims: 25, 26, 27, 28, 29, 30

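The label-selection steps of claim 24 can be illustrated with a short Python sketch. It assumes one search string per class keyword and a plain sum as the aggregate score; neither choice is stated in the claim. `toy_search_score` and `select_label` are hypothetical names, and the scoring function is a stand-in for a real search engine.

```python
from typing import Dict, List, Tuple


def toy_search_score(text: str, search_string: str) -> int:
    """Hypothetical stand-in for a search engine: count occurrences of the query terms."""
    words = text.lower().split()
    return sum(words.count(term) for term in search_string.lower().split())


def select_label(class_keywords: List[str], documents: List[str]) -> Tuple[str, Dict[str, int]]:
    """Pick the keyword whose search string has the highest aggregate score."""
    aggregate: Dict[str, int] = {}
    for keyword in class_keywords:
        search_string = keyword  # assumption: one search string per keyword
        # Assumption: the aggregate score is the sum over all objects searched.
        aggregate[keyword] = sum(toy_search_score(doc, search_string) for doc in documents)
    label = max(aggregate, key=aggregate.get)
    return label, aggregate


docs = ["the car dealer sold a car", "an automobile is a car", "auto parts"]
label, totals = select_label(["automobile", "car", "auto"], docs)
print(label, totals)
```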
31. A computer-implemented apparatus comprising:
class identification means for identifying a class in a taxonomy;
search string identification means for identifying a plurality of search strings based on a plurality of class keywords associated with the class;
means for performing a plurality of searches on a plurality of objects using the plurality of search strings to produce a plurality of search scores corresponding to the plurality of search strings;
means for identifying a plurality of aggregate search scores corresponding to the plurality of search strings;
means for identifying one of the plurality of search strings having a maximum one of the plurality of aggregate search scores;
means for identifying one of the plurality of class keywords corresponding to the identified one of the plurality of search strings; and
means for selecting the identified one of the plurality of class keywords as a label for the class.
Dependent claims: 32, 33, 34, 35, 36, 37, 38

39. A computer-implemented method comprising:
(A) identifying a first plurality of documents associated with a user;
(B) identifying a plurality of search strings based on a plurality of class keywords associated with a plurality of nodes in a taxonomy, the plurality of nodes being associated with a first plurality of classes;
(C) identifying a second plurality of classes, in the taxonomy, associated with the first plurality of documents by performing a plurality of searches on the first plurality of documents using the identified plurality of search strings; and
(D) associating the second plurality of classes with the user.
Dependent claims: 40, 41, 42, 43, 44, 45

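One way to picture claim 39 is as building a user profile from the classes of the user's documents. The Python sketch below assumes each document contributes its single best-scoring class and that documents with no matches contribute nothing; the claim does not specify either rule. The function names and sample data are hypothetical.

```python
from typing import Dict, List, Set


def toy_search_score(text: str, search_string: str) -> int:
    """Hypothetical stand-in for a search engine: count query terms found in the text."""
    words = set(text.lower().split())
    return sum(term in words for term in search_string.lower().split())


def classes_for_user(user_documents: List[str], search_strings: Dict[str, str]) -> Set[str]:
    """Collect the best-scoring class of each of the user's documents."""
    profile: Set[str] = set()
    for doc in user_documents:
        scores = {cls: toy_search_score(doc, s) for cls, s in search_strings.items()}
        best = max(scores, key=scores.get)
        if scores[best] > 0:  # assumption: ignore documents that match no class
            profile.add(best)
    return profile


search_strings = {"cars": "vehicle car sedan", "boats": "vehicle boat sail"}
user_docs = ["my car is a sedan", "we sail the boat on weekends", "nothing relevant here"]
print(classes_for_user(user_docs, search_strings))
```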
46. A computer-implemented apparatus comprising:
means for identifying a first plurality of documents associated with a user;
search string identification means for identifying a plurality of search strings based on a plurality of class keywords associated with a plurality of nodes in a taxonomy, the plurality of nodes being associated with a first plurality of classes;
means for identifying a second plurality of classes, in the taxonomy, associated with the first plurality of documents by performing a plurality of searches on the first plurality of documents using the identified plurality of search strings; and
means for associating the second plurality of classes with the user.
Dependent claims: 47, 49, 50, 51, 52

53. A computer-implemented method comprising:
(A) identifying a first plurality of classes, in a taxonomy, associated with a first plurality of documents;
(B) identifying a second plurality of classes, in the taxonomy, associated with a user;
(C) identifying a subset of the first plurality of documents based on a comparison between the first plurality of classes and the second plurality of classes; and
(D) associating the subset of the first plurality of documents with the user.

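The comparison step in claim 53 is left open by the claim language. As a minimal sketch, the Python below assumes the comparison is a set intersection: a document is selected when it shares at least one class with the user. The function name and the sample identifiers are hypothetical.

```python
from typing import Dict, List, Set


def documents_for_user(document_classes: Dict[str, Set[str]], user_classes: Set[str]) -> List[str]:
    """Select documents that share at least one taxonomy class with the user."""
    # Assumption: "comparison" means a non-empty intersection of class sets.
    return [doc_id for doc_id, classes in document_classes.items() if classes & user_classes]


document_classes = {
    "d1": {"cars"},
    "d2": {"boats"},
    "d3": {"cars", "planes"},
}
print(documents_for_user(document_classes, {"cars"}))
```

Other comparisons, such as requiring a minimum overlap or weighting classes by taxonomy depth, would fit the same interface.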
54. A computer-implemented apparatus comprising:
means for identifying a first plurality of classes, in a taxonomy, associated with a first plurality of documents;
means for identifying a second plurality of classes, in the taxonomy, associated with a user;
means for identifying a subset of the first plurality of documents based on a comparison between the first plurality of classes and the second plurality of classes; and
means for associating the subset of the first plurality of documents with the user.

55. A computer-implemented method comprising:
(A) identifying a first plurality of classes, in a taxonomy, associated with a plurality of documents;
(B) modifying a plurality of class keywords associated with a second plurality of classes in the taxonomy;
(C) using a search engine to perform a plurality of searches on the plurality of documents using a plurality of search strings based on the modified plurality of class keywords to produce a plurality of search scores corresponding to the plurality of search strings;
(D) identifying a subset of the modified plurality of class keywords based on the plurality of search scores; and
(E) modifying the taxonomy in accordance with the identified subset of the modified plurality of class keywords.
Dependent claims: 56, 57, 58, 59

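Claim 55 does not say how keywords are modified or how the subset is chosen. The Python sketch below is one possible reading, under loud assumptions: the modification is a list of candidate keywords proposed per class, each candidate is searched on its own, and a candidate is kept when its total score across the documents meets a threshold. All names, the threshold, and the scoring function are hypothetical.

```python
from typing import Dict, List


def toy_search_score(text: str, search_string: str) -> int:
    """Hypothetical stand-in for a search engine: count occurrences of the query terms."""
    words = text.lower().split()
    return sum(words.count(term) for term in search_string.lower().split())


def refine_keywords(
    taxonomy: Dict[str, List[str]],
    candidates: Dict[str, List[str]],
    documents: List[str],
    min_total: int = 1,
) -> Dict[str, List[str]]:
    """Return a copy of the taxonomy with well-scoring candidate keywords added."""
    updated = {cls: list(keywords) for cls, keywords in taxonomy.items()}
    for cls, new_keywords in candidates.items():
        kept = updated.setdefault(cls, [])
        for keyword in new_keywords:
            total = sum(toy_search_score(doc, keyword) for doc in documents)
            # Assumption: keep a candidate only if it scores at least min_total overall.
            if total >= min_total and keyword not in kept:
                kept.append(keyword)
    return updated


base = {"cars": ["car"]}
proposed = {"cars": ["sedan", "zeppelin"]}
corpus = ["a sedan is a car", "the sedan was red"]
print(refine_keywords(base, proposed, corpus))
```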
60. A computer-implemented apparatus comprising:
means for identifying a first plurality of classes, in a taxonomy, associated with a plurality of documents;
means for modifying a plurality of class keywords associated with a second plurality of classes in the taxonomy;
means for performing a plurality of searches on the plurality of documents using a plurality of search strings based on the modified plurality of class keywords to produce a plurality of search scores corresponding to the plurality of search strings;
means for identifying a subset of the modified plurality of class keywords based on the plurality of search scores; and
means for modifying the taxonomy in accordance with the identified subset of the modified plurality of class keywords.
Dependent claims: 61, 62, 63, 64, 65

66. A computer-implemented method comprising:
(A) identifying a plurality of search strings based on a plurality of class keywords associated with a plurality of nodes in a taxonomy, the plurality of nodes corresponding to a plurality of classes, each of the plurality of search strings corresponding to a distinct one of the plurality of classes;
(B) using a search engine to perform a plurality of searches on a plurality of documents using the plurality of search strings to identify a plurality of subsets of the plurality of documents, the plurality of subsets corresponding to the plurality of search strings; and
(C) associating each of the plurality of subsets of the plurality of documents with the class corresponding to the search string which resulted in identifying the subset.
Dependent claims: 67

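Claim 66 runs the search in the other direction: each class's search string is run over the whole collection, and the hits become that class's documents. A minimal Python sketch follows, assuming a hit is any document whose score reaches a hypothetical `min_score` threshold; the scoring function is again a toy stand-in for a search engine.

```python
from typing import Dict, List


def toy_search_score(text: str, search_string: str) -> int:
    """Hypothetical stand-in for a search engine: count query terms found in the text."""
    words = set(text.lower().split())
    return sum(term in words for term in search_string.lower().split())


def subsets_by_class(
    documents: Dict[str, str],
    search_strings: Dict[str, str],
    min_score: int = 1,
) -> Dict[str, List[str]]:
    """Associate each class with the documents returned by its search string."""
    return {
        cls: [doc_id for doc_id, text in documents.items()
              if toy_search_score(text, query) >= min_score]
        for cls, query in search_strings.items()
    }


collection = {"d1": "my car is a sedan", "d2": "we sail the boat", "d3": "a car on a boat"}
queries = {"cars": "car sedan", "boats": "boat sail"}
print(subsets_by_class(collection, queries))
```

Note that under this reading a document can land in more than one class, as `d3` does here.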
68. A computer-implemented apparatus comprising:
search string identification means for identifying a plurality of search strings based on a plurality of class keywords associated with a plurality of nodes in a taxonomy, the plurality of nodes corresponding to a plurality of classes, each of the plurality of search strings corresponding to a distinct one of the plurality of classes;
search means for using a search engine to perform a plurality of searches on a plurality of documents using the plurality of search strings to identify a plurality of subsets of the plurality of documents, the plurality of subsets corresponding to the plurality of search strings; and
means for associating each of the plurality of subsets of the plurality of documents with the class corresponding to the search string which resulted in identifying the subset.
Dependent claims: 69

Specification