Semi-automatic index term augmentation in document retrieval
First Claim
Patent Images
1. A processor-implemented method for assigning categories of items to supercategories, comprising:
- (a) assigning a subset of the categories in the collection to supercategories manually,(b) selecting a category Ci from among the categories in the collection not yet assigned to supercategories which has not yet been processed,(c) calculating a likelihood function Lik for the category Ci and a category Ck in the collection which has previously been assigned to a supercategory Sj manually, which likelihood function is based upon the likelihood that a term occurring in the category Ci also occurs in the category Ck,(d) repeating step (c) for a plurality of other categories Ck in the collection which have previously been assigned to a supercategory Sj manually,(e) assigning the category Ci to a supercategory Sj based on the likelihood functions Lik that a term occurring in the category Ci also occurs in the category Ck which is assigned to supercategory Sj, and(f) repeating steps (b)–
(e) for a plurality of other categories in the collection which have not yet been assigned to supercategories and which have not yet been processed.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are methods and systems for indexing or retrieving materials accessible through computer networks.
47 Citations
26 Claims
-
1. A processor-implemented method for assigning categories of items to supercategories, comprising:
-
(a) assigning a subset of the categories in the collection to supercategories manually, (b) selecting a category Ci from among the categories in the collection not yet assigned to supercategories which has not yet been processed, (c) calculating a likelihood function Lik for the category Ci and a category Ck in the collection which has previously been assigned to a supercategory Sj manually, which likelihood function is based upon the likelihood that a term occurring in the category Ci also occurs in the category Ck, (d) repeating step (c) for a plurality of other categories Ck in the collection which have previously been assigned to a supercategory Sj manually, (e) assigning the category Ci to a supercategory Sj based on the likelihood functions Lik that a term occurring in the category Ci also occurs in the category Ck which is assigned to supercategory Sj, and (f) repeating steps (b)–
(e) for a plurality of other categories in the collection which have not yet been assigned to supercategories and which have not yet been processed. - View Dependent Claims (2, 3, 4)
-
-
5. A processor-implemented method for assigning categories of items to supercategories, comprising:
-
(a) assigning a subset of the categories in the collection to supercategories manually, (b) selecting a category Ci from among the categories in the collection not yet assigned to supercategories which has not yet been processed, (c) selecting a supercategory Sj from among the set of supercategories, (d) calculating a likelihood function for the category Ci and a category Ck in the collection which has previously been assigned to the supercategory Sj manually, which likelihood function is based upon the likelihood that a term occurring in the category Ci also occurs in the category Ck, (e) repeating step (d) for a plurality of other categories Ck in the collection which have previously been assigned to the supercategory SJ manually, (f) calculating a total score for the category Ci for the supercategory Sj which total score is based upon the likelihood functions for the category Ci and the categories Ck in the collection which have previously been assigned to the supercategory Sj manually, (g) repeating steps (c)–
(f) for a plurality of other supercategories Sj,(h) assigning category Ci to the supercategory for which the total score calculated for the category Ci is the highest, and (i) repeating steps (b)–
(h) for a plurality of other categories in the collection which have not yet been assigned to supercategories and which have not yet been processed. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A processor-implemented device for assigning categories of items to supercategories, comprising:
-
(a) means for assigning a subset of the categories in the collection to supercategories manually, (b) means for selecting a category Ci from among the categories in the collection not yet assigned to supercategories which has not yet been processed, (c) means for calculating a likelihood function Lik for the category Ci and a category Ck in the collection which has previously been assigned to a supercategory Sj manually, which likelihood function is based upon the likelihood that a term occurring in the category Ci also occurs in the category Ck, (d) means for repeating step (c) for a plurality of other categories Ck in the collection which have previously been assigned to a supercategory Sj manually, (e) means for assigning the category Ci to a supercategory Sj based on the likelihood functions Lik that a term occurring in the category Ci also occurs in the category Ck which is assigned to supercategory Sj, and (f) means for repeating steps (b)–
(e) for a plurality of other categories in the collection which have not yet been assigned to supercategories and which have not yet been processed. - View Dependent Claims (15, 16, 17)
-
-
18. A processor-implemented device for assigning categories of items to supercategories, comprising:
-
(a) means for assigning a subset of the categories in the collection to supercategories manually, (b) means for selecting a category Ci from among the categories in the collection not yet assigned to supercategories which has not yet been processed, (c) means for selecting a supercategory Sj from among the set of supercategories, (d) means for calculating a likelihood function for the category Ci and a category Ck in the collection which has previously been assigned to the supercategory Sj manually, which likelihood function is based upon the likelihood that a term occurring in the category Ci also occurs in the category Ck, (e) means for repeating step (d) for a plurality of other categories Ck in the collection which have previously been assigned to the supercategory Sj manually, (f) means for calculating a total score for the category Ci for the supercategory Sj, which total score is based upon the likelihood functions for the category Ci and the categories Ck in the collection which have previously been assigned to the supercategory Sj manually, (g) means for repeating steps (c)–
(f) for a plurality of other supercategories Sj,(h) means for assigning category Ci to the supercategory for which the total score calculated for the category Ci is the highest, and (i) means for repeating steps (b)–
(h) for a plurality of other categories in the collection which have not yet been assigned to supercategories and which have not yet been processed. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26)
-
Specification