Automatic index term augmentation in document retrieval
First Claim
Patent Images
1. A method for automatically choosing index terms to be associated with a document D, for purposes of facilitating document retrieval processes, comprising:
- creating a search query Q comprised of terms in document D;
applying the search query Q to a collection of documents C0;
selecting the N0 documents from the collection of documents C0 which achieve the highest scores upon application of the search query Q; and
selecting IT terms for use as index terms for document D from among terms in the N0 documents based upon the co-occurrence of terms in the N0 documents with terms in the document D.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are methods and systems for automatically assigning index terms to electronic documents such as Web pages or sites in a manner which may be used to facilitate the retrieval of electronic documents of interest. The method involves determining co-occurrences of terms in other documents with the electronic document, and selecting terms as index terms based upon those scores. The method permits the efficient retrieval of electronic documents.
61 Citations
46 Claims
-
1. A method for automatically choosing index terms to be associated with a document D, for purposes of facilitating document retrieval processes, comprising:
-
creating a search query Q comprised of terms in document D;
applying the search query Q to a collection of documents C0;
selecting the N0 documents from the collection of documents C0 which achieve the highest scores upon application of the search query Q; and
selecting IT terms for use as index terms for document D from among terms in the N0 documents based upon the co-occurrence of terms in the N0 documents with terms in the document D. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A device for automatically choosing index terms to be associated with a document D, for purposes of facilitating document retrieval processes, comprising:
-
(a) means for creating a search query Q comprised of terms in document D;
(b) means for applying the search query Q to a collection of documents C0;
(c) means for selecting the N0 documents from the collection of documents C0 which achieve the highest scores upon application of the search query Q; and
(d) means for selecting IT terms for use as index terms for document D from among terms in the N0 documents based upon the co-occurrence of terms in the N0 documents with terms in the document D. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A method for automatically assigning-an-index term to a document D, the method comprising:
-
selecting one or more index terms from a plurality of index terms;
identifying one or more documents of a plurality of documents to which each of the one or more index terms has been assigned;
comparing, for each of the one or more index terms, each of the identified documents to the document D;
determining a score for each of the one or more index terms based on the comparing; and
assigning the index term associated with the highest score to the document D. - View Dependent Claims (40, 41, 42)
-
-
43. A device for automatically assigning an index term to a document D, the device comprising:
-
means for selecting one or more index terms from a plurality of index terms;
means for identifying one or more documents of a plurality of documents to which each of the one or more index terms has been assigned;
means for comparing, for each of the one or more index terms, each of the identified documents to the document D;
means for determining a score for each of the one or more index terms based on the comparing; and
means for assigning the index term associated with the highest score to the document D. - View Dependent Claims (44, 45, 46)
-
Specification