Method and apparatus for finding related documents in a collection of linked documents using a bibliographic coupling link analysis
Abstract
A method and apparatus for identifying related documents in a collection of linked documents. The method analyzes the link structure among the documents. Because only the link structure is analyzed, a processing-intensive content analysis of the documents is avoided. A citation analysis technique, such as bibliographic coupling analysis, is performed on the set of documents to extract link information. For bibliographic coupling analysis, that information includes the number of other documents that both documents of a given pair link to. Using the link information, related documents are identified by a suitable analysis technique, such as clustering or spreading activation.
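As an illustration of the quantity the abstract describes, the following minimal sketch counts, for each pair of documents, how many other documents both of them link to. The function name `coupling_strengths`, the dictionary layout, and the toy data are assumptions made for this example; they are not taken from the patent.

```python
from itertools import combinations


def coupling_strengths(outlinks):
    """Map each unordered document pair to the number of targets both link to.

    `outlinks` is an assumed structure: document id -> set of linked document ids.
    Pairs with no shared targets are omitted.
    """
    strengths = {}
    for a, b in combinations(sorted(outlinks), 2):
        shared = outlinks[a] & outlinks[b]  # targets cited by both documents
        if shared:
            strengths[(a, b)] = len(shared)
    return strengths


# Hypothetical toy collection, for illustration only.
outlinks = {
    "A": {"X", "Y", "Z"},
    "B": {"Y", "Z"},
    "C": {"W"},
}
strengths = coupling_strengths(outlinks)
```

Here documents A and B share two link targets (Y and Z), so their coupling strength is 2, while C shares none with either and does not appear in the result.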
11 Claims
1. A method for identifying related documents in a set of linked documents, said method comprising the steps of:
a) generating a link index for each document in said set of linked documents;
b) identifying document pairs of said set of linked documents;
c) generating bibliographic coupling strength information for each of said identified document pairs; and
d) performing a suitable analysis operation using said bibliographic coupling strength information to identify related information.
(Dependent claims: 2, 3, 4, 5)
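Steps a) through d) of claim 1 can be sketched end to end as below. This is only one possible reading: the claim leaves the "suitable analysis operation" open, so the sketch assumes a simple clustering that groups documents connected by pairs whose coupling strength meets a threshold. All function names, the `min_strength` parameter, and the sample links are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_link_index(links):
    """Step a: map each linking document to the set of documents it links to."""
    index = defaultdict(set)
    for source, target in links:
        index[source].add(target)
    return dict(index)


def coupling_table(index):
    """Steps b and c: enumerate document pairs and count their shared link targets."""
    return {
        (a, b): len(index[a] & index[b])
        for a, b in combinations(sorted(index), 2)
    }


def cluster(index, table, min_strength=1):
    """Step d (assumed analysis): connected components over pairs whose
    coupling strength is at least `min_strength`, using union-find."""
    parent = {doc: doc for doc in index}

    def find(doc):
        while parent[doc] != doc:
            parent[doc] = parent[parent[doc]]  # path halving
            doc = parent[doc]
        return doc

    for (a, b), strength in table.items():
        if strength >= min_strength:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for doc in index:
        groups[find(doc)].add(doc)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical (source, target) link pairs, for illustration only.
links = [
    ("p1", "r1"), ("p1", "r2"),
    ("p2", "r1"), ("p2", "r2"),
    ("p3", "r2"), ("p3", "r3"),
    ("p4", "r9"),
]
index = build_link_index(links)
table = coupling_table(index)
clusters = cluster(index, table, min_strength=2)
```

With `min_strength=2`, only p1 and p2 (which share r1 and r2) are grouped; p3 shares a single target with each of them and stays separate. Only documents that contain at least one link appear in the index, which is a simplification of this sketch.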
6. A computer based system for identifying related documents in a collection of linked documents, said system comprising:
means for accessing said documents in said collection of linked documents;
means for generating a citation index for a document;
means for generating document pairs from said documents in said collection of linked documents;
means for generating bibliographic coupling strength information for each of said identified document pairs; and
at least one means for performing a suitable analysis operation using said bibliographic coupling strength information to identify related information.
(Dependent claims: 7, 8, 9)
10. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for identifying related documents in a collection of linked documents, said method steps comprising:
a) generating a link index for each document in said set of linked documents;
b) identifying document pairs of said set of linked documents;
c) generating bibliographic coupling strength information for each of said identified document pairs; and
d) performing a suitable analysis operation using said bibliographic coupling strength information to identify related information.
(Dependent claims: 11)
Specification