Techniques for finding related hyperlinked documents using link-based analysis
Abstract
Techniques for finding related hyperlinked documents using link-based analysis are provided. Backlink and forwardlink sets can be utilized to find web pages that are related to a selected web page. The scores for links from web pages that are from the same host and links from web pages with numerous links can be reduced to achieve a better list of related web pages. The list of related web pages can be utilized as a feature to a word-based search engine or an addition to a web browser.
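The score reduction described in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so everything specific here is an assumption: the function name `link_values` is hypothetical, a link's value is divided by the number of backlink pages that share the linking page's host, and divided again by the number of distinct links on the linking page.

```python
from collections import defaultdict
from urllib.parse import urlparse


def link_values(backlink_pages):
    """Assign a value to every forward link found on the backlink pages.

    backlink_pages maps a page URL to the list of URLs it links to.
    Both damping formulas are illustrative assumptions, not the patent's.
    """
    # Count how many backlink pages share each host.
    pages_per_host = defaultdict(int)
    for page in backlink_pages:
        pages_per_host[urlparse(page).netloc] += 1

    values = {}
    for page, targets in backlink_pages.items():
        unique_targets = set(targets)
        if not unique_targets:
            continue
        # Assumed reduction for pages that come from the same host.
        host_damping = 1.0 / pages_per_host[urlparse(page).netloc]
        # Assumed reduction for pages that carry numerous links.
        fanout_damping = 1.0 / len(unique_targets)
        for target in unique_targets:
            values[(page, target)] = host_damping * fanout_damping
    return values
```

Under these assumptions, a page that is one of two backlink pages on its host and carries two links gives each link a value of 0.25, while a lone page on its host with a single link gives that link a value of 1.0.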
31 Claims
1. A method of generating lists of related documents performed by a computer system, the method comprising:
providing, by a processor of the computer system, a first set of documents that link to a particular document and a second set of documents that are pointed to by forward links in the documents of the first set;
assigning, by the processor, values to the forward links in the documents in the first set;
generating, by the processor, scores for the documents in the second set based on the assigned values of the forward links;
generating, by the processor, a first list of documents that are related to the particular document by selecting a subset of documents from the second set based on the generated scores;
generating, by the processor, a second list of related documents for at least one of the documents in the first list; and
removing, by the processor, the at least one of the documents from the first list if the particular document is not in the second list.
Dependent claims: 2-15
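The steps of claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the names `related` and `related_symmetric` are hypothetical, the link graph is a plain dictionary, every forward link is assigned the value 1.0, and the second list is generated for every document in the first list (the claim only requires at least one).

```python
from collections import defaultdict


def related(doc, links, k=3):
    """Return up to k documents related to doc, best score first.

    links maps each document to the set of documents it links to.
    """
    # First set: documents that link to doc.
    first_set = [src for src, outs in links.items() if doc in outs and src != doc]
    # Second set: documents reached by forward links from the first set.
    scores = defaultdict(float)
    for src in first_set:
        for dst in links[src]:
            if dst != doc and dst != src:
                # Assumed link value: 1.0 for every forward link.
                scores[dst] += 1.0
    # Highest score first; ties broken by name so the result is deterministic.
    ranked = sorted(scores, key=lambda d: (-scores[d], d))
    return ranked[:k]


def related_symmetric(doc, links, k=3):
    """Keep only candidates whose own related list contains doc."""
    first_list = related(doc, links, k)
    return [cand for cand in first_list if doc in related(cand, links, k)]
```

A small hypothetical graph shows the removal step at work:

```python
links = {
    "h1": {"a", "b"},
    "h2": {"a", "b"},
    "h3": {"a", "c"},
    "h4": {"c", "d", "e"},
    "h5": {"c", "d", "e"},
    "h6": {"c", "d"},
}
print(related("a", links, k=2))            # ['b', 'c']
print(related_symmetric("a", links, k=2))  # ['b']
```

Document `c` is dropped because its own top-2 related list is `['d', 'e']`, which does not contain `a`.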
16. A computer program product, embodied within a computer readable memory device, for generating lists of related documents, comprising:
computer code that receives a first set of documents that link to a particular document and a second set of documents that are pointed to by forward links in the documents of the first set;
computer code that assigns values to the forward links in the documents in the first set;
computer code that generates scores for the documents in the second set based on the assigned values of the forward links;
computer code that generates a first list of documents that are related to the particular document by selecting a subset of documents from the second set based on the generated scores;
computer code that generates a second list of related documents for at least one of the documents in the first list; and
computer code that removes the at least one of the documents from the first list if the particular document is not in the second list.
Dependent claims: 17, 18
19. A method of generating lists of related documents performed by a computer system, the method comprising:
providing, by a processor of the computer system, a first set of documents that have forward links to a particular document and a second set of documents that are pointed to by the forward links in the documents of the first set;
assigning, by the processor, values to forward links in the documents in the first set;
generating, by the processor, scores for the documents in the second set based on the assigned values of the forward links that point to the documents in the second set;
generating, by the processor, a first list of documents that are related to the particular document by selecting documents from the second set based on the generated scores;
generating, by the processor, second lists of related documents for the documents in the first list;
categorizing, by the processor, a document from the first list as being symmetrically related to the particular document when the particular document is in the second list corresponding to the document;
categorizing, by the processor, a document from the first list as being unsymmetrically related to the particular document when the particular document is not in the second list corresponding to the document;
presenting, by a network device or by a display device of the computer system, the first list of documents in a presentation order based on the categorization.
Dependent claims: 20-27
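The categorization and presentation steps of claim 19 can be illustrated with a short sketch. The claim only says the order is based on the categorization; placing symmetrically related documents first is an assumption made here, and the function name `order_by_symmetry` and its inputs are hypothetical.

```python
def order_by_symmetry(doc, first_list, second_lists):
    """Place symmetrically related documents ahead of unsymmetric ones.

    first_list is the list of documents related to doc.
    second_lists maps each document in first_list to its own related list.
    Putting the symmetric group first is an assumed presentation order.
    """
    # Symmetric: doc appears in the candidate's own related list.
    symmetric = [d for d in first_list if doc in second_lists.get(d, [])]
    # Unsymmetric: doc is missing from the candidate's own related list.
    unsymmetric = [d for d in first_list if doc not in second_lists.get(d, [])]
    # Within each group the original score order of first_list is preserved.
    return symmetric + unsymmetric
```

Unlike the removal step in claim 1, no document is discarded here; unsymmetrically related documents are only moved later in the presented list.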
28. A method of generating a list of documents related to a particular document performed by a computer system, the method comprising:
receiving, by a processor of the computer system, a first set of documents that have forward links to the particular document;
receiving, by the processor, a second set of documents that are pointed to by forward links in the documents of the first set;
assigning, by the processor, values to the forward links of the documents in the first set;
generating, by the processor, a score for each document in the second set according to the values of the forward links pointing to each document in the second set;
selecting, by the processor, documents from the second set based on the scores of the documents in the second set to generate the list of documents related to the particular document;
generating, by the processor, second lists of related documents for the documents in the list of documents related to the particular document; and
removing, by the processor, a particular document from the list of documents related to the particular document if the particular document is not in the second list corresponding to the particular document.
Dependent claims: 29, 30, 31
Specification