Link based clustering of hyperlinked documents
Abstract
Techniques for grouping hyperlinked documents are provided. Links near or in the neighborhood of the hyperlinked documents are analyzed in order to group the hyperlinked documents by topic. For example, links that are search results can be grouped by identifying other hyperlinked documents that have multiple forward links to the search results. The search results can then be grouped according to the forward links of the other hyperlinked documents.
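As a rough illustration of the example in the abstract, the sketch below counts, for each pair of search results, how many other documents have forward links to both. The names `forward_links`, `results`, and `cocitation_counts` are hypothetical and do not come from the patent; this is one possible reading of the abstract under those assumptions, not the claimed implementation.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(forward_links, results):
    """Count how many other documents link to both members of each result pair.

    forward_links: dict mapping a document to the documents it links to
                   (assumed input shape, not specified by the patent).
    results:       iterable of search-result documents to be grouped.
    """
    results = set(results)
    counts = defaultdict(int)
    for source, targets in forward_links.items():
        if source in results:
            continue  # only "other" documents act as evidence here (assumption)
        hits = sorted(results & set(targets))
        if len(hits) < 2:
            continue  # needs multiple forward links into the result set
        for a, b in combinations(hits, 2):
            counts[(a, b)] += 1
    return dict(counts)


links = {"hub1": ["a", "b"], "hub2": ["a", "b", "c"], "hub3": ["c"]}
pair_counts = cocitation_counts(links, ["a", "b", "c"])
```

Pairs with higher counts are candidates for the same topical group; how the counts are turned into groups is left open here.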
25 Claims
1. An automated method, comprising:
identifying a set of documents;
determining whether first and second documents in the set of documents are similar based on a number of documents that point to both the first and second documents and a number of back links associated with the first and second documents, where a result of the determination of whether the first and second documents are similar is inversely proportional to the number of back links associated with the first and second documents; and
forming a group from the first and second documents when the first and second documents are determined to be similar.
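The claim does not give a formula for the determination. One function that fits its wording, offered purely as an assumption, divides the number of documents pointing to both documents by the combined number of back links, so the result falls as the back-link count grows. The names `similarity`, `are_similar`, `back_links`, and the `threshold` value are hypothetical.

```python
def similarity(a, b, back_links):
    """Shared in-links divided by total in-links (one assumed scoring choice).

    back_links: dict mapping a document to the set of documents that link to it.
    """
    in_a = back_links.get(a, set())
    in_b = back_links.get(b, set())
    shared = len(in_a & in_b)      # documents that point to both a and b
    total = len(in_a) + len(in_b)  # back links associated with a and b
    return shared / total if total else 0.0


def are_similar(a, b, back_links, threshold=0.25):
    """Treat the pair as similar when the score reaches an arbitrary threshold."""
    return similarity(a, b, back_links) >= threshold


back_links = {
    "a": {"h1", "h2"},
    "b": {"h1", "h2", "h3"},
    "c": {"h1"} | {f"x{i}" for i in range(9)},  # popular page with many back links
}
```

Dividing by the back-link total keeps a heavily linked page such as `c` from appearing similar to everything merely because a few of its many in-links overlap with another document's.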

14. A computer system, comprising:
means for identifying a set of documents;
means for determining whether first and second documents in the set of documents relate to a similar topic based on a number of documents that simultaneously point to both the first and second documents and a number of back links associated with the first and second documents, wherein a result of the determination of whether the first and second documents relate to a similar topic is inversely proportional to the number of back links associated with the first and second documents; and
means for grouping the first and second documents when the first and second documents are determined to relate to a similar topic.

16. An automated method, comprising:
identifying a set of documents;
identifying each of the documents in the set of documents as a separate group;
determining whether to merge two of the groups based on a number of documents that point to documents in both of the two groups and a number of back links associated with the two groups, where a result of the determination of whether to merge the two groups is inversely proportional to the number of back links associated with the two groups; and
merging the two groups when it is determined that the two groups should be merged.

17. An automated method, comprising:
identifying a set of documents;
identifying each of the documents in the set of documents as a separate group;
determining whether two of the groups relate to a similar topic based on a number of documents that point to documents in both of the two groups and a number of back links associated with the two groups, where a result of the determination of whether the two groups relate to a similar topic is inversely proportional to the number of back links associated with the two groups; and
merging the two groups when it is determined that the two groups relate to a similar topic.
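The group-merging steps recited in claims 16 and 17 resemble bottom-up (agglomerative) clustering. The sketch below is one way to read them, assuming a group's back links are the union of its members' back links and that merging stops once the best score falls below a threshold. The function names, the scoring formula, and the `threshold` value are all hypothetical; the claims do not specify them.

```python
from itertools import combinations


def group_score(g1, g2, back_links):
    """Documents pointing into both groups, divided by the groups' back links."""
    in1 = set().union(*(back_links.get(d, set()) for d in g1))
    in2 = set().union(*(back_links.get(d, set()) for d in g2))
    total = len(in1) + len(in2)
    return len(in1 & in2) / total if total else 0.0


def cluster(documents, back_links, threshold=0.25):
    """Start with one group per document, then merge the best pair repeatedly."""
    groups = [frozenset([d]) for d in dict.fromkeys(documents)]
    while len(groups) > 1:
        g1, g2 = max(combinations(groups, 2),
                     key=lambda pair: group_score(pair[0], pair[1], back_links))
        if group_score(g1, g2, back_links) < threshold:
            break  # no remaining pair looks topically related
        groups = [g for g in groups if g not in (g1, g2)] + [g1 | g2]
    return groups


back_links = {
    "a": {"h1", "h2"}, "b": {"h1", "h2"},
    "c": {"h3", "h4"}, "d": {"h3", "h4"},
}
groups = cluster(["a", "b", "c", "d"], back_links)
```

Rescoring every pair on each pass is quadratic per merge, which is acceptable for a short list of search results but would need caching for larger sets.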

22. A system, comprising:
a memory to store instructions; and
a processor to execute the instructions in the memory to:
identify a set of documents,
identify the documents in the set of documents as separate groups,
determine whether two of the groups relate to a similar topic based on a number of documents that simultaneously point to documents in both of the two groups and a number of back links associated with the two groups, where a result of the determination of whether the two groups relate to a similar topic is inversely proportional to the number of back links associated with the two groups, and
merge the two groups when it is determined that the two groups relate to a similar topic.

24. An automated method, comprising:
receiving a search query;
identifying a set of documents based on the search query;
forming groups from documents in the set of documents based on a function that receives as inputs a number of documents that point to pairs of the documents in the set of documents and a number of back links associated with the documents in the set of documents, where a result of the function is inversely proportional to the number of back links associated with the documents;
generating a search result document in which at least one of the groups is visually separated from another one of the groups; and
presenting the search result document.
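For the final steps of claim 24, a minimal sketch of a search result document with visually separated groups might look like the following. It assumes the groups have already been formed and uses a plain-text rule as the separator; `render_results` and its output layout are hypothetical and not taken from the patent.

```python
def render_results(groups, separator="-" * 40):
    """Render each group as a labeled block, with a rule between blocks."""
    blocks = []
    for number, group in enumerate(groups, start=1):
        lines = [f"Group {number}"] + [f"  {url}" for url in sorted(group)]
        blocks.append("\n".join(lines))
    return f"\n{separator}\n".join(blocks)


page = render_results([{"b.example", "a.example"}, {"c.example"}], separator="----")
print(page)
```

An HTML result page could apply the same idea with one container element per group instead of a text rule.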
Specification