Method for identifying related pages in a hyperlinked database
First Claim
1. A method for identifying related pages from a plurality of pages in a linked database, comprising the steps of:
- selecting an initial page from the plurality of pages;
representing the initial page and pages linked to the initial page as a graph of nodes and edges in a memory;
repeatedly scoring the initial page and the pages linked to the initial page on connectivity of the pages; and
selecting a subset of the pages scored on connectivity that have scores greater than a first predetermined threshold as the related pages of the linked database.
9 Assignments
0 Petitions
Accused Products
Abstract
A method is described for identifying related pages among a plurality of pages in a linked database such as the World Wide Web. An initial page is selected from the plurality of pages. Pages linked to the initial page are represented as a graph in a memory. The pages represented in the graph are scored on content, and a set of pages is selected, the selected set of pages having scores greater than a first predetermined threshold. The selected set of pages is scored on connectivity, and a subset of the set of pages that have scores greater than a second predetermined threshold are selected as related pages.
42 Citations
16 Claims
-
1. A method for identifying related pages from a plurality of pages in a linked database, comprising the steps of:
-
selecting an initial page from the plurality of pages;
representing the initial page and pages linked to the initial page as a graph of nodes and edges in a memory;
repeatedly scoring the initial page and the pages linked to the initial page on connectivity of the pages; and
selecting a subset of the pages scored on connectivity that have scores greater than a first predetermined threshold as the related pages of the linked database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
Specification