Web page ranking with hierarchical considerations
Abstract
The described systems, methods and data structures are directed to ranking Web pages with hierarchical considerations. The hierarchical structures and the linking relationships of the World Wide Web are used to provide a page importance ranking for Web searches. The linking relationships are aggregated to a high-level node of each of the hierarchical structures. A link graph analysis is performed on the aggregated linking relationships to determine the importance of each node. The importance of each node may be propagated to the pages associated with that node. For each page, the importance of that page and the importance of the node associated with the page are used to calculate the page importance ranking.
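As a rough illustration of the flow the abstract describes, the following Python sketch aggregates page-level links to host-level nodes, runs a simple PageRank-style power iteration over the aggregated host graph, and blends each host's score with a per-page score. Several choices here are assumptions made only for illustration and are not stated in the source: using the URL host as the node, the damping factor of 0.85, the linear blending weight `alpha`, and every function name.

```python
from collections import defaultdict
from urllib.parse import urlparse


def host_of(url):
    """Map a page URL to its node; here the node is the URL's host (an assumption)."""
    return urlparse(url).netloc


def aggregate_links(page_links):
    """Collapse page-to-page links into weighted host-to-host links, dropping intra-host links."""
    host_links = defaultdict(lambda: defaultdict(int))
    for src, dst in page_links:
        s, d = host_of(src), host_of(dst)
        if s != d:
            host_links[s][d] += 1
    return host_links


def node_importance(hosts, host_links, damping=0.85, iterations=50):
    """PageRank-style power iteration over the host graph (an illustrative choice of link analysis)."""
    n = len(hosts)
    score = {h: 1.0 / n for h in hosts}
    for _ in range(iterations):
        nxt = {h: (1.0 - damping) / n for h in hosts}
        for src in hosts:
            out = host_links.get(src, {})
            total = sum(out.values())
            if total == 0:
                # Dangling host: spread its score evenly over all hosts.
                for h in hosts:
                    nxt[h] += damping * score[src] / n
            else:
                for dst, weight in out.items():
                    nxt[dst] += damping * score[src] * weight / total
        score = nxt
    return score


def page_importance(page_scores, host_scores, alpha=0.5):
    """Blend each page's own score with its host's score (hypothetical linear combination)."""
    return {
        url: alpha * host_scores[host_of(url)] + (1.0 - alpha) * own
        for url, own in page_scores.items()
    }


# Hypothetical toy data: four pages on three hosts.
links = [
    ("http://a.example/x", "http://b.example/"),
    ("http://a.example/y", "http://b.example/"),
    ("http://c.example/", "http://b.example/"),
    ("http://b.example/", "http://a.example/x"),
    ("http://a.example/x", "http://a.example/y"),  # intra-host link, ignored at host level
]
hosts = sorted({host_of(u) for pair in links for u in pair})
host_scores = node_importance(hosts, aggregate_links(links))
```

In this sketch the host scores sum to one, while the per-page scores passed to `page_importance` can be on any scale the caller chooses, so a real system would presumably normalize the two before combining them.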
36 Claims
1. A method of evaluating content on the Web comprising:
identifying a plurality of pages on the Web;
identifying a plurality of nodes, each node associated with a hierarchical structure to which at least one of the pages corresponds;
grouping the plurality of pages to the corresponding nodes;
for each node, determining a first value based, at least in part, on the linking relationships between that node and the other nodes;
for each page, determining a second value based, at least in part, on characteristics of that page; and
determining an importance value for each page based, at least in part, on the second value associated with that page and the first value associated with the node to which the page corresponds.
12. A system comprising:
a data store containing data about pages on the Web, the data for each page indicating characteristics of the page and a host to which the page corresponds; and
a ranking module configured to determine links pointing to each of the pages from the data in the data store, the ranking module also configured to aggregate the links associated with each host and to calculate a weight value for the host based on the aggregated links, the ranking module further configured to compute an importance value for each page based, at least in part, on the weight value of the host corresponding to the page and the characteristics of the page.
18. One or more computer-readable media encoded with a data structure, comprising:
a first data field including identifiers, each identifier identifying a page on the Web;
a second data field including a first set of values, each of the values in the first set representing the importance of a host, each of the hosts associated with a hierarchical structure to which at least one of the pages in the first data field corresponds;
a third data field including a second set of values, each of the values in the second set representing the importance of a page identified in the first data field; and
a fourth data field including a third set of values, each of the values in the third set representing a ranking for a page identified in the first data field, the ranking being determined based, at least in part, on a value from the second set associated with the page and a value from the first set associated with a host to which the page corresponds.
25. An apparatus comprising:
means for gathering data about pages on the Web;
means for determining hosts to which the pages correspond;
means for determining an importance of each host;
means for determining an importance of each page; and
means for ranking each page based, at least in part, on the importance of the page and the importance of the corresponding host.
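To make the four data fields recited in claim 18 more concrete, here is a minimal Python sketch of one possible in-memory layout. It is not taken from the patent: the class name `RankingRecord`, the helper mapping `page_host`, and the caller-supplied `combine` function are all hypothetical, and the claims do not say how the page value and host value are combined.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class RankingRecord:
    """Illustrative container mirroring the four data fields of claim 18."""

    page_ids: List[str] = field(default_factory=list)                # first field: page identifiers
    host_importance: Dict[str, float] = field(default_factory=dict)  # second field: importance per host
    page_importance: Dict[str, float] = field(default_factory=dict)  # third field: importance per page
    ranking: Dict[str, float] = field(default_factory=dict)          # fourth field: ranking per page
    page_host: Dict[str, str] = field(default_factory=dict)          # helper mapping (assumption, not in the claim)

    def add_page(self, page_id: str, host: str, importance: float) -> None:
        """Register a page, the host it corresponds to, and its own importance value."""
        self.page_ids.append(page_id)
        self.page_host[page_id] = host
        self.page_importance[page_id] = importance

    def compute_rankings(self, combine: Callable[[float, float], float]) -> None:
        """Fill the ranking field from each page's value and its host's value."""
        for page_id in self.page_ids:
            host_value = self.host_importance[self.page_host[page_id]]
            self.ranking[page_id] = combine(self.page_importance[page_id], host_value)

    def ranked_pages(self) -> List[str]:
        """Return page identifiers ordered from highest to lowest ranking."""
        return sorted(self.page_ids, key=lambda p: self.ranking[p], reverse=True)


# Hypothetical usage with made-up values; the product is an arbitrary combining rule.
record = RankingRecord(host_importance={"a.example": 0.6, "b.example": 0.4})
record.add_page("http://a.example/x", "a.example", 0.2)
record.add_page("http://b.example/", "b.example", 0.9)
record.compute_rankings(lambda page_value, host_value: page_value * host_value)
```

Passing the combining rule in as a parameter keeps the sketch neutral about the formula, since the claims only require that the ranking depend "at least in part" on both values.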
Specification