Evaluation of web pages
First Claim
Patent Images
1. A system comprising:
- a crawler to obtain a plurality of web pages with the same or approximately the same content, wherein each two web pages of the plurality of web pages includes the same or approximately the same content; and
an indexer coupled to the crawler, and comprising one or more processors to;
determine a plurality of first evaluation values corresponding to respective ones of the plurality of web pages, wherein the plurality of first evaluation values comprises a plurality of first ranking parameter values;
identify a web page of the plurality of web pages as an original web page;
identify web pages other than the original web page of the plurality of web pages as reproduced web pages;
determine a second evaluation value associated with the original web page based at least in part on a combination comprising;
(1) a first evaluation value corresponding to the original web page from the plurality of first evaluation values; and
(2) an aggregation of first evaluation values corresponding to the reproduced web pages from the plurality of first evaluation values; and
rank the original web page among the plurality of web pages based at least in part on the second evaluation value associated with the original web page.
0 Assignments
0 Petitions
Accused Products
Abstract
A web page evaluation technique includes obtaining a plurality of web pages with the same or approximately the same content. Further, a plurality of generation times and a plurality of first evaluation values that correspond to respective ones of the plurality of web pages are determined. A web page among the plurality of web pages that has the earliest generation time is identified. A second evaluation value of the identified web page is determined according to the plurality of first evaluation values. The second evaluation value can be used to indicate a ranking of the identified web page.
12 Citations
18 Claims
-
1. A system comprising:
-
a crawler to obtain a plurality of web pages with the same or approximately the same content, wherein each two web pages of the plurality of web pages includes the same or approximately the same content; and an indexer coupled to the crawler, and comprising one or more processors to; determine a plurality of first evaluation values corresponding to respective ones of the plurality of web pages, wherein the plurality of first evaluation values comprises a plurality of first ranking parameter values; identify a web page of the plurality of web pages as an original web page; identify web pages other than the original web page of the plurality of web pages as reproduced web pages; determine a second evaluation value associated with the original web page based at least in part on a combination comprising; (1) a first evaluation value corresponding to the original web page from the plurality of first evaluation values; and (2) an aggregation of first evaluation values corresponding to the reproduced web pages from the plurality of first evaluation values; and rank the original web page among the plurality of web pages based at least in part on the second evaluation value associated with the original web page. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method, comprising:
-
obtaining a plurality of web pages with the same or approximately the same content, wherein each two web pages of the plurality of web pages includes the same or approximately the same content; determining, using one or more processors, a plurality of first evaluation values corresponding to respective ones of the plurality of web pages, wherein the plurality of first evaluation values comprises a plurality of first ranking parameter values; identifying a web page of the plurality of web pages as an original web page; identifying web pages other than the original web page of the plurality of web pages as reproduced web pages; determining a second evaluation value associated with the original web page based at least in part on a combination comprising; (1) a first evaluation value corresponding to the original web page from the plurality of first evaluation values; and (2) an aggregation of first evaluation values corresponding to the reproduced web pages from the plurality of first evaluation values; and ranking the original web page among the plurality of web pages based at least in part on the second evaluation value associated with the original web page. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer program product, the computer program product is embodied in a non-transitory computer readable storage medium and comprising computer instructions for:
-
obtaining a plurality of web pages with the same or approximately the same content, wherein each two web pages of the plurality of web pages includes the same or approximately the same content; determining a plurality of first evaluation values corresponding to respective ones of the plurality of web pages, wherein the plurality of first evaluation values comprises a plurality of first ranking parameter values; identifying a web page of the plurality of web pages as an original web page; identifying web pages other than the original web page of the plurality of web pages as reproduced web pages; determining a second evaluation value associated with the original web page based at least in part on a combination comprising; (1) a first evaluation value corresponding to the original web page from the plurality of first evaluation values; and (2) an aggregation of first evaluation values corresponding to the reproduced web pages from the plurality of first evaluation values; and ranking the original web page among the plurality of web pages based at least in part on the second evaluation value associated with the original web page.
-
Specification