Using network traffic logs for search enhancement
Abstract
A method and apparatus for using network traffic logs for search enhancement is disclosed. According to one embodiment, network usage is tracked by generating log files. Among other things, these log files indicate how frequently web pages are referenced and modified. The log files, or information from them, can then be used to improve document ranking, improve web crawling, determine tiers in a multi-tiered index, determine where to insert a document in a multi-tiered index, determine link weights, and update a search engine index.
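As an illustration of the kind of information such log files could yield, the following minimal sketch counts how often each document was requested during a given period. The log entry layout (timestamp, user identifier, URL), the function name `request_frequencies`, and the sample data are all assumptions made for this example; the abstract does not specify a log format.

```python
from collections import Counter
from datetime import datetime


def request_frequencies(log_entries, start, end):
    """Count requests per document URL with start <= timestamp < end.

    log_entries is an iterable of (timestamp, user_id, url) tuples.
    This tuple layout is a hypothetical format, not one from the patent.
    """
    counts = Counter()
    for timestamp, _user_id, url in log_entries:
        if start <= timestamp < end:
            counts[url] += 1
    return counts


# Hypothetical traffic log recorded by a traffic server.
sample_log = [
    (datetime(2024, 1, 1, 9, 0), "u1", "http://example.com/a"),
    (datetime(2024, 1, 1, 9, 5), "u2", "http://example.com/a"),
    (datetime(2024, 1, 1, 9, 7), "u1", "http://example.com/b"),
    (datetime(2024, 1, 2, 8, 0), "u3", "http://example.com/b"),  # outside the window
]

frequencies = request_frequencies(
    sample_log,
    start=datetime(2024, 1, 1),
    end=datetime(2024, 1, 2),
)
```

A search engine receiving these counts could then feed them into ranking or crawl scheduling, as the abstract lists.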
18 Claims
1. A computer-implemented method comprising the steps of:
- receiving, at a search engine operating on one or more computing devices from a traffic server that is not the search engine and is not operating on the one or more computing devices, network usage information that indicates a frequency with which users of a plurality of users requested, during a particular period of time, a particular document through the traffic server, which monitors frequencies with which documents in a plurality of documents are provided, through the traffic server, from a plurality of origin servers upon which the plurality of documents reside, to the plurality of users in response to a plurality of document-identifying requests from the plurality of users;
- the search engine positioning the particular document within a queue at a queue position that is selected based on said network usage information;
- the search engine performing a crawling or indexing operation based at least in part on the position of the particular document within the queue.

Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A non-transitory computer-readable medium storing one or more sequences of instructions, wherein execution of the one or more sequences of instructions by one or more processors causes the one or more processors to perform the steps of:
- receiving, at a search engine operating on one or more computing devices from a traffic server that is not the search engine and is not operating on the one or more computing devices, network usage information that indicates a frequency with which users of a plurality of users requested, during a particular period of time, a particular document through the traffic server, which monitors frequencies with which documents in a plurality of documents are provided, through the traffic server, from a plurality of origin servers upon which the plurality of documents reside, to the plurality of users in response to a plurality of document-identifying requests from the plurality of users;
- positioning the particular document within a queue at a queue position that is selected based on said network usage information;
- performing a crawling or indexing operation based at least in part on the position of the particular document within the queue.

Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
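The positioning and crawling steps recited in the independent claims can be pictured with a small priority queue. The sketch below is illustrative only. It assumes that a higher request frequency places a document earlier in the queue, which the claims do not require (they say only that the position is "selected based on" the usage information). The names `CrawlQueue`, `position`, `next_document`, and `crawl_order` are hypothetical.

```python
import heapq


class CrawlQueue:
    """Queue of documents ordered by reported request frequency.

    Assumption for this sketch: more frequently requested documents are
    crawled first. Ties fall back to insertion order.
    """

    def __init__(self):
        self._heap = []
        self._sequence = 0  # tie-breaker that preserves insertion order

    def position(self, url, frequency):
        # heapq is a min-heap, so the frequency is negated to pop the
        # most frequently requested document first.
        heapq.heappush(self._heap, (-frequency, self._sequence, url))
        self._sequence += 1

    def next_document(self):
        _neg_frequency, _sequence, url = heapq.heappop(self._heap)
        return url

    def __len__(self):
        return len(self._heap)


def crawl_order(usage):
    """Return URLs in the order a crawler would take them from the queue.

    usage maps each URL to the request frequency reported for it.
    """
    queue = CrawlQueue()
    for url, frequency in usage.items():
        queue.position(url, frequency)
    order = []
    while len(queue):
        order.append(queue.next_document())
    return order


order = crawl_order({"a": 3, "b": 10, "c": 3})
```

With these inputs, `b` is taken first because it has the highest reported frequency, and `a` precedes `c` because the tie is broken by insertion order.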
Specification