Methods and systems for prioritizing a crawl
Abstract
Methods and systems for prioritizing a crawl are described. One aspect of the invention includes a method for identifying a plurality of storage locations each comprising a plurality of articles, ranking the plurality of storage locations based at least in part on events associated with the plurality of articles, and crawling the storage locations based at least in part on the ranking. Another aspect of the invention includes identifying a plurality of storage locations each comprising a plurality of articles, identifying a plurality of types of the plurality of articles, ranking the plurality of storage locations based at least in part on the plurality of types of the plurality of articles, and crawling the storage locations based at least in part on the ranking.
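As an illustration of the first aspect, the following is a minimal Python sketch of ranking storage locations by associated events. The abstract does not define what an event is or how events are weighted, so the `(location, article)` event pairs, the function name `rank_by_events`, and the count-based score are all assumptions made for this example.

```python
from collections import Counter


def rank_by_events(locations, events):
    """Order storage locations by how many recorded events touch their articles.

    `events` is a hypothetical list of (location, article_name) pairs, for
    example one pair per file open or save. The event format is an assumption.
    """
    hits = Counter(location for location, _article in events)
    # Most events first; ties are broken alphabetically so the order is stable.
    return sorted(locations, key=lambda loc: (-hits[loc], loc))


if __name__ == "__main__":
    locations = ["/docs", "/mail", "/tmp"]
    events = [("/mail", "a.eml"), ("/mail", "b.eml"), ("/docs", "c.doc")]
    # A crawler would then visit the locations in this order.
    print(rank_by_events(locations, events))
```

A location with no recorded events still appears in the result, at the end of the order, so nothing is skipped by the crawl.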
23 Claims
1. A computer-implemented method comprising the following operations performed by one or more processors:
identifying a plurality of directories located on a client device, each of the directories storing a plurality of articles of different types;
determining, for each of the article types, a number of the articles that have the same article type in each identified directory;
ranking the directories in an order, with respect to each article type, based at least in part on the determined number of articles in each directory having the same article type;
crawling the directories based at least in part on the ranking; and
indexing the plurality of articles based on the crawling.
Dependent claims: 2, 3, 4, 5, 6, 7, 21.
8. A computer-implemented method comprising the following operations performed by one or more processors:
identifying a plurality of storage locations located on a client device, each of the storage locations storing a plurality of articles of different types;
determining, for each of the article types, a number of the articles having the same article type in each identified storage location;
ranking the storage locations in an order, with respect to each article type, based at least in part on the determined number of articles in each storage location having the same article type;
crawling the storage locations based at least in part on the ranking; and
indexing the plurality of articles based on the crawling.
Dependent claims: 9, 10, 11, 12, 13, 20, 22.
14. A non-transitory computer-readable storage medium storing instructions that, when executed by at least one processor, configure the at least one processor to perform operations comprising:
identifying a plurality of storage locations located on a client device, each of the storage locations storing a plurality of articles of different types;
determining, for each of the article types, a number of the articles having the same article type in each identified storage location;
ranking the storage locations in an order, with respect to each article type, based at least in part on the determined number of articles in each storage location having the same article type;
crawling the storage locations based at least in part on the ranking; and
indexing the plurality of articles based on the crawling.
Dependent claims: 15, 16, 17, 18, 19, 23.
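The operations recited in independent claims 1, 8 and 14 can be illustrated with a short Python sketch. It is not the patented implementation: treating a file extension as the article type, the helper names, the `type_priority` argument used to merge the per-type rankings into a single crawl order, and the toy filename index are all assumptions introduced here.

```python
import os
from collections import Counter, defaultdict


def article_type(filename):
    """Treat the lowercase file extension as the article type (an assumption)."""
    return os.path.splitext(filename)[1].lower() or "<none>"


def count_types(root):
    """Return {directory: Counter({article_type: count})} for each directory under root."""
    counts = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        counts[dirpath] = Counter(article_type(name) for name in filenames)
    return counts


def rank_directories(counts):
    """Return {article_type: [directories]}, highest count of that type first."""
    pairs_by_type = defaultdict(list)
    for directory, by_type in counts.items():
        for kind, n in by_type.items():
            pairs_by_type[kind].append((n, directory))
    return {
        kind: [d for _n, d in sorted(pairs, key=lambda p: (-p[0], p[1]))]
        for kind, pairs in pairs_by_type.items()
    }


def crawl_order(rankings, type_priority):
    """Merge the per-type rankings into one order, visiting each directory once.

    `type_priority` is a hypothetical list of article types, most important first.
    """
    seen, order = set(), []
    for kind in type_priority:
        for directory in rankings.get(kind, []):
            if directory not in seen:
                seen.add(directory)
                order.append(directory)
    return order


def crawl_and_index(order):
    """Crawl directories in ranked order and build a toy index: filename -> paths."""
    index = {}
    for directory in order:
        for name in sorted(os.listdir(directory)):
            path = os.path.join(directory, name)
            if os.path.isfile(path):
                index.setdefault(name, []).append(path)
    return index
```

With these helpers, `crawl_and_index(crawl_order(rank_directories(count_types(root)), [".doc", ".txt"]))` would crawl the directories richest in `.doc` files first, then any remaining directories in order of their `.txt` counts. Directories holding none of the prioritized types are left out of this sketch's crawl order.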
Specification