Methods and systems for prioritizing a crawl
Abstract
Methods and systems for prioritizing a crawl are described. One aspect of the invention includes a method for identifying a plurality of storage locations each comprising a plurality of articles, ranking the plurality of storage locations based at least in part on events associated with the plurality of articles, and crawling the storage locations based at least in part on the ranking. Another aspect of the invention includes identifying a plurality of storage locations each comprising a plurality of articles, identifying a plurality of types of the plurality of articles, ranking the plurality of storage locations based at least in part on the plurality of types of the plurality of articles, and crawling the storage locations based at least in part on the ranking.
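The two ranking aspects described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the event tuples, and the per-type weights are all hypothetical, and the scoring rule (a simple count of events, or a sum of assumed type weights) is only one of many ways a storage location could be ranked.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def rank_by_events(events):
    """Rank storage locations by how many events touch their articles.

    `events` is an iterable of (article_path, event_name) pairs. Counting
    events is an assumed scoring rule chosen for illustration.
    """
    counts = defaultdict(int)
    for article_path, _event_name in events:
        counts[str(PurePosixPath(article_path).parent)] += 1
    # Highest count first; ties are broken alphabetically for determinism.
    return sorted(counts, key=lambda loc: (-counts[loc], loc))


def rank_by_types(articles_by_location, type_weights):
    """Rank storage locations by the types of the articles they hold.

    `type_weights` maps a file suffix to a hypothetical priority weight;
    unknown suffixes contribute nothing to the score.
    """
    scores = {
        loc: sum(type_weights.get(PurePosixPath(name).suffix, 0) for name in names)
        for loc, names in articles_by_location.items()
    }
    return sorted(scores, key=lambda loc: (-scores[loc], loc))


events = [
    ("/home/u/docs/a.txt", "open"),
    ("/home/u/docs/a.txt", "save"),
    ("/home/u/docs/b.doc", "open"),
    ("/home/u/mail/m1.eml", "receive"),
]
print(rank_by_events(events))  # ['/home/u/docs', '/home/u/mail']

locations = {
    "/home/u/docs": ["a.txt", "b.doc"],
    "/home/u/mail": ["m1.eml", "m2.eml", "m3.eml"],
}
weights = {".eml": 3, ".doc": 2, ".txt": 1}  # assumed weights
print(rank_by_types(locations, weights))  # ['/home/u/mail', '/home/u/docs']
```

Either ranked list could then serve as the order in which a crawler visits the storage locations.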
20 Claims
1. A computer-implemented method for indexing articles, the method comprising the following operations performed by at least one processor:
identifying a plurality of directories located on a client device, each directory of the plurality of directories storing a plurality of articles, wherein at least one or more of the directories store articles of different types;
determining, for each article of the plurality of articles stored within each directory of the plurality of directories, at least one associated event performed with the client device for the article;
ranking the plurality of directories identified on the client device, the ranking of each directory based at least in part on the associated events for each article of the plurality of articles stored in the directory, wherein the ranking of each directory of the one or more directories that store articles of different types is based on the events associated with each of the articles of different types stored in the directory; and
in response to ranking the plurality of directories identified on the client device:
crawling the plurality of directories based at least in part on the ranking of the plurality of directories, wherein crawling the plurality of directories comprises retrieving multiple different articles from each directory of the plurality of directories; and
indexing the plurality of articles stored in the plurality of directories on the client device based on the crawling of the plurality of directories identified on the client device.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
8. A computer-implemented system comprising:
a non-transitory computer-readable medium that stores program code; and
at least one processor that executes the program code to perform operations comprising:
identifying a plurality of directories located on a client device, each directory of the plurality of directories storing a plurality of articles, wherein at least one or more of the directories store articles of different types;
determining, for each article of the plurality of articles stored within each directory of the plurality of directories, at least one associated event performed with the client device for the article;
ranking the plurality of directories identified on the client device, the ranking of each directory based at least in part on the associated events for each article of the plurality of articles stored in the directory, wherein the ranking of each directory of the one or more directories that store articles of different types is based on the events associated with each of the articles of different types stored in the directory; and
in response to ranking the plurality of directories identified on the client device:
crawling the plurality of directories based at least in part on the ranking of the plurality of directories, wherein crawling the plurality of directories comprises retrieving multiple different articles from each directory of the plurality of directories; and
indexing the plurality of articles stored in the plurality of directories on the client device based on the crawling of the plurality of directories identified on the client device.
- View Dependent Claims (9, 10, 11, 12, 13, 14)
15. A non-transitory computer-readable medium storing instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising:
identifying a plurality of directories located on a client device, each directory of the plurality of directories storing a plurality of articles, wherein at least one or more of the directories store articles of different types;
determining, for each article of the plurality of articles stored within each directory of the plurality of directories, at least one associated event performed with the client device for the article;
ranking the plurality of directories identified on the client device, the ranking of each directory based at least in part on the associated events for each article of the plurality of articles stored in the directory, wherein the ranking of each directory of the one or more directories that store articles of different types is based on the events associated with each of the articles of different types stored in the directory; and
in response to ranking the plurality of directories identified on the client device:
crawling the plurality of directories based at least in part on the ranking of the plurality of directories, wherein crawling the plurality of directories comprises retrieving multiple different articles from each directory of the plurality of directories; and
indexing the plurality of articles stored in the plurality of directories on the client device based on the crawling of the plurality of directories identified on the client device.
- View Dependent Claims (16, 17, 18, 19, 20)
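The crawling and indexing operations recited in the independent claims can be pictured with a short Python sketch. It is an illustration under stated assumptions, not the claimed implementation. The `crawl_and_index` function, the in-memory `store` standing in for the client device's directories, and the whitespace-split inverted index are all hypothetical choices made for brevity.

```python
from collections import defaultdict


def crawl_and_index(ranked_dirs, store):
    """Visit directories in ranked order and index every article found.

    `ranked_dirs` is a list of directory names, highest priority first.
    `store` maps directory -> {article name: article text} and is an
    assumed stand-in for a real file system.
    """
    index = defaultdict(set)  # term -> set of article paths
    visit_order = []
    for directory in ranked_dirs:
        # Retrieve every article in the directory before moving on.
        for name, text in sorted(store.get(directory, {}).items()):
            path = f"{directory}/{name}"
            visit_order.append(path)
            for term in text.lower().split():
                index[term].add(path)
    return visit_order, index


store = {
    "/docs": {"a.txt": "quarterly report", "b.doc": "meeting notes"},
    "/mail": {"m1.eml": "Report attached", "m2.eml": "lunch"},
}
# The ranking is supplied here as a given; "/mail" is assumed to rank first.
visit_order, index = crawl_and_index(["/mail", "/docs"], store)
print(visit_order)
# ['/mail/m1.eml', '/mail/m2.eml', '/docs/a.txt', '/docs/b.doc']
print(sorted(index["report"]))
# ['/docs/a.txt', '/mail/m1.eml']
```

Because higher-ranked directories are visited first, their articles become searchable sooner if the crawl is interrupted or throttled.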
Specification