×

Scheduler for search engine crawler

  • US 8,775,403 B2
  • Filed: 04/17/2012
  • Issued: 07/08/2014
  • Est. Priority Date: 07/03/2003
  • Status: Active Grant
First Claim
Patent Images

1. A method of scheduling document indexing, comprising:

  • at a computing system having one or more processors and memory storing programs for execution by the one or more processors;

    retrieving a number of document identifiers, each document identifier identifying a corresponding document on a network; and

    for each retrieved document identifier and its corresponding document,determining a query-independent score indicative of a rank of the corresponding document relative to other documents in a set of documents;

    determining a first score for the document identifier that is a function of the determined query-independent score, a determined content change frequency of the corresponding document, and an age of the corresponding document;

    comparing the first score against a threshold value thereby obtaining a result, wherein the threshold value is a function of a speed of the engine crawler system; and

    conditionally scheduling the corresponding document for indexing based on the result.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×