×

Adaptive Web crawling using a statistical model

  • US 20050165778A1
  • Filed: 12/22/2004
  • Published: 07/28/2005
  • Est. Priority Date: 01/28/2000
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for selectively accessing a document during a current crawl of a server computer, the document being identified by a document address specification, the document having been retrieved during a previous crawl, the method comprising:

  • determining whether to access the document during the current crawl with the aid of a probabilistic model that is based on the probability that the document has changed since the previous crawl; and

    accessing the document if the determination produces an instruction indicative that the document at the document address specification should be accessed during the current crawl.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×