
System and method for efficient representation of data set addresses in a web crawler

  • US 6,301,614 B1
  • Filed: 11/02/1999
  • Issued: 10/09/2001
  • Est. Priority Date: 11/02/1999
  • Status: Expired due to Term
First Claim

1. A method of downloading data sets from among a plurality of host computers, comprising the steps of:

  • (a) storing representations of data set addresses in a set of data structures, including a first cache, a second cache, and a disk file;

    (b) downloading at least one data set that includes addresses of one or more referred data sets;

    (c) identifying the addresses of the one or more referred data sets; and

    (d) for each identified address:

    (d1) generating a fixed-length representation of the identified address;

    (d2) determining first whether the representation of the identified address is stored in the first cache, and when the first determination is negative determining second whether the representation of the identified address is stored in the second cache, and when the second determination is negative determining third whether the representation of the identified address is stored in the disk file;

    (d3) when the third determination is negative, storing the representation of the identified address in the second cache and scheduling the corresponding data set for downloading; and

    (d4) when the third determination is positive, storing the representation of the identified address in the first cache.
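
The lookup order recited in steps (d1) through (d4) can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the class and method names are hypothetical, the fixed-length representation is assumed to be the first 8 bytes of a SHA-256 digest (the claim does not name a hash function or a length), the disk file is stood in for by an in-memory set, and a positive first or second determination is assumed to require no further action, since the claim does not say what happens in those cases.

```python
import hashlib


class AddressStore:
    """Hypothetical three-tier store: first cache, second cache, disk file."""

    def __init__(self):
        self.first_cache = set()    # representations already found in the disk file
        self.second_cache = set()   # representations of newly discovered addresses
        self.disk_file = set()      # assumption: in-memory stand-in for the disk file
        self.download_queue = []    # addresses scheduled for downloading

    @staticmethod
    def fingerprint(address: str) -> bytes:
        # (d1) fixed-length representation; truncated SHA-256 is an assumption.
        return hashlib.sha256(address.encode("utf-8")).digest()[:8]

    def process(self, address: str) -> bool:
        """Return True if the address was new and has been scheduled."""
        fp = self.fingerprint(address)
        # (d2) first determination: first cache.
        if fp in self.first_cache:
            return False
        # (d2) second determination: second cache.
        if fp in self.second_cache:
            return False
        # (d2) third determination: disk file.
        if fp in self.disk_file:
            # (d4) known on disk: remember it in the first cache.
            self.first_cache.add(fp)
            return False
        # (d3) unknown everywhere: record in the second cache and schedule.
        self.second_cache.add(fp)
        self.download_queue.append(address)
        return True
```

In this sketch the first cache saves repeated disk lookups for addresses that are already on disk, while the second cache holds addresses seen for the first time during the current crawl. A real crawler would also need a policy for bounding both caches and for merging the second cache into the disk file, which this claim does not cover.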
