×

Spatially directed crawling of documents

  • US 7,539,693 B2
  • Filed: 11/17/2004
  • Issued: 05/26/2009
  • Est. Priority Date: 02/22/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for populating a document repository with documents that are relevant to a spatial domain that has a spatial metric, said method comprising:

  • retrieving a document address from a page queue;

    loading into the document repository a document that is identified by the retrieved document address;

    parsing the loaded document for links to new documents;

    storing addresses of the new documents into the page queue, and for each address in the page queue, storing a spatial relevance level, the spatial relevance level being a measure of a document'"'"'s relevance to a location in the spatial domain; and

    iteratively repeating the steps of retrieving, loading, parsing and storing to populate the document repository, and wherein retrieving involves using the spatial relevance levels of the stored addresses in the page queue to determine which document addresses are retrieved from the page queue.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×