×

Identification of web sites that contain session identifiers

  • US 7,886,217 B1
  • Filed: 09/29/2003
  • Issued: 02/08/2011
  • Est. Priority Date: 09/29/2003
  • Status: Active Grant
First Claim
Patent Images

1. A method for crawling documents, performed by one or more server devices, the method comprising:

  • receiving, by one or more processors associated with the one or more server devices, a uniform resource locator (URL);

    receiving, by one or more processors associated with the one or more server devices, at least two different copies of a document associated with the URL; and

    determining, by one or more processors associated with the one or more server devices, whether a web site corresponding to the URL uses session identifiers based on a comparison of URLs that are within the document and that change between the at least two different copies of the document, where the web site is determined to use session identifiers when a portion of the URLs that change between the at least two different copies of the document is greater than a threshold.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×