×

Detecting a network crawler

  • US 10,182,046 B1
  • Filed: 06/23/2015
  • Issued: 01/15/2019
  • Est. Priority Date: 06/23/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • receiving, by a computer system associated with an electronic marketplace, a request for a first web page of a web site of the electronic marketplace, the request received from a computing device;

    inserting, by the computer system in the first web page, a universal resource locator (URL) of a second web page of the web site and code, the second web page inaccessible to web crawlers based at least in part on a robots exclusion protocol of the web site identifying the second web page, the URL inserted in markup language of the first web page, the code comprising statements of a programmatic scripting language in accordance with an ECMAScript standard and configured to, upon execution of the code at the computing device;

    determine a presence or absence of the URL in a browser history stored at the computing device and, if the URL is present in the browser history, one or more of;

    a state or a style attribute of the URL from the browser history, anddetermine, based at least in part on the presence or absence of the URL, whether the second web page was accessed by the computing device;

    providing, by the computer system, the first web page to the computing device based at least in part on the request;

    receiving, by the computer system from the computing device, an indication that the second web page was not accessed prior to providing the first web page to the computing device, the indication received based at least in part on a determination that the URL is absent from the browser history or on a determination of the state or style attribute of the URL if the URL is present in the browser history, the determination based at least in part on an execution of the code at the computing device; and

    determining, based at least in part on the indication, that the request for the first web page is associated with a web crawler hosted on the computing device.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×