Systems and Methods to Control Web Scraping
First Claim
Patent Images
1. A method to selectively deter web scraping, the method comprising:
- storing information on a first database;
providing a web interface to access the information stored in the first database, the web interface including a plurality of web servers capable to independently serve a current web request through retrieving the information from the first database;
in response to the current web request received at one of the web servers, determining identification information indicating a requester of the current web request;
logging the current web request in association with the requester in a second database;
accessing the second database, in response to the current web request, for past web requests of the requester logged in the second database, at least a portion of the past web requests of the requester received via different ones of web servers;
determining whether to block the current web request, to delay the current web request, or to timely reply to the current web request, based on a characteristic of the past web requests logged in the second database; and
in response to a determination to reply to the current web request, providing the information to the requester through one of web servers.
18 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods to control web scraping through a plurality of web servers using real time access statistics are described.
39 Citations
20 Claims
-
1. A method to selectively deter web scraping, the method comprising:
-
storing information on a first database; providing a web interface to access the information stored in the first database, the web interface including a plurality of web servers capable to independently serve a current web request through retrieving the information from the first database; in response to the current web request received at one of the web servers, determining identification information indicating a requester of the current web request; logging the current web request in association with the requester in a second database; accessing the second database, in response to the current web request, for past web requests of the requester logged in the second database, at least a portion of the past web requests of the requester received via different ones of web servers; determining whether to block the current web request, to delay the current web request, or to timely reply to the current web request, based on a characteristic of the past web requests logged in the second database; and in response to a determination to reply to the current web request, providing the information to the requester through one of web servers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A machine readable media embodying instructions, the instructions causing a data processing system to perform a method, the method comprising:
-
receiving a current web request at a web interface to access information stored on a first database, the web interface including a plurality of web servers capable to independently serve web requests through retrieving the information from the first database; in response to the current web request received at one of the web servers, determining identification information indicating a requester of the current web request; logging the current web request in association with the requester in a second database; accessing the second database, in response to the current web request, for past web requests of the requester logged in the second database, at least a portion of past web requests of the requester received via different ones of web servers; determining whether to block the current web request, to delay the current web request, or to timely reply to the current web request, based on a characteristic of the past web requests logged in the second database; and in response to a determination to reply to the current web request, providing the information to the requester through one of web servers.
-
-
20. A data processing system, comprising:
-
means for receiving a current web request at a web interface to access information stored on a first database, the web interface including a plurality of web servers capable to independently serve web requests through retrieving the information from the first database; means for determining identification information indicating a requester of the current web request, in response the current web request received at one of the web servers; means for logging the current web request in association with the requester in a second database, in response to the current web request received at one of the web servers; means for accessing the second database, in response to the current web request, for past web requests of the requester logged in the second database, at least a portion of the past web requests of the requester received via different ones of web servers; means for determining whether to block the current web request, to delay the current web request, or to timely reply to the current web request, based on a characteristic of the past web requests logged in the second database; and means for providing the information to the requester through one of web servers, in response to a determination to reply to the current web request.
-
Specification