INTERACTIVE WEB CRAWLER

  • US 20120323881A1
  • Filed: 06/17/2011
  • Published: 12/20/2012
  • Est. Priority Date: 06/17/2011
  • Status: Active Grant
First Claim

1. A method of web crawling hidden files, comprising:

  • loading a web page with a browser agent;
  • executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values;
  • retrieving a list of form controls from the web page using the browser agent;
  • analyzing the controls using a driver component;
  • sending form control values from the driver component to the browser agent;
  • submitting an event to the web page by the browser agent or running any scripted content to trigger operations on the web page corresponding to the form control values; and
  • generating a URL for various form control values using a generalizer.
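
The claimed steps can be illustrated with a minimal sketch. It is not the patented implementation: the names FormControlParser, choose_values, and generalize, the sample page, and the example.com URL are all hypothetical. A standard-library HTML parser stands in for the browser agent, so no dynamic elements are executed and no event is actually submitted. Only control retrieval, value selection by a driver, and URL generation by a generalizer are shown, and only for a GET-style form.

```python
from html.parser import HTMLParser
from itertools import product
from urllib.parse import urlencode, urljoin


class FormControlParser(HTMLParser):
    """Hypothetical stand-in for the browser agent's 'retrieve form controls' step."""

    def __init__(self):
        super().__init__()
        self.action = ""
        self.controls = {}  # control name -> list of enumerated values (empty if free text)
        self._select = None  # name of the <select> currently being parsed

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "form":
            self.action = a.get("action") or ""
        elif tag == "select" and a.get("name"):
            self._select = a["name"]
            self.controls[self._select] = []
        elif tag == "option" and self._select is not None and a.get("value"):
            self.controls[self._select].append(a["value"])
        elif tag == "input" and a.get("name") and a.get("type", "text") == "text":
            # Free-text control: the page offers no values, so the driver must supply them.
            self.controls[a["name"]] = []

    def handle_endtag(self, tag):
        if tag == "select":
            self._select = None


def choose_values(controls, seed_terms):
    """Hypothetical driver: keep enumerated options, fill free-text controls with seed terms."""
    return {name: (vals if vals else list(seed_terms)) for name, vals in controls.items()}


def generalize(base_url, action, values):
    """Hypothetical generalizer: one URL per combination of form control values."""
    names = sorted(values)
    target = urljoin(base_url, action)
    return [
        target + "?" + urlencode(dict(zip(names, combo)))
        for combo in product(*(values[n] for n in names))
    ]


# Sample page (assumed for illustration only).
PAGE = """
<form action="/search" method="get">
  <select name="category">
    <option value="books">Books</option>
    <option value="music">Music</option>
  </select>
  <input type="text" name="q">
</form>
"""

parser = FormControlParser()
parser.feed(PAGE)
urls = generalize(
    "http://example.com/catalog",
    parser.action,
    choose_values(parser.controls, ["crawler"]),
)
for url in urls:
    print(url)
```

A crawler that follows the claim more closely would replace the static parser with a real browser agent that can run scripts and fire submit events; the sketch only shows how a driver and a generalizer might divide the work once the controls are known.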
