
System and method for automatically gathering dynamic content and resources on the world wide web by stimulating user interaction and managing session information

  • US 6,665,658 B1
  • Filed: 01/13/2000
  • Issued: 12/16/2003
  • Est. Priority Date: 01/13/2000
  • Status: Expired due to Term
First Claim

1. An automated method of gathering dynamic content and resources on the world wide web by simulating user interaction and managing session information, the method comprising the steps of:

  • providing a site database of dynamic websites requiring interaction to download contents thereof, said site database containing session data for the dynamic websites and document type definitions (“DTD”) including descriptions of how to interact with the dynamic websites;

    identifying and retrieving at least one uniform resource locator (“URL”) for a dynamic website to be analyzed;

    identifying and retrieving a session data and DTD for said URL from the site database;

    creating a query template for the retrieved URL using said identified DTD describing how to interact with the URL to simulate user interaction;

    identifying at least one search topic to be searched on said URL;

    inserting said at least one search topic into said query template to form a search query string;

    querying said URL with said query string comprising said identified DTD and said at least one search topic;

    retrieving at least one result of said query, thereby automatically simulating user interaction with said dynamic website to gather and extract said at least one result.
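
The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The `SiteEntry` record is a hypothetical, simplified stand-in for the session data and DTD held in the site database, `example.com` is a placeholder site, and passing the session data in the query string is an assumption made to keep the example short. The `fetch` argument is supplied by the caller, so the sketch itself performs no network access.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict
from urllib.parse import quote_plus, urlencode, urljoin


@dataclass
class SiteEntry:
    """Hypothetical site-database record: session data plus a simplified
    stand-in for the DTD describing how to interact with the site."""
    session: Dict[str, str]   # assumed session parameters, e.g. a session ID
    action: str               # path the site's search form submits to
    topic_field: str          # name of the form field that carries the topic
    fixed_fields: Dict[str, str] = field(default_factory=dict)


# Hypothetical site database keyed by URL (example.com is a placeholder).
SITE_DB: Dict[str, SiteEntry] = {
    "http://example.com/": SiteEntry(
        session={"sid": "abc123"},
        action="/search",
        topic_field="q",
        fixed_fields={"lang": "en"},
    ),
}


def build_query_template(url: str, entry: SiteEntry) -> str:
    """Create a query template with a {topic} placeholder for the given URL."""
    fixed = urlencode({**entry.fixed_fields, **entry.session})
    prefix = f"{fixed}&" if fixed else ""
    base = urljoin(url, entry.action)
    return f"{base}?{prefix}{quote_plus(entry.topic_field)}={{topic}}"


def gather(url: str, topic: str, fetch: Callable[[str], str]) -> str:
    """Look up the site entry, form the query string, and retrieve a result."""
    entry = SITE_DB[url]                               # session data and description
    template = build_query_template(url, entry)        # query template
    query = template.format(topic=quote_plus(topic))   # insert the search topic
    return fetch(query)                                # query the site
```

In practice `fetch` could wrap any HTTP client; injecting it keeps the query-building logic testable without contacting a real site.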
