×

Operator-guided application crawling architecture

  • US 10,146,785 B2
  • Filed: 09/28/2015
  • Issued: 12/04/2018
  • Est. Priority Date: 05/13/2015
  • Status: Expired due to Fees
First Claim
Patent Images

1. A system for automated acquisition of content from an application, the system comprising:

  • a guide tracker module configured to monitor interaction of an operator with an executing instance of the application and record a set of guides, wherein each guide in the set of guides includes a recorded sequence of user interface interactions concluding at a respective ultimate state of the application;

    a link extraction controller configured to, for each guide of the set of guides;

    selectively identify additional states of the application that correspond to the respective ultimate state andadd the additional states corresponding to the respective ultimate state and the respective ultimate state to a state list,wherein the additional states and the respective ultimate state are all directly reachable from a common penultimate state of the application,wherein the common penultimate state of the application is immediately prior to the respective ultimate state in the guide, andwherein each entry in the state list designates (i) a state and (ii) a path of user interface interactions to arrive at the state; and

    a scraper module configured to, within an executing instance of the application, extract text and metadata from the states designated by each of the entries in the state list, wherein information based on the extracted text and metadata is stored in a data store.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×