×

System and method for extracting structured data from classified websites

  • US 8,682,881 B1
  • Filed: 09/07/2011
  • Issued: 03/25/2014
  • Est. Priority Date: 09/07/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method of automatically extracting data from a classified website comprising:

  • on a server system having one or more processors and memory storing one or more programs for execution by the one or more processors;

    determining that a website is an area specific classified website based at least in part upon determining that the website is geographically localized;

    accessing page models for other classified websites;

    identifying a listing page in the classified website based on similarity of the listing page to the page models;

    creating a listing page model for the listing page comprising;

    identifying one or more dynamic regions within the listing page;

    determining a type of information associated with a respective dynamic region of the one or more identified dynamic regions;

    creating a listing page template that identifies the one or more dynamic regions and their associated type of information; and

    storing the listing page template;

    extracting data from the classified website based at least in part on the listing page model; and

    saving the extracted data in a database responsive to a classified site query by a user.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×