×

Harvesting data from page

  • US 8,924,838 B2
  • Filed: 08/07/2007
  • Issued: 12/30/2014
  • Est. Priority Date: 08/09/2006
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for automatically mining data, the method comprising:

  • identifying, a portion of the page which includes the data to be mined;

    determining that the portion of the page links to another page;

    identifying, a portion of the linked page which includes the data to be mined;

    wherein, the identifying portions of the pages includes;

    using a feed representation created for the page by an entity;

    selecting the portions of the pages as having the data to be mined, wherein the selected portions of the pages include content that is referenced in the feed representation;

    retrieving and storing, the selected portions of the pages as having the data to be mined;

    data mining the selected portions of the pages that have been retrieved and stored;

    identifying a first portion of a second page that is referenced in the feed representations and a second portion of the second page that is not referenced in the feed representation;

    retrieving and storing the second portion and not the first portion of the second page.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×