Extracting structured data from weblogs

  • US 9,158,855 B2
  • Filed: 06/16/2006
  • Issued: 10/13/2015
  • Est. Priority Date: 06/16/2005
  • Status: Active Grant
First Claim

1. A method of extracting individual posts from a weblog, comprising:

    accessing a home page of the weblog;

    identifying at least one feed associated with the weblog;

    determining whether the at least one feed contains sufficient content for feed-guided segmentation;

    if the at least one feed contains sufficient content for feed-guided segmentation, determining whether the at least one feed contains full content or partial content of the weblog;

    if the at least one feed contains full content of the weblog, mapping data found in the at least one feed into a representation for weblog posts; and

    if the at least one feed contains partial content of the weblog, screen scraping the weblog into a representation for weblog posts using the data.
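
The decision flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the home page has already been accessed and the feed already identified and parsed, so the inputs are a list of feed items and the page text. The names `FeedItem`, `Post`, `has_sufficient_content`, `is_full_content`, `scrape_with_feed`, and `extract_posts` are hypothetical, and the tests for "sufficient" and "full" content are simple heuristics chosen for illustration, since the claim does not define them.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumption: a feed body ending in one of these markers is a truncated excerpt.
TRUNCATION_MARKERS = ("...", "…", "[...]")


@dataclass
class FeedItem:
    title: str
    link: str
    body: str  # text carried by the feed entry (full post or excerpt)


@dataclass
class Post:
    title: str
    link: str
    content: str


def has_sufficient_content(items: List[FeedItem]) -> bool:
    # Assumed heuristic: at least one item, and every item has a title and a body.
    return bool(items) and all(i.title and i.body for i in items)


def is_full_content(items: List[FeedItem]) -> bool:
    # Assumed heuristic: the feed is "full" if no body looks truncated.
    return not any(i.body.rstrip().endswith(TRUNCATION_MARKERS) for i in items)


def scrape_with_feed(items: List[FeedItem], page_text: str) -> List[Post]:
    # Use feed titles as anchors in the page text; each post runs from the end
    # of its title to the start of the next located title (or the page end).
    located = [(page_text.find(i.title), i) for i in items]
    located = sorted((p for p in located if p[0] >= 0), key=lambda p: p[0])
    posts = []
    for idx, (pos, item) in enumerate(located):
        start = pos + len(item.title)
        end = located[idx + 1][0] if idx + 1 < len(located) else len(page_text)
        posts.append(Post(item.title, item.link, page_text[start:end].strip()))
    return posts


def extract_posts(items: List[FeedItem], page_text: str) -> Tuple[str, List[Post]]:
    if not has_sufficient_content(items):
        # Claim 1 does not say what happens here; this sketch returns nothing.
        return "unsupported", []
    if is_full_content(items):
        # Full feed: map feed data directly into the post representation.
        return "feed-mapping", [Post(i.title, i.link, i.body) for i in items]
    # Partial feed: scrape the page, guided by the feed data.
    return "screen-scraping", scrape_with_feed(items, page_text)
```

In this sketch a full-content feed never touches the page text, while a partial feed contributes only its titles and links, which mark where each post begins on the page.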
