Extracting structured data from weblogs
Abstract
A method of extracting individual posts from a weblog comprises the steps of: (a) providing a feed associated with the weblog; and (b) screen scraping the weblog into a representation for weblog posts using the feed data containing partial content of the weblog.
34 Claims
1. A method of extracting individual posts from a weblog, comprising the steps of:
(a) accessing the home page of the weblog;
(b) identifying at least one feed associated with the weblog;
(c) determining whether the feed contains sufficient content for performing feed-guided segmentation;
(d) if the feed contains sufficient content for feed-guided segmentation, determining whether the feed contains full content or partial content of the weblog;
(e) if the feed contains full content of the weblog, mapping the data found in the feed into a representation for weblog posts; and
(f) if the feed contains partial content of the weblog, screen scraping the weblog into a representation for weblog posts using the feed data.
Dependent claims: 2-27.
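The branching in steps (a) through (f) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: `find_feeds`, `FeedEntry`, `choose_strategy`, the `min_entries` and `min_chars` thresholds, and the truncation-marker test for "partial content" are all assumptions chosen only to make the control flow concrete. Feed discovery is assumed to rely on `<link rel="alternate">` elements in the home page, which the claim does not specify.

```python
from dataclasses import dataclass
from html.parser import HTMLParser

FEED_TYPES = {"application/rss+xml", "application/atom+xml"}
# Assumption: a trailing ellipsis signals that the feed carries only an excerpt.
TRUNCATION_MARKERS = ("...", "\u2026", "[...]", "[\u2026]")


class FeedLinkFinder(HTMLParser):
    """Collects href values of <link rel="alternate"> feed references."""

    def __init__(self):
        super().__init__()
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        a = dict(attrs)
        rel = (a.get("rel") or "").lower().split()
        kind = (a.get("type") or "").lower()
        if "alternate" in rel and kind in FEED_TYPES and a.get("href"):
            self.feeds.append(a["href"])


def find_feeds(home_page_html):
    """Steps (a)-(b): scan the home page markup for advertised feeds."""
    finder = FeedLinkFinder()
    finder.feed(home_page_html)
    return finder.feeds


@dataclass
class FeedEntry:
    title: str
    link: str
    content: str  # text the feed carries for this post (may be an excerpt)


def looks_partial(entry):
    """Hypothetical test for step (d): does the entry end in a truncation marker?"""
    return entry.content.rstrip().endswith(TRUNCATION_MARKERS)


def choose_strategy(entries, min_entries=1, min_chars=20):
    """Steps (c)-(f): pick how posts would be extracted for this feed."""
    usable = [e for e in entries if len(e.content.strip()) >= min_chars]
    # Step (c): too little feed content to guide segmentation.
    if len(usable) < min_entries:
        return "no-feed-guidance"
    # Step (d): full versus partial content.
    if any(looks_partial(e) for e in usable):
        return "scrape-with-feed"  # step (f)
    return "map-feed"  # step (e)
```

The claim does not say what happens when the feed fails the sufficiency test in step (c); the `"no-feed-guidance"` label is only a placeholder for that case.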
28. A method of extracting individual posts from a weblog, comprising the steps of:
(a) providing a feed associated with the weblog; and
(b) screen scraping the weblog into a representation for weblog posts using the feed data containing partial content of the weblog.
Dependent claims: 29-34.
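One way to picture step (b) is to use each partial feed entry as an anchor that marks where a post begins on the page. The sketch below is an illustration under stated assumptions, not the patented procedure: the function name `segment_by_feed` is hypothetical, the page is assumed to be already reduced to plain text, and excerpts are assumed to be literal prefixes of the posts, optionally ending in a `...` marker.

```python
def segment_by_feed(page_text, excerpts, marker="..."):
    """Split page_text into posts, using each feed excerpt as a start anchor."""
    starts = []
    for excerpt in excerpts:
        probe = excerpt.strip()
        # Drop the truncation marker so the probe is a literal prefix of the post.
        if probe.endswith(marker):
            probe = probe[: -len(marker)].rstrip()
        pos = page_text.find(probe) if probe else -1
        if pos >= 0:
            starts.append(pos)
    starts = sorted(set(starts))
    # Each post runs from its anchor to the next anchor, or to the end of the page.
    bounds = starts[1:] + [len(page_text)]
    return [page_text[s:e].strip() for s, e in zip(starts, bounds)]


page = (
    "Header junk First post starts here and continues with more detail. "
    "Second post begins now and also goes on."
)
posts = segment_by_feed(page, ["First post starts here...", "Second post begins now..."])
```

Text before the first anchor, such as a page header, is discarded. The last post simply runs to the end of the page, so a real extractor would need some further rule to trim footers and sidebars.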
Specification