Extraction and analysis of user-generated content
First Claim
1. A computer system including instructions stored on a computer-readable medium, the computer system comprising:
- at least one processor;
a page splitter configured to cause the at least one processor of the computer system to split a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts;
an aggregator configured to cause the at least one processor of the computer system to associate the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type; and
a merger configured to cause the at least one processor of the computer system to merge the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site.
2 Assignments
0 Petitions
Accused Products
Abstract
A page splitter may be configured to split a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts. An aggregator may be configured to associate the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type. A merger may be configured to merge the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site.
55 Citations
25 Claims
-
1. A computer system including instructions stored on a computer-readable medium, the computer system comprising:
-
at least one processor; a page splitter configured to cause the at least one processor of the computer system to split a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts; an aggregator configured to cause the at least one processor of the computer system to associate the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type; and a merger configured to cause the at least one processor of the computer system to merge the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method including executing instructions stored on a computer-readable medium, the method comprising:
-
splitting a first page of a site into first boilerplate and first posts and a second page of the site into second boilerplate and second posts; associating the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate; associating at least one of the first posts with a first post-type, at least one of the second posts with the first post-type, and at least one of the second posts with a second post-type; merging the first boilerplate and the second boilerplate into a boilerplate template; merging posts of the first post-type from the first page and from the second page into a first post-type template; merging posts of the second post-type from the second page into a second post-type template; and merging the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computer system comprising:
- one or more computing devices programmed to perform operations including;
splitting pages of a site containing user-generated content into boilerplate and post data, aggregating a page group of the split pages, based on similarity of boilerplate contained within the page group, aggregating groups of the posts within and among the pages of the page group, based on a post-type of posts within each group of posts, analyzing posts of each post-type and corresponding post group, to extract elements therefrom, and annotating the elements based on annotation criteria for inclusion within post-type templates corresponding to each post-type and corresponding post group. - View Dependent Claims (18, 19, 20)
- one or more computing devices programmed to perform operations including;
-
21. A computer system comprising:
one or more computing devices programmed to perform operations including splitting a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts; associating the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type; and merging the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site. - View Dependent Claims (22, 23, 24, 25)
Specification