System and method for identifying segments in a web resource
First Claim
Patent Images
1. A method of obtaining segment identifiers in an content page for use in displaying content, comprising:
- grouping items on the content page into segments based upon weightings of individual nodes;
merging smaller segments into larger segments based upon predefined criteria;
determining whether each of the remaining segments meet predefined constraints; and
if any of the remaining segments do not meet the predefined constraints, dividing the respective segments into sub-segments until the sub-segments meet the predefined constraints.
2 Assignments
0 Petitions
Accused Products
Abstract
A robust, lightweight, bottom-up segmentation method for Internet content. According to the present invention, individual segments are created based upon weights assigned according to document structure and markup elements and semantics. Smaller segments are then merged into larger segments by determining which portions of the content page are related to each other. The remaining segments are then intelligently divided based upon device constraints.
100 Citations
28 Claims
-
1. A method of obtaining segment identifiers in an content page for use in displaying content, comprising:
-
grouping items on the content page into segments based upon weightings of individual nodes;
merging smaller segments into larger segments based upon predefined criteria;
determining whether each of the remaining segments meet predefined constraints; and
if any of the remaining segments do not meet the predefined constraints, dividing the respective segments into sub-segments until the sub-segments meet the predefined constraints. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product for obtaining segment identifiers in an content page for use in displaying content, comprising:
-
computer code for grouping items on the content page into segments based upon weightings of individual nodes;
computer code for merging smaller segments into larger segments based upon predefined criteria;
computer code for determining whether each of the remaining segments meet predefined constraints; and
computer code for, if any of the remaining segments do not meet the predefined constraints, dividing the respective segments into sub-segments until the sub-segments meet the predefined device constraints. - View Dependent Claims (20, 21, 22, 23)
-
-
24. An electronic device, comprising:
-
a processor; and
a memory unit operatively connected to the processor and including computer program product for obtaining segment identifiers in an content page for use in displaying content on an electronic device, including;
computer code for grouping items on the content page into segments based upon weightings of individual nodes;
computer code for merging smaller segments into larger segments based upon predefined criteria;
computer code for determining whether each of the remaining segments meet predefined constraints; and
computer code for, if any of the remaining segments do not meet the predefined constraints, dividing the respective segments into sub-segments until the sub-segments meet the predefined constraints. - View Dependent Claims (25, 26, 27)
-
-
28. A system for obtaining segment identifiers in an content page for use in displaying content, comprising:
-
a remote terminal for transmitting an content page; and
an electronic device in communication with the remote terminal, the electronic device including a processor and a memory unit operatively connected to the processor, the memory unit comprising;
computer code for grouping items on the content page received from the remote terminal into segments based upon weightings of individual nodes;
computer code for merging smaller segments into larger segments based upon predefined criteria computer code for determining whether each of the remaining segments meet predefined constraints; and
computer code for, if any of the remaining segments do not meet the predefined constraints, dividing the respective segments into sub-segments until the sub-segments meet the predefined device constraints.
-
Specification