Small form factor web browsing
Abstract
A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.
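The visual boundary detection mentioned above is recited in the claims as projecting rectangular shapes onto perpendicular axes and placing separators where the quantity of projections falls below a predetermined threshold. The following is a minimal sketch of that idea, not the patented implementation. The rectangle layout `(left, top, right, bottom)`, the function names, the use of a simple per-position count as the "quantity" of projections, and the default threshold of 1 are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical rectangle layout: (left, top, right, bottom), in pixels,
# with right and bottom exclusive.
Rect = Tuple[int, int, int, int]


def projection_profile(rects: List[Rect], axis: str, length: int) -> List[int]:
    """Count, for each position along the axis, how many rectangles project onto it."""
    profile = [0] * length
    for left, top, right, bottom in rects:
        start, end = (left, right) if axis == "x" else (top, bottom)
        for i in range(max(start, 0), min(end, length)):
            profile[i] += 1
    return profile


def find_separators(profile: List[int], threshold: int = 1) -> List[Tuple[int, int]]:
    """Return half-open (start, end) runs where the profile is below the threshold."""
    separators = []
    run_start = None
    for i, value in enumerate(profile):
        if value < threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            separators.append((run_start, i))
            run_start = None
    if run_start is not None:
        separators.append((run_start, len(profile)))
    return separators


# Two blocks side by side on top, one full-width block below (assumed example data).
rects = [(0, 0, 40, 20), (60, 0, 100, 20), (0, 30, 100, 50)]

# Projecting all shapes onto the y axis exposes the blank band between the rows.
horizontal = find_separators(projection_profile(rects, "y", 50))

# Projecting only the top row onto the x axis exposes the gap between its blocks.
vertical_top = find_separators(projection_profile(rects[:2], "x", 100))

print(horizontal)    # [(20, 30)]
print(vertical_top)  # [(40, 60)]
```

Projecting every shape onto the x axis at once finds no separator here, because the full-width block covers the gap. Applying the projection again inside each region that a separator produces is one way to reach finer boundaries.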
14 Claims
1. A method comprising:
using a markup language tag tree of a web page to extract regions of the web page including a header, a footer, left and right side bar regions, and one or more body regions encompassed by the header region, the footer region, the left side bar region, and the right side bar region;
identifying visual boundaries within each said region by:
properties of the tags of the markup language tag tree;
projecting normal to an axis each shape represented by one or more semantic units of the tags of the markup language tag tree in each region; and
determining the additional visual boundaries from the projection values on the axis,
wherein the one or more semantic units of the tags of the markup language tag tree in each region is found by clustering the leaf markup language tags in a pattern detection algorithm.
Dependent claims: 2, 3

4. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more processors, cause the one or more processors to implement a method comprising:
using a markup language tag tree of a web page to extract regions of the web page including a header, a footer, left and right side bar regions, and one or more body regions encompassed by the header region, the footer region, the left side bar region, and the right side bar region;
identifying visual boundaries within each said region by:
properties of the tags of the markup language tag tree;
projecting normal to an axis each shape represented by one or more semantic units of the tags of the markup language tag tree in each region; and
determining the additional visual boundaries from the projection values on the axis,
wherein the one or more semantic units of the tags of the markup language tag tree in each region is found by clustering the leaf markup language tags in a pattern detection algorithm.
Dependent claims: 5, 6

7. A method comprising:
analyzing a markup language tag tree of a web page to identify:
peripheral regions of the web page including header, footer, left, and right regions; and
one or more body regions adjacent to at least one said peripheral region;
within the markup language tag tree that defines each said peripheral and body region;
identifying visual boundaries given in the properties of the tags of the markup language tag tree; and
identifying blank areas by:
analyzing one or more functions on the basis of a layout structure of each said function by:
configuring each said function into a rectangle;
projecting each rectangle normally onto each of perpendicular axes; and
determining one or more separators that are each normal to the axes as a function of the sum of the projections on each axis,
wherein the identifying blank areas further comprises:
processing the markup language tag tree defining each said peripheral and body region to identify therein one or more objects each having one or more properties relating to at least one function thereof; and
analyzing each said object in each said peripheral and body region by:
grouping the object into a shape that contains one or more properties relating to at least one function;
separating each said shape from the other shapes by a separator, wherein each said shape is a rectangle having sides parallel or normal to that of the other rectangles and to the separators;
inserting blanks between the shapes;
projecting each of the shapes onto two perpendicular axes;
quantifying the projections along each said axis; and
identifying each said separator as being normal to each section along each said axis where the quantity of the projections is less than a predetermined threshold.
Dependent claims: 8, 9, 10

11. One or more computer-readable media having computer-readable instructions thereon which, when executed by one or more processors, cause the one or more processors to implement a method comprising:
analyzing a markup language tag tree of a web page to identify:
peripheral regions of the web page including header, footer, left, and right regions; and
one or more body regions adjacent to at least one said peripheral region;
within the markup language tag tree that defines each said peripheral and body region;
identifying visual boundaries given in the properties of the tags of the markup language tag tree; and
identifying blank areas by:
analyzing one or more functions on the basis of a layout structure of each said function by:
configuring each said function into a rectangle;
projecting each rectangle normally onto each of perpendicular axes; and
determining one or more separators that are each normal to the axes as a function of the sum of the projections on each axis,
wherein the identifying blank areas further comprises:
processing the markup language tag tree defining each said peripheral and body region to identify therein one or more objects each having one or more properties relating to at least one function thereof; and
analyzing each said object in each said peripheral and body region by:
grouping the object into a shape that contains one or more properties relating to at least one function;
separating each said shape from the other shapes by a separator, wherein each said shape is a rectangle having sides parallel or normal to that of the other rectangles and to the separators;
inserting blanks between the shapes;
projecting each of the shapes onto two perpendicular axes;
quantifying the projections along each said axis; and
identifying each said separator as being normal to each section along each said axis where the quantity of the projections is less than a predetermined threshold.
Dependent claims: 12, 13, 14

Specification