Small form factor web browsing
First Claim
1. A method comprising:
- analyzing, on a computer, a markup language tag tree of a web page to identify peripheral regions around periphery of the web page and any body regions between the peripheral regions, wherein the peripheral regions comprise header, footer, left, and right regions, the left and right regions are defined, respectively, as being the left and right one third of the web page; and
the header and footer regions are defined, respectively, as being in an upper N and lower N pixels of the web page, wherein:
each of the header and footer regions contains one or more markup language tag regions; and
each markup language tag region has a height and width;
N is a dynamic threshold for the header region determined by a base threshold added to a function of the height of the tag region divided by the width of the tag region;
within the markup language tag tree defining each peripheral and body region:
identifying a first set of visual boundaries from properties of the tags of the markup language tag tree; and
detecting patterns in leaf markup language tags of the markup language tag tree to find one or more basic semantic units each having a shape;
projecting the shape of each basic semantic unit normal to perpendicular axes;
identifying a second set of visual boundaries of the web page from the projection values on each perpendicular axis;
within the markup language tag tree that defines each peripheral and body region, identifying each logical block that is adjacent to any of the first set or the second set of identified visual boundaries; and
storing, using an annotation mechanism, a representation of the web page including:
the logical blocks within each peripheral and body region;
the peripheral regions;
the body regions; and
the visual boundaries.
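The region definitions recited in the claim (left and right thirds, plus a header and footer band of N pixels with N = base threshold + f(height / width)) can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the patent: the `Box` type, the `base` and `scale` values, the choice of a linear f, reusing the same N for the footer, and the rule that a tag region must lie entirely inside a band to be assigned to it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Rendered bounding box of one markup tag region, in page pixels (hypothetical type)."""
    x: float
    y: float
    width: float
    height: float


def dynamic_threshold(box, base=100.0, scale=20.0):
    """N = base + f(height / width). f is assumed linear here; base and scale are made-up values."""
    ratio = box.height / box.width if box.width > 0 else 0.0
    return base + scale * ratio


def classify(box, page_width, page_height, base=100.0, scale=20.0):
    """Assign a tag region to header, footer, left, right, or body."""
    n = dynamic_threshold(box, base, scale)
    if box.y + box.height <= n:            # entirely inside the upper N pixels
        return "header"
    if box.y >= page_height - n:           # entirely inside the lower N pixels
        return "footer"
    if box.x + box.width <= page_width / 3:    # entirely inside the left third
        return "left"
    if box.x >= 2 * page_width / 3:            # entirely inside the right third
        return "right"
    return "body"


if __name__ == "__main__":
    for b in (Box(0, 0, 900, 90), Box(0, 300, 250, 800), Box(300, 300, 300, 800)):
        print(b, classify(b, page_width=900, page_height=2000))
```

Because N grows with the height-to-width ratio in this sketch, a wide, short banner is tested against a threshold close to the base value, while a tall, narrow region is tested against a larger one.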
Abstract
A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.
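One way to picture the extracted representation structure is as a small record of logical blocks, each tagged with its region, together with the detected visual boundaries. The sketch below is only illustrative: the class names, fields, JSON serialization, and the idea of treating each region's blocks as one candidate sub-page are assumptions, not the annotation mechanism or partitioning rule described in the patent.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class LogicalBlock:
    block_id: str
    region: str   # "header", "footer", "left", "right", or "body"
    box: tuple    # (x, y, width, height) in page pixels


@dataclass
class PageRepresentation:
    url: str
    blocks: list = field(default_factory=list)
    boundaries: list = field(default_factory=list)  # e.g. ("horizontal", 50, 70)

    def sub_pages(self):
        """Group block ids by region; each group is one candidate sub-page (assumed rule)."""
        groups = {}
        for block in self.blocks:
            groups.setdefault(block.region, []).append(block.block_id)
        return groups

    def to_json(self):
        """Serialize the whole representation so it can be stored or sent to a device."""
        return json.dumps(asdict(self), sort_keys=True)


if __name__ == "__main__":
    rep = PageRepresentation(
        url="http://example.com/",
        blocks=[
            LogicalBlock("b1", "header", (0, 0, 900, 90)),
            LogicalBlock("b2", "body", (300, 300, 300, 400)),
            LogicalBlock("b3", "body", (300, 720, 300, 400)),
        ],
        boundaries=[("horizontal", 700, 720)],
    )
    print(rep.sub_pages())
    print(rep.to_json())
```

A serialized form like this could in principle be produced at a web server or edge server and consumed on the small form factor device, which matches the distributed partitioning the abstract mentions, though the actual storage format is not stated here.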
63 Citations
17 Claims
12. A computer-readable storage device including computer instructions executable on a computer to perform the acts comprising:
- analyzing a markup language tag tree of a web page to identify peripheral regions around periphery of the web page and any body regions between the peripheral regions, wherein the peripheral regions comprise header, footer, left, and right regions, the left and right regions are defined, respectively, as being the left and right one third of the web page; and
the header and footer regions are defined, respectively, as being in an upper N and lower N pixels of the web page, wherein:
each of the header and footer regions contains one or more markup language tag regions; and
each markup language tag region has a height and width;
N is a dynamic threshold for the header region determined by a base threshold added to a function of the height of the tag region divided by the width of the tag region;
within the markup language tag tree defining each peripheral and body region:
identifying a first set of visual boundaries from properties of the tags of the markup language tag tree; and
detecting patterns in leaf markup language tags of the markup language tag tree to find one or more basic semantic units each having a shape;
projecting the shape of each basic semantic unit normal to perpendicular axes;
identifying a second set of visual boundaries of the web page from the projection values on each perpendicular axis;
within the markup language tag tree that defines each peripheral and body region, identifying each logical block that is adjacent to any of the first set or the second set of identified visual boundaries; and
storing, using an annotation mechanism, a representation of the web page including:
the logical blocks within each peripheral and body region;
the peripheral regions;
the body regions; and
the visual boundaries.
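The projection step recited in the independent claims (projecting the shape of each basic semantic unit onto perpendicular axes and reading visual boundaries from the projection values) resembles a projection-profile analysis. The following is a minimal sketch under stated assumptions: units are axis-aligned rectangles with integer pixel coordinates, a boundary is taken to be any run of zero coverage at least `min_gap` pixels long between covered positions, and the function names are hypothetical. The claims do not say how boundaries are derived from the projection values, so the zero-gap rule is an illustrative choice.

```python
def projection_profile(intervals, length):
    """Count how many units cover each pixel position along one axis."""
    profile = [0] * length
    for start, end in intervals:
        for p in range(max(start, 0), min(end, length)):
            profile[p] += 1
    return profile


def gaps(profile, min_gap=1):
    """Return (start, end) runs of zero coverage that lie between covered positions."""
    result = []
    run_start = None
    seen_content = False
    for pos, count in enumerate(profile):
        if count == 0:
            if run_start is None:
                run_start = pos
        else:
            if run_start is not None and seen_content and pos - run_start >= min_gap:
                result.append((run_start, pos))
            run_start = None
            seen_content = True
    return result


def visual_boundaries(boxes, page_width, page_height, min_gap=5):
    """boxes are (x, y, width, height) tuples; returns gaps found on each axis."""
    xs = projection_profile([(x, x + w) for x, y, w, h in boxes], page_width)
    ys = projection_profile([(y, y + h) for x, y, w, h in boxes], page_height)
    # A gap in the x projection is a vertical separator; a gap in y is a horizontal one.
    return {"vertical": gaps(xs, min_gap), "horizontal": gaps(ys, min_gap)}


if __name__ == "__main__":
    units = [(0, 0, 100, 50), (0, 70, 100, 50), (120, 0, 80, 120)]
    print(visual_boundaries(units, page_width=200, page_height=120))
```

In the usage example the tall unit on the right spans the full page height, so the y projection has no gap for the page as a whole; applying the same function separately to the units of one region would expose the horizontal gap between the two stacked units.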
Specification