Small Form Factor Web Browsing
First Claim
1. A computer architecture to adapt web content for display on a small form factor device having a display screen with a width, the architecture comprising:
- a high level structure analysis module executed on one or more processors to receive a web page and to analyze a markup language tag tree of a markup language document representing the web page to identify;
peripheral regions of the web page including a header, a footer, a left, and a right regions; and
body regions enclosed by the peripheral regions;
a low level structure analysis module executed on the one or more processors to analyze the markup language tag tree defining the peripheral regions and the body regions to;
identify visual boundaries from analyzing properties of tags of the markup language tag tree;
detect patterns in leaf markup language tags to find one or more basic semantic units each having a shape;
project the shape of each basic semantic unit normal to perpendicular axes; and
identify the visual boundaries of the web page based on projection values for each axis.
2 Assignments
0 Petitions
Accused Products
Abstract
A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.
10 Citations
20 Claims
-
1. A computer architecture to adapt web content for display on a small form factor device having a display screen with a width, the architecture comprising:
-
a high level structure analysis module executed on one or more processors to receive a web page and to analyze a markup language tag tree of a markup language document representing the web page to identify; peripheral regions of the web page including a header, a footer, a left, and a right regions; and body regions enclosed by the peripheral regions; a low level structure analysis module executed on the one or more processors to analyze the markup language tag tree defining the peripheral regions and the body regions to; identify visual boundaries from analyzing properties of tags of the markup language tag tree; detect patterns in leaf markup language tags to find one or more basic semantic units each having a shape; project the shape of each basic semantic unit normal to perpendicular axes; and identify the visual boundaries of the web page based on projection values for each axis. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A server comprising:
-
a memory; a processor coupled to the memory configured to communicate with a small form factor client through a network, for performing acts comprising; using a markup language tag tree of a web page to extract regions of the web page including a header, a footer, a left, and a right side bar regions, and one or more body regions adjacent to the header, the footer, the left, and the right side bar regions; identifying visual boundaries within the header, the footer, the left, and the right side bar regions by; analyzing properties of tags of the markup language tag tree; grouping one or more objects associated with the header, the footer, the left, and the right side bar regions into a shape; projecting the shape normal to an axis so that the shape is represented by one or more semantic units of tags of the markup language tag tree in each region; and determining additional visual boundaries from projection values for the axis; and identifying a plurality of sub-pages of the web page using the identified visual boundaries and the header, the footer, the left, and the right side bar regions, where each sub-page has a width not greater than a predetermined width of the display screen. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14)
-
-
15. A method implemented at least in part by a processor, the method comprising:
-
partitioning a markup language document representing a web page into one or more sub-pages; deriving from the markup language document of the web page a hierarchy of one or more logical blocks for each sub-page, such that; each logical block of each sub-page is a function of a semantic structure of the markup language document; and the hierarchy of the one or more logical blocks of each sub-page does not correspond to one markup language tree tag; generating a thumbnail image of the web page having partitions corresponding to the one or more sub-pages; and associating a hyperlink to a sub-page from a corresponding partition of the thumbnail image of the web page. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification