Extracting a portion of a document, such as a web page
First Claim
1. A system comprising:
- an extraction computer system configured to cause:
determining, using scrolling adjusted coordinates, a location of a pointer in relation to nodes in a first formatted document, each node comprising a respective one or more formatting attributes defining a graphical representation of the node;
identifying a first subtree of the nodes based on a first one of the nodes being nearer to the pointer than one or more others of the nodes;
generating a second subtree of the nodes based on each node of the first subtree, each node of the second subtree having an identical one or more formatting attributes to a corresponding node of the first subtree;
compressing the second subtree, the compressed second subtree capable of being stored on a storage device;
extracting the compressed second subtree; and
displaying a presentation of the extracted second subtree in a second formatted document different from the first formatted document, the presentation of the extracted second subtree sharing characteristics of a presentation of the first subtree in the first formatted document.
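The steps recited in claim 1 can be illustrated with a minimal sketch. It is only an illustration under stated assumptions, not the patented implementation: the `Node` class, the rectangle-distance rule, the choice to root the first subtree at the nearest node, the JSON-plus-zlib encoding, and the inline-style rendering are all hypothetical choices made for this example.

```python
import html
import json
import zlib
from dataclasses import dataclass, field


@dataclass
class Node:  # hypothetical node model, not taken from the patent
    tag: str
    box: tuple  # (left, top, width, height) in document coordinates
    style: dict = field(default_factory=dict)  # formatting attributes
    children: list = field(default_factory=list)
    text: str = ""


def to_document_coords(client_x, client_y, scroll_x, scroll_y):
    """Adjust viewport-relative pointer coordinates by the scroll offset."""
    return client_x + scroll_x, client_y + scroll_y


def distance_to_box(point, box):
    """Distance from a point to a rectangle (0 when the point is inside)."""
    px, py = point
    left, top, width, height = box
    dx = max(left - px, 0, px - (left + width))
    dy = max(top - py, 0, py - (top + height))
    return (dx * dx + dy * dy) ** 0.5


def walk(node):
    """Visit a node and its descendants in pre-order."""
    yield node
    for child in node.children:
        yield from walk(child)


def nearest_node(root, point):
    """Pick the node nearest the pointer; ties go to the later (deeper) node."""
    best, best_d = None, None
    for node in walk(root):
        d = distance_to_box(point, node.box)
        if best is None or d <= best_d:
            best, best_d = node, d
    return best


def copy_subtree(node):
    """Build a second subtree whose nodes carry identical formatting attributes."""
    return {
        "tag": node.tag,
        "style": dict(node.style),
        "text": node.text,
        "children": [copy_subtree(c) for c in node.children],
    }


def compress(subtree):
    """Serialize and compress the subtree into bytes suitable for storage."""
    return zlib.compress(json.dumps(subtree, sort_keys=True).encode("utf-8"))


def extract(blob):
    """Recover the subtree from its compressed form."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))


def render(subtree):
    """Render the subtree as markup with inline styles for a second document."""
    css = "; ".join(f"{k}: {v}" for k, v in sorted(subtree["style"].items()))
    inner = html.escape(subtree["text"]) + "".join(render(c) for c in subtree["children"])
    tag = subtree["tag"]
    return f'<{tag} style="{css}">{inner}</{tag}>'


# Example page: the pointer is at viewport (50, 120) after scrolling down 800 units.
page = Node("body", (0, 0, 800, 2000), {"color": "black"}, [
    Node("div", (0, 0, 800, 100), {"color": "gray"}, text="header"),
    Node("div", (0, 900, 800, 200), {"color": "navy", "font-weight": "bold"}, [
        Node("span", (10, 910, 200, 20), {"font-style": "italic"}, text="hello"),
    ]),
])

pointer = to_document_coords(50, 120, scroll_x=0, scroll_y=800)
first_subtree = nearest_node(page, pointer)
second_subtree = copy_subtree(first_subtree)
blob = compress(second_subtree)
restored = extract(blob)
markup = render(restored)
```

In this sketch the scroll offset moves the pointer from viewport position (50, 120) to document position (50, 920), which falls inside the `span`, so that node roots the first subtree. The copy survives the compress and extract round trip unchanged, and the rendered markup carries the same formatting attributes as the original node.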
Abstract
A portion data structure representing a portion extracted from a formatted source document is described. A portion data structure contains a first subtree of nodes that is modeled after a second subtree of a complete hierarchical representation of the formatted source document. Explicit formatting attribute values are specified for nodes of the first subtree only where a value calculated for the formatting attribute in a node of the first subtree differs from a value calculated for the formatting attribute in the corresponding node in the second subtree at a time when the node of the first subtree descends from a reset node specifying standardized formatting attribute values. The contents of the portion data structure are usable to display the portion extracted from the formatted source document in a context other than the formatted source document.
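The rule described in the abstract, where an explicit formatting attribute value is stored only when the copied node would otherwise compute a different value than the source node, can be sketched as follows. This is a simplified illustration, not the described implementation: `RESET_DEFAULTS`, the dictionary layout, and the assumption that every attribute is inherited from the parent are hypothetical simplifications.

```python
# Hypothetical standardized values supplied by the reset node.
RESET_DEFAULTS = {"color": "black", "font-weight": "normal", "font-style": "normal"}


def build_portion(source, inherited):
    """Copy a source subtree, keeping only the attributes that must be explicit.

    source:    dict with "tag", "computed" (values calculated in the source
               document) and "children".
    inherited: values the copied node would compute with no explicit style,
               assuming (for simplicity) that every attribute is inherited.
    """
    explicit = {
        name: value
        for name, value in source["computed"].items()
        if inherited.get(name) != value
    }
    # Values the copied node computes once its explicit attributes are applied.
    computed_in_copy = {**inherited, **explicit}
    return {
        "tag": source["tag"],
        "style": explicit,
        "children": [build_portion(c, computed_in_copy) for c in source["children"]],
    }


source_subtree = {
    "tag": "div",
    "computed": {"color": "navy", "font-weight": "bold", "font-style": "normal"},
    "children": [
        {
            "tag": "span",
            "computed": {"color": "navy", "font-weight": "bold", "font-style": "italic"},
            "children": [],
        }
    ],
}

portion = build_portion(source_subtree, RESET_DEFAULTS)
```

Here the `div` keeps explicit `color` and `font-weight` values because they differ from the reset defaults, while the `span` keeps only `font-style`, since its other values already follow from its parent in the copy.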
20 Claims
8. A method comprising:
determining, using scrolling adjusted coordinates, a location of a pointer in relation to nodes in a first formatted document, each node comprising a respective one or more formatting attributes defining a graphical representation of the node;
identifying a first subtree of the nodes based on a first one of the nodes being nearer to the pointer than one or more others of the nodes;
generating a second subtree of the nodes based on each node of the first subtree, each node of the second subtree having an identical one or more formatting attributes to a corresponding node of the first subtree;
compressing the second subtree, the compressed second subtree capable of being stored on a storage device;
extracting the compressed second subtree; and
causing display of a presentation of the extracted second subtree in a second formatted document different from the first formatted document, the presentation of the extracted second subtree sharing characteristics of a presentation of the first subtree in the first formatted document.
15. A computer program product comprising computer-readable program code to be executed by one or more processors when retrieved from a non-transitory computer-readable medium, the program code including instructions configurable to cause:
determining, using scrolling adjusted coordinates, a location of a pointer in relation to nodes in a first formatted document, each node comprising a respective one or more formatting attributes defining a graphical representation of the node;
identifying a first subtree of the nodes based on a first one of the nodes being nearer to the pointer than one or more others of the nodes;
generating a second subtree of the nodes based on each node of the first subtree, each node of the second subtree having an identical one or more formatting attributes to a corresponding node of the first subtree;
compressing the second subtree, the compressed second subtree capable of being stored on a storage device;
extracting the compressed second subtree; and
displaying a presentation of the extracted second subtree in a second formatted document different from the first formatted document, the presentation of the extracted second subtree sharing characteristics of a presentation of the first subtree in the first formatted document.
Specification