×

Extracting a portion of a document, such as a web page

  • US 9,430,583 B1
  • Filed: 06/10/2011
  • Issued: 08/30/2016
  • Est. Priority Date: 06/10/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method caused to be performed by at least one computing system having a processor, the method comprising:

  • generating a list of nodes in a rendered version of a subject web page, each node having a respective location score having a value identical to other location scores of each node in the list;

    determining a location of a pointer displayed in relation to the rendered version of the subject web page according to scrolling adjusted coordinates of the displayed pointer, the location indicating a first node of the list of nodes, the first node having a first location score reduced by a factor;

    for each node of the list of nodes, updating the respective location score, the updated respective location score for each node having an updated value based on a distance of the pointer from a corner associated with the node and a square root of an area of the node;

    identifying, based on a lowest updated value of the updated location scores, a portion of the rendered version of the subject web page corresponding to at least one subtree of a document object model tree created for the subject web page;

    establishing in the document object model tree a reset node comprising a stylesheet specifying a predetermined standardized set of formatting attribute values inheritable by descendents of the reset node;

    for each subtree of the document object model tree created for the subject web page to which the identified first node of the rendered version of the subject web page corresponds;

    traversing the subtree;

    for each node of the subtree visited during the traversal;

    establishing a corresponding node as a descendent of the reset node, the established corresponding node having a type matching a type of the node of the subtree;

    where the node of the subtree has calculated values for any of a plurality of formatting attributes, for each of the plurality of formatting attributes;

    determining a calculated value of the formatting attribute in the node of the subtree;

    determining a calculated value of the formatting attribute in the corresponding node, the determined calculated value of the formatting attribute in the corresponding node being inherited from the predetermined standardized set of formatting attribute values;

    determining that the calculated values differ; and

    only when it is determined that calculated values differ, explicitly specifying for the corresponding node the determined calculated value of the formatting attribute in the node of the subtree.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×