×

IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT

  • US 20090158138A1
  • Filed: 12/14/2007
  • Published: 06/18/2009
  • Est. Priority Date: 12/14/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving an electronic document that comprises a plurality of sections; and

    marking the plurality of sections as a content section or a non-content section using an attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section; and

    storing the marking of the plurality of sections of the electronic document in a machine-readable medium.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×