×

DOCUMENT LAYOUT EXTRACTION

  • US 20090144614A1
  • Filed: 12/03/2007
  • Published: 06/04/2009
  • Est. Priority Date: 12/03/2007
  • Status: Active Grant
First Claim
Patent Images

1. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for extracting information from a document in an electronic format to produce a representation containing structure and layout metadata, the method comprising:

  • receiving one or more textual data in the electronic format;

    converting the textual data from the electronic format to an independent interface format, the independent interface format including coordinates to one or more structural elements of the textual data;

    performing a structure and layout analysis of the textual data to generate a set of structure and layout information; and

    storing the textual data and the set of structure and layout information in an enriched interface format, the enriched interface format providing for search and navigation of the textual data.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×