Please download the dossier by clicking on the dossier button x
×

Content profiling to dynamically configure content processing

  • US 8,473,467 B2
  • Filed: 06/07/2009
  • Issued: 06/25/2013
  • Est. Priority Date: 01/02/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method for defining a program for reconstructing a document, the method comprising:

  • defining a default set of document reconstruction operations for (i) identifying sets of primitive elements in an unstructured document that comprises a plurality of unassociated primitive elements and (ii) defining associations between the sets of primitive elements as structural elements in order to define a structured document from the unstructured document, wherein the primitive elements comprise at least one of glyphs and vector graphics;

    defining a hierarchical set of profiles, each particular profile comprising (i) a set of clauses specifying potential results of previously-performed document reconstruction operations and (ii) instructions for modifying the set of document reconstruction operations to perform when actual results of the previously-performed document reconstruction operations match the set of clauses for the particular profile in order to define a modified set of document reconstruction operations different from the default set of document reconstruction operations, the results of the previously-performed document reconstruction operations comprising the structural elements defined as associations between sets of primitive elements of the document by at least one of the performed document reconstruction operations, wherein instructions from a profile at a lower level in the hierarchical set of profiles override instructions from a profile at a higher level; and

    defining a module for matching results of the previously-performed document reconstruction operations for a particular portion of the document to one of a plurality of profiles at a particular level in the hierarchical set of profiles in order to modify the set of document reconstruction operations to perform for the particular portion of the document, wherein the particular level of the hierarchy corresponds to the particular portion of the document.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×