Content Profiling to Dynamically Configure Content Processing
First Claim
1. A method for defining a program for reconstructing a document, the method comprising:
- defining a default set of document reconstruction operations for defining a structured document from a document that comprises a plurality of primitive elements;
defining a hierarchical set of profiles, each profile comprising (i) a set of potential document reconstruction results and (ii) instructions for modifying the document reconstruction operations when intermediate document reconstruction results match the potential document reconstruction results for the profile, wherein instructions from a profile at a lower level in the hierarchy override instructions from a profile at a higher level;
defining a module for matching intermediate document reconstruction results to a profile.
1 Assignment
0 Petitions
Accused Products
Abstract
Some embodiments provide a method that receives an unstructured document including a number of primitive elements. The method identifies a default set of document reconstruction operations for reconstructing the unstructured document to define a structured document the method performs at least one of the document reconstruction operations from the default set. Based on results of the performed document reconstruction operations, the method identifies a profile for the unstructured document. The method modifies the set of document reconstruction operations for reconstructing the unstructured document according to the identified profile.
181 Citations
24 Claims
-
1. A method for defining a program for reconstructing a document, the method comprising:
-
defining a default set of document reconstruction operations for defining a structured document from a document that comprises a plurality of primitive elements; defining a hierarchical set of profiles, each profile comprising (i) a set of potential document reconstruction results and (ii) instructions for modifying the document reconstruction operations when intermediate document reconstruction results match the potential document reconstruction results for the profile, wherein instructions from a profile at a lower level in the hierarchy override instructions from a profile at a higher level; defining a module for matching intermediate document reconstruction results to a profile. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer readable medium storing a computer program for execution by at least one processor, the computer program comprising sets of instructions for:
-
receiving a document comprising a plurality of primitive elements; identifying a default set of document reconstruction operations for reconstructing the document to define a structured document; performing one or more of the document reconstruction operations from the default set; based on results of the performed document reconstruction operations, identifying a profile for the unstructured document; and modifying the set of document reconstruction operations for reconstructing the unstructured document according to the identified profile. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
Specification