×

Method for extracting, interpreting and standardizing tabular data from unstructured documents

  • US 20060288268A1
  • Filed: 05/27/2005
  • Published: 12/21/2006
  • Est. Priority Date: 05/27/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing unstructured documents containing tabular data, the method comprising the steps of:

  • a. identifying a table in the unstructured document using a set of identification rules;

    b. tokenizing the content of the identified table using a set of parsing rules;

    c. interpreting the tokenized content of the table using a set of mapping rules; and

    d. standardizing the content of the table using a set of standardization rules.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×