Index extraction from documents

  • US 20060036614A1
  • Filed: 08/12/2004
  • Published: 02/16/2006
  • Est. Priority Date: 08/12/2004
  • Status: Active Grant
First Claim

1. A method for index extraction, comprising the steps of:

    storing a plurality of ground truth documents in a database, the documents being organized according to a plurality of classifications, each classification having a group of predefined indices;

    classifying a document by drawing an association between the document to be indexed and one of the classifications;

    attempting to extract from the document at least a subset of the group of predefined indices associated with the one of the classifications; and

    attempting to find and correct at least one text recognition error in the document based upon a salient dictionary associated with the one of the classifications upon a failure to extract the subset of the group of predefined indices.
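
The flow of the claimed steps can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the in-memory `CLASSIFICATIONS` table standing in for the ground truth database, keyword overlap as the classifier, regular expressions as the predefined indices, and fuzzy matching against the salient dictionary (via the standard library's `difflib.get_close_matches`) as the text recognition error correction.

```python
import re
from difflib import get_close_matches

# Hypothetical stand-in for the ground truth database: each classification
# carries keywords for classification, its predefined indices (as regexes),
# and a salient dictionary. All names and patterns here are illustrative.
CLASSIFICATIONS = {
    "invoice": {
        "keywords": {"invoice", "total", "due"},
        "indices": {
            "invoice_number": re.compile(r"invoice\s+number[:\s]+(\w+)", re.I),
            "total": re.compile(r"total[:\s]+\$?([\d.,]+)", re.I),
        },
        "salient_dictionary": ["invoice", "number", "total", "due"],
    },
    "receipt": {
        "keywords": {"receipt", "paid", "change"},
        "indices": {"paid": re.compile(r"paid[:\s]+\$?([\d.,]+)", re.I)},
        "salient_dictionary": ["receipt", "paid", "change"],
    },
}


def classify(text):
    """Associate the document with the classification sharing the most keywords."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return max(
        CLASSIFICATIONS,
        key=lambda name: len(words & CLASSIFICATIONS[name]["keywords"]),
    )


def extract(text, name):
    """Attempt to extract each predefined index of the given classification."""
    found = {}
    for index_name, pattern in CLASSIFICATIONS[name]["indices"].items():
        match = pattern.search(text)
        if match:
            found[index_name] = match.group(1)
    return found


def correct(text, name):
    """Replace near-miss words with their closest salient-dictionary entry."""
    dictionary = CLASSIFICATIONS[name]["salient_dictionary"]

    def fix(match):
        word = match.group(0)
        if word.lower() in dictionary:
            return word  # already a known word, leave it alone
        close = get_close_matches(word.lower(), dictionary, n=1, cutoff=0.8)
        return close[0] if close else word

    return re.sub(r"\b[A-Za-z]+\b", fix, text)


def extract_indices(text):
    """Classify, extract, and retry after correction only if extraction fell short."""
    name = classify(text)
    found = extract(text, name)
    if len(found) < len(CLASSIFICATIONS[name]["indices"]):
        found = extract(correct(text, name), name)
    return name, found
```

For example, calling `extract_indices("lnvoice Number: A123\nTotal: $45.00 due")` classifies the text as an invoice, fails to find the invoice number on the first pass because of the misrecognized word "lnvoice", and finds it on the second pass after dictionary-based correction. The correction step runs only when the first extraction attempt is incomplete, mirroring the "upon a failure to extract" condition in the claim.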
