
Index extraction from documents

  • US 20060036649A1
  • Filed: 08/12/2004
  • Published: 02/16/2006
  • Est. Priority Date: 08/12/2004
  • Status: Abandoned Application
First Claim

1. A method for indexing documents, comprising the steps of:

    storing a plurality of ground truth documents in a database, the ground truth documents being organized in a plurality of classifications;

    attempting to automatically extract indices from a document based upon a classification associated with the document;

    reclassifying the document from a first one of the classifications to a second one of the classifications during the course of the automated extraction of the indices by drawing an association between the document and at least one of the ground truth documents;

    manually extracting the indices from the document upon a failure to automatically extract the indices; and

    storing the document in the database as one of the ground truth documents if the indices are manually extracted.
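
The steps of claim 1 can be sketched in code. The following is a minimal illustration only, not the applicant's implementation: the names `GroundTruthDB`, `auto_extract`, `index_document`, and `PATTERNS`, the regex-based extraction rules, and the use of token-overlap (Jaccard) similarity to "draw an association" with a ground truth document are all assumptions made for this example.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Indices = Dict[str, str]


@dataclass
class GroundTruthDoc:
    text: str
    classification: str
    indices: Indices


@dataclass
class GroundTruthDB:
    """Hypothetical in-memory store of ground truth documents."""
    docs: List[GroundTruthDoc] = field(default_factory=list)

    def closest(self, text: str) -> Optional[GroundTruthDoc]:
        # Assumption: association is approximated by Jaccard token overlap.
        tokens = set(text.lower().split())
        best, best_score = None, 0.0
        for doc in self.docs:
            other = set(doc.text.lower().split())
            union = tokens | other
            score = len(tokens & other) / len(union) if union else 0.0
            if score > best_score:
                best, best_score = doc, score
        return best


# Hypothetical per-classification extraction rules: field name -> regex.
PATTERNS: Dict[str, Dict[str, str]] = {
    "invoice": {"number": r"Invoice No\.?\s*(\w+)"},
    "order": {"number": r"Order ID\s*(\w+)"},
}


def auto_extract(text: str, classification: str) -> Optional[Indices]:
    """Return the indices, or None if any rule for the classification fails."""
    rules = PATTERNS.get(classification)
    if not rules:
        return None
    found: Indices = {}
    for name, pattern in rules.items():
        match = re.search(pattern, text)
        if match is None:
            return None
        found[name] = match.group(1)
    return found


def index_document(
    text: str,
    classification: str,
    db: GroundTruthDB,
    manual_extract: Callable[[str], Indices],
) -> Tuple[str, Indices]:
    # Attempt automatic extraction under the initial classification.
    indices = auto_extract(text, classification)
    if indices is None:
        # Reclassify by association with the most similar ground truth document.
        nearest = db.closest(text)
        if nearest is not None and nearest.classification != classification:
            classification = nearest.classification
            indices = auto_extract(text, classification)
    if indices is None:
        # Fall back to manual extraction and keep the result as ground truth.
        indices = manual_extract(text)
        db.docs.append(GroundTruthDoc(text, classification, indices))
    return classification, indices
```

In this sketch, a document that was first labeled `"invoice"` but resembles a stored `"order"` document is reclassified and extracted automatically, while a document that still fails is passed to the `manual_extract` callback and then stored as a new ground truth document.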
