×

Modular, folder based approach for semi-automated document classification

  • US 20100257127A1
  • Filed: 08/26/2008
  • Published: 10/07/2010
  • Est. Priority Date: 08/27/2007
  • Status: Abandoned Application
First Claim
Patent Images

1. A document classification system for classifying text documents into a particular category in a complex ontology comprising a set of entity means which:

  • (a) use a set of folders, and folder monitoring processes operating on documents to classify them within a subset of the ontology or domain of interest;

    (b) use an automated text classification module to make a preliminary classification of documents into a category of interest associated with the entity whereby a classification module is able to use an example set of appropriately classified documents to train itself to classify new documents that match the categories in the entity'"'"'s domain of interest with a measurable degree of accuracy;

    (c) use an external final decision step to determine whether the initial automated classification is appropriate; and

    (d) use an iterative process consisting of an automated re-classification step, in conjunction with an external decision step, to either locate the appropriate classification within the domain of interest for the entity, or to reject the document from the entity'"'"'s domain of interest to be handled by some other process.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×