×

SYSTEM AND METHOD FOR AUTOMATIC DOCUMENT CLASSIFICATION IN EDISCOVERY, COMPLIANCE AND LEGACY INFORMATION CLEAN-UP

  • US 20140156567A1
  • Filed: 12/04/2012
  • Published: 06/05/2014
  • Est. Priority Date: 12/04/2012
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented system for automatic document classification, the system comprising:

  • an extraction module configured to extract structural, syntactical and/or semantic information from a document and normalize the extracted information;

    a machine learning module configured to generate a model representation for automatic document classification based on feature vectors built from the normalized and extracted semantic information for supervised and/or unsupervised clustering or machine learning; and

    a classification module configured to select a non-classified document from a document collection, and via the extraction module extract normalized structural, syntactical and/or semantic information from the selected document, and generate via the machine learning module a model representation of the selected document based on feature vectors, and match the model representation of the selected document against the machine learning model representation to generate a document category, and/or classification for display to a user.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×