
ENHANCED IDENTIFICATION OF DOCUMENT TYPES

  • US 20120041955A1
  • Filed: 08/10/2010
  • Published: 02/16/2012
  • Est. Priority Date: 08/10/2010
  • Status: Abandoned Application
First Claim

1. A method for document management, the method comprising:

  • automatically extracting respective features from each of a set of documents;

  • processing the features in a computer so as to generate respective vectors for the documents, each vector comprising elements having respective values that represent properties of a respective document;

  • assessing a similarity between the documents by computing a measure of distance between the respective vectors; and

  • automatically clustering the documents responsively to the similarity so as to identify a cluster of the documents belonging to a common document type.
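
The four steps of the claim can be illustrated with a minimal sketch. The claim does not name a feature type, a distance measure, or a clustering algorithm, so everything specific below is an assumption made for illustration: word counts as features, cosine distance as the measure, a single-linkage grouping with a hypothetical `threshold` parameter as the clustering step, and invented sample documents. It is not the method described in the application.

```python
import math
import re
from collections import Counter


def extract_features(text):
    """Assumed feature extractor: counts of lowercase alphabetic words."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def to_vectors(feature_sets):
    """Build one vector per document over a shared, sorted vocabulary."""
    vocab = sorted(set().union(*feature_sets))
    return [[float(f[term]) for term in vocab] for f in feature_sets]


def cosine_distance(u, v):
    """Assumed distance measure: 1 minus cosine similarity."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 if norm == 0 else 1.0 - dot / norm


def cluster(vectors, threshold=0.5):
    """Single-linkage grouping: documents whose distance is below the
    (hypothetical) threshold end up with the same cluster label."""
    parent = list(range(len(vectors)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if cosine_distance(vectors[i], vectors[j]) < threshold:
                parent[find(j)] = find(i)
    return [find(i) for i in range(len(vectors))]


# Invented sample documents: two invoice-like texts and two letter-like texts.
docs = [
    "Invoice number 1001 total amount due",
    "Invoice number 1002 total amount due now",
    "Dear Sir, thank you for your letter",
    "Dear Madam, thank you for your kind letter",
]
features = [extract_features(d) for d in docs]
vectors = to_vectors(features)
labels = cluster(vectors)
print(labels)
```

Documents that receive the same label form one cluster, which under the claim would be read as a candidate common document type. The threshold controls how aggressively documents are merged, and a real system would likely use richer features (layout, fields, metadata) than word counts.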
