×

Methods and apparatuses for classifying electronic documents

  • US 7,890,441 B2
  • Filed: 04/14/2009
  • Issued: 02/15/2011
  • Est. Priority Date: 11/03/2003
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing a training set of electronic documents for document processing, the method implemented by computer instructions executing on a computer processor, said method comprising:

  • receiving said training set of electronic documents that are each assigned to two or more categories;

    determining a first set of frequencies with which a set of document features appear in the training set of electronic documents;

    determining a second set of frequencies with which the set of document features appear in each of the two or more categories of training the set of electronic documents;

    selecting a subset of said set of document features for defining a multi-dimensional vector space for processing documents, said subset of document features selected from said set of document features based upon said first set of frequencies and said second set of frequencies; and

    reducing each electronic document of the training set of electronic documents to a multi-dimensional vector in the multi-dimensional vector space.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×