×

DATA CLASSIFICATION USING MACHINE LEARNING TECHNIQUES

  • US 20110145178A1
  • Filed: 02/23/2011
  • Published: 06/16/2011
  • Est. Priority Date: 07/12/2006
  • Status: Active Grant
First Claim
Patent Images

1. An article of manufacture comprising:

  • a program storage medium readable by a computer, where the medium tangibly embodies one or more programs of instructions executable by a computer to perform a method of data classification, the one or more programs of instructions comprising;

    instructions for receiving at least one labeled seed document;

    instructions for receiving unlabeled documents;

    instructions for receiving at least one predetermined cost factor;

    instructions for training a transductive classifier using the at least one predetermined cost factor, the at least one seed document, and the unlabeled documents;

    instructions for classifying the unlabeled documents having a confidence level above a predefined threshold into a plurality of categories using the classifier;

    instructions for reclassifying at least some of the categorized documents previously categorized by a different classifier into the categories using the classifier; and

    instructions for outputting identifiers of the categorized documents to at least one of a user, another system, and another process.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×