×

DATA CLASSIFICATION METHODS USING MACHINE LEARNING TECHNIQUES

  • US 20080086433A1
  • Filed: 05/23/2007
  • Published: 04/10/2008
  • Est. Priority Date: 07/12/2006
  • Status: Active Grant
First Claim
Patent Images

1. A method for adapting to a shift in document content, comprising:

  • receiving at least one labeled seed document;

    receiving unlabeled documents;

    receiving at least one predetermined cost factor;

    training a transductive classifier using the at least one predetermined cost factor, the at least one seed document, and the unlabeled documents;

    classifying the unlabeled documents having a confidence level above a predefined threshold into a plurality of categories using the classifier;

    reclassifying at least some of the categorized documents into the categories using the classifier; and

    outputting identifiers of the categorized documents to at least one of a user, another system, and another process.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×