×

Clustering based text classification

  • US 7,366,705 B2
  • Filed: 08/16/2004
  • Issued: 04/29/2008
  • Est. Priority Date: 04/15/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for text classification, the method comprising:

  • clustering text comprising labeled data and unlabeled data in view of the labeled data to generate one or more clusters;

    generating expanded labeled data as a function of the one or more clusters, the expanded label data comprising the labeled data and at least a portion of the unlabeled data;

    training one or more discriminative classifiers based on the expanded labeled data and remaining ones of the unlabeled data; and

    generating, using the one or more discriminative classifiers, classified text for information retrieval.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×