
TEXT CATEGORIZATION WITH KNOWLEDGE TRANSFER FROM HETEROGENEOUS DATASETS

  • US 20090171956A1
  • Filed: 10/10/2008
  • Published: 07/02/2009
  • Est. Priority Date: 10/11/2007
  • Status: Active Grant
First Claim

1. A computer-based method for classifying input text data by obtaining information from multiple datasets, the method comprising the steps of:

    receiving the input text data;

    accessing a plurality of heterogeneous datasets, the plurality of heterogeneous datasets each including text data;

    generating a set of features from the plurality of heterogeneous datasets, the set of features including one or more features from each of the plurality of heterogeneous datasets;

    selecting one or more classification features from the set of features; and

    generating an augmented input text data by combining the input text data and the one or more classification features; and

    applying a classifier to the augmented input text data to associate the input text data with a category.
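
The steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not say how features are generated, selected, or classified, so the co-occurrence scoring, the keyword-count classifier, the stopword list, and every function name below are hypothetical choices made only for illustration.

```python
from collections import Counter

# Assumption: a tiny stopword list; the claim does not mention one.
STOPWORDS = {"a", "the", "in", "of"}

def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [t for t in text.lower().split() if t not in STOPWORDS]

def generate_features(datasets):
    """Map each candidate feature (dataset_name, term) to the terms it co-occurs with."""
    features = {}
    for name, docs in datasets.items():
        for doc in docs:
            tokens = set(tokenize(doc))
            for term in tokens:
                features.setdefault((name, term), set()).update(tokens - {term})
    return features

def select_features(features, input_tokens, top_k=3):
    """Keep the top_k terms absent from the input that co-occur most with input tokens."""
    scored = []
    for (name, term), context in features.items():
        if term in input_tokens:
            continue
        overlap = len(context & input_tokens)
        if overlap > 0:
            scored.append((overlap, name, term))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda s: (-s[0], s[1], s[2]))
    return [term for _, _, term in scored[:top_k]]

def augment(input_text, selected):
    """Combine the input text with the selected classification features."""
    return input_text + " " + " ".join(selected)

def classify(text, category_keywords):
    """Stand-in classifier: pick the category whose keywords occur most often."""
    counts = Counter(tokenize(text))
    scores = {cat: sum(counts[k] for k in kws) for cat, kws in category_keywords.items()}
    return max(sorted(scores), key=scores.get)

def categorize(input_text, datasets, category_keywords, top_k=3):
    """Run the claimed steps end to end on one input text."""
    features = generate_features(datasets)
    selected = select_features(features, set(tokenize(input_text)), top_k)
    return classify(augment(input_text, selected), category_keywords)
```

In this sketch a short input such as "striker injured" carries no category keyword on its own; terms borrowed from the auxiliary datasets are what give the stand-in classifier something to match.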
