×

Text categorization with knowledge transfer from heterogeneous datasets

  • US 8,103,671 B2
  • Filed: 10/10/2008
  • Issued: 01/24/2012
  • Est. Priority Date: 10/11/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for classifying input text data by obtaining information from multiple datasets, the method comprising the steps of:

  • receiving, at a computer, the input text data;

    accessing, at the computer, a plurality of heterogeneous datasets, the plurality of heterogeneous datasets each including text data;

    generating, at the computer, a set of features from the plurality of heterogeneous datasets, the set of features including one or more features from each of the plurality of heterogeneous datasets;

    selecting, at the computer, one or more classification features from the set of features; and

    generating, at the computer, an augmented input text data by combining the input text data and the one or more classification features; and

    applying, at the computer, a classifier to the augmented input text data to associate the input text data with a category.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×