×

System and method for automatically classifying text

  • US 7,028,250 B2
  • Filed: 05/25/2001
  • Issued: 04/11/2006
  • Est. Priority Date: 05/25/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. In a system comprising perspectives and categories, each perspective including at least one category representative of that perspective, a computerized method for classifying at least one item across multiple perspectives, said computerized method comprising:

  • associating category features with each category, wherein each of said category features represents one of a plurality of tokens;

    producing a category vector for each category, wherein each category vector includes a weight corresponding to each category feature, said weight indicative of a degree of association between said category feature and said category;

    associating item features with each item, wherein each of said item features represents one of a plurality of tokens found in said item;

    producing a feature vector for each item, wherein each feature vector includes said item features with a count corresponding to each item feature, said count indicative of the number of times said item feature appears in said item;

    multiplying said category vector by said item vector to produce a plurality of category scores for each item; and

    for each perspective, across multiple perspectives, classifying an item into a category provided said category score exceeds a predetermined threshold.

View all claims
  • 25 Assignments
Timeline View
Assignment View
    ×
    ×