×

Method and apparatus for automatically determining salient features for object classification

  • US 6,938,025 B1
  • Filed: 09/25/2001
  • Issued: 08/30/2005
  • Est. Priority Date: 05/07/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for classifying one or more electronic documents, said method comprising:

  • extracting one or more unique features from a first content group of data objects representing a first group of electronic documents to form a first feature list;

    extracting one or more unique features from a second anti-content group of data objects representing a second group of electronic documents to form a second feature list;

    identifying those unique features of said first feature list that are not present in said second feature list;

    identifying those unique features of said first feature list that are also present in said second feature list;

    creating a ranked list of features by applying statistical differentiation between unique features of said first feature list and unique features of said second feature list, wherein those unique features of said first feature list that are not present in said second feature list are ranked higher within said ranked list as compared to those unique features of said first feature list that are also present in said second feature list;

    identifying a set of salient features from said ranked list of features, wherein the set of salient features distinguishes the first group of electronic documents from the second group of electronic documents; and

    classifying the first group of electronic documents and the second group of electronic documents based on the set of salient features.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×