×

Method and apparatus for measuring similarity among electronic documents

  • US 6,990,628 B1
  • Filed: 06/14/1999
  • Issued: 01/24/2006
  • Est. Priority Date: 06/14/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer implemented method of categorizing a plurality of new electronic documents into a set of categories, comprising the steps of:

  • establishing a plurality of training sets, wherein each training set is associated with a category and includes training documents that have been classified as belonging to said associated category;

    determining how strongly each document of said plurality of documents corresponds to each of said plurality of categories by determining similarity between said each document and the training documents that belong to the training set of said category; and

    wherein the step of determining similarity is performed using a matrix representing document similarity that is derived by combining two or more measures of document similarity.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×