×

Classification method and apparatus

  • US 6,976,207 B1
  • Filed: 04/27/2000
  • Issued: 12/13/2005
  • Est. Priority Date: 04/28/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for building a classification model for classifying unclassified documents based on the classification of a plurality of documents which respectively have been classified as belonging to one of a plurality of classes, said documents being digitally represented in a computer, said documents respectively comprising a plurality of terms which respectively comprise one or more symbols of a finite set of symbols, and said method comprising the following steps:

  • processing said plurality of documents according to a plurality of classification schemes;

    representing each of said plurality of documents in each of said classification schemes by a vector of n dimensions, said n dimensions forming a vector space, whereas the value of each dimension of said vector corresponds to the frequency of occurrence of a certain term in the document corresponding to said vector, so that said n dimensions span up a vector space;

    representing the classification of said already classified documents into a plurality of classes corresponding to respective ones of said plurality of classification schemes by, for each of said classification schemes, separating said vector space into a plurality of subspaces by one or more hyperplanes, such that each subspace comprises one or more documents as represented by their corresponding vectors in said vector space, so that said each subspace corresponds to a class defined by a corresponding one of said plurality of classification schemes.

View all claims
  • 13 Assignments
Timeline View
Assignment View
    ×
    ×