×

Method and system for text mining using multidimensional subspaces

  • US 6,611,825 B1
  • Filed: 06/09/1999
  • Issued: 08/26/2003
  • Est. Priority Date: 06/09/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of representing a document collection, wherein the document collection comprises a plurality of documents, with each document comprising a plurality of terms, using a:

  • subspace projection based on a distribution of the frequency of occurrences of each of the terms in each of the documents, the method comprising;

    (a) constructing a term frequency matrix, wherein each entry of the term frequency matrix is the frequency of occurrence of one of the terms in one of the documents;

    (b) determining a statistical transformation policy;

    (c) statistically transforming the entries of the term frequency matrix according to the statistical transformation policy;

    (d) determining a projection type;

    (e) determining a lower dimensional subspace; and

    (f) generating an original term subspace by projecting the projection type into the lower dimensional subspace.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×