×

Concept decomposition using clustering

  • US 6,560,597 B1
  • Filed: 03/21/2000
  • Issued: 05/06/2003
  • Est. Priority Date: 03/21/2000
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of operating a computer system to represent text documents stored in a database collection, comprising:

  • representing the text documents in a vector representation format in which there are n documents and d words;

    normalizing the document vectors;

    determining an initial partitioning of the normalized document vectors comprising a set of k disjoint clusters and determining k cluster vectors, wherein a cluster vector comprises a mean vector of all the normalized document vectors in a partition;

    computing a set of K concept vectors based on the initial set of cluster vectors, wherein the concept vectors define a subspace of the document vector space and wherein the subspace spans a part of the document vector space; and

    projecting each document vector onto the subspace defined by the concept vectors, thereby defining a set of document concept decomposition vectors that represent the document vector space, with a reduced dimensionality.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×