Latent semantic clustering

  • US 20060242140A1
  • Filed: 05/11/2006
  • Published: 10/26/2006
  • Est. Priority Date: 04/26/2005
  • Status: Active Grant
First Claim

1. A computer-based method for automatically identifying clusters of conceptually-related documents in a collection of documents, comprising:

  • (a) generating a document-representation of each document in an abstract mathematical space;

  • (b) identifying a plurality of document clusters in the collection of documents based on a conceptual similarity between respective pairs of the document-representations, wherein each document cluster is associated with an exemplary document and a plurality of other documents; and

  • (c) identifying a non-intersecting document cluster from among the plurality of document clusters based on (i) a conceptual similarity between the document-representation of the exemplary document and the document-representation of each document in the non-intersecting cluster and (ii) a conceptual dissimilarity between a cluster-representation of the non-intersecting document cluster and a cluster-representation of each other document cluster.
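The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and the claim does not prescribe these choices: the sketch assumes a truncated SVD of a term-document matrix as the "abstract mathematical space", cosine similarity as the measure of conceptual similarity, a normalized centroid as the cluster-representation, and a greedy pass for selecting non-intersecting clusters. The function names (`lsi_vectors`, `candidate_clusters`, `non_intersecting`), the thresholds, and the toy matrix are all hypothetical.

```python
import numpy as np


def unit(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    n = np.linalg.norm(v)
    return v if n == 0 else v / n


def lsi_vectors(term_doc, k):
    """Step (a): map each document (a column) to a k-dimensional latent vector."""
    _, s, vt = np.linalg.svd(term_doc, full_matrices=False)
    docs = (s[:k, None] * vt[:k]).T  # one row per document
    return np.array([unit(d) for d in docs])


def candidate_clusters(vecs, sim_threshold):
    """Step (b): treat every document as an exemplar and collect the
    documents whose cosine similarity to it meets the threshold."""
    sims = vecs @ vecs.T
    clusters = {}
    for exemplar in range(len(vecs)):
        members = np.flatnonzero(sims[exemplar] >= sim_threshold)
        if members.size > 1:  # an exemplar plus at least one other document
            clusters[exemplar] = members
    return clusters


def non_intersecting(vecs, clusters, dissim_threshold):
    """Step (c): keep a cluster only if its centroid is dissimilar to the
    centroid of every cluster kept so far (assumed greedy selection).
    Condition (i) of the claim already holds by construction in step (b)."""
    def centroid(members):
        return unit(vecs[members].mean(axis=0))

    kept = {}
    # Largest clusters first; ties keep their original exemplar order.
    for exemplar, members in sorted(clusters.items(), key=lambda kv: -len(kv[1])):
        c = centroid(members)
        if all(c @ centroid(m) < dissim_threshold for m in kept.values()):
            kept[exemplar] = members
    return kept


# Toy term-document matrix (rows: terms, columns: documents).
# Documents 0-2 share one vocabulary, documents 3-5 another.
term_doc = np.array([
    [2, 1, 1, 0, 0, 0],
    [1, 2, 1, 0, 0, 0],
    [0, 0, 0, 3, 1, 1],
    [0, 0, 0, 1, 3, 1],
], dtype=float)

vecs = lsi_vectors(term_doc, k=2)
kept = non_intersecting(vecs, candidate_clusters(vecs, 0.8), 0.5)
for exemplar, members in kept.items():
    print(exemplar, members.tolist())
```

On this toy input, step (b) produces six overlapping candidate clusters (one per exemplar), and step (c) reduces them to two, one per topic. Both thresholds are arbitrary illustration values.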

  • 4 Assignments