Suffix Tree Similarity Measure for Document Clustering

  • US 20090307213A1
  • Filed: 05/06/2009
  • Published: 12/10/2009
  • Est. Priority Date: 05/07/2008
  • Status: Active Grant
First Claim

1. A computer system comprising at least one memory having stored therein computer executable components for facilitating document similarity measure and a processor that executes the computer executable components, the computer executable components comprising:

  • a mapping component to map a suffix tree document model to a vector document model, wherein the vector document model is a vector with M elements, and M is the total number of nodes in the suffix tree document model;
  • a weighting component to weight elements of the mapped vector document model; and
  • a similarity component to determine the similarity between two or more weighted vector document models.
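
The three components in the claim can be illustrated with a minimal Python sketch. This is not the patented implementation, and several choices are assumptions for illustration only: an uncompacted word-level suffix trie stands in for the suffix tree (so M is larger than it would be in a compact tree), the weighting is a tf-idf variant, and the similarity is cosine similarity. The claim itself names none of these. The function names `build_suffix_nodes`, `weighted_vectors`, and `cosine` are hypothetical.

```python
import math
from collections import defaultdict


def build_suffix_nodes(docs):
    """Word-level suffix trie over all documents (stand-in for a suffix tree).

    Each distinct word sequence is one node. Returns
    {phrase_tuple: {doc_id: occurrence_count}}.
    """
    nodes = defaultdict(lambda: defaultdict(int))
    for doc_id, text in enumerate(docs):
        words = text.lower().split()
        # Inserting every suffix visits one node per prefix of that suffix.
        for start in range(len(words)):
            for end in range(start + 1, len(words) + 1):
                nodes[tuple(words[start:end])][doc_id] += 1
    return nodes


def weighted_vectors(docs):
    """Map each document to a weighted vector with M elements, one per node."""
    nodes = build_suffix_nodes(docs)
    order = sorted(nodes)  # fixed node order; M == len(order)
    n_docs = len(docs)
    vectors = []
    for doc_id in range(n_docs):
        vec = []
        for phrase in order:
            tf = nodes[phrase].get(doc_id, 0)  # traversals by this document
            df = len(nodes[phrase])            # documents passing through node
            # Assumed tf-idf variant; the claim does not fix a formula.
            weight = (1 + math.log(tf)) * math.log(1 + n_docs / df) if tf else 0.0
            vec.append(weight)
        vectors.append(vec)
    return order, vectors


def cosine(u, v):
    """Cosine similarity between two equal-length weighted vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


if __name__ == "__main__":
    docs = ["cat ate cheese", "mouse ate cheese too", "cat ate mouse too"]
    order, vecs = weighted_vectors(docs)
    print("M =", len(order))
    print("sim(d0, d1) =", cosine(vecs[0], vecs[1]))
```

Because every node of the trie contributes one vector element, shared multi-word phrases raise the similarity score in addition to shared single words, which is the point of mapping from a suffix-based model rather than from a bag of words.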
