×

Suffix tree similarity measure for document clustering

  • US 8,676,815 B2
  • Filed: 05/06/2009
  • Issued: 03/18/2014
  • Est. Priority Date: 05/07/2008
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • at least one memory having stored therein computer executable instructions; and

    a processor, coupled to the at least one memory, configured to execute or facilitate execution of the computer executable instructions to at least;

    create a suffix tree document model that is a representation of a plurality of documents;

    convert the suffix tree document model into a vector document model that is a representation of a document of the plurality of documents to form the suffix tree document model converted into the vector document model, wherein the vector document model is a vector with M elements and M is a total number of nodes in the suffix tree document model;

    weight elements of the suffix tree document model converted into the vector document model; and

    determine a similarity between two or more weighted vector document models, each representing a respective document of the plurality of documents.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×