×

System And Method For Clustering Unstructured Documents

  • US 20080104063A1
  • Filed: 12/24/2007
  • Published: 05/01/2008
  • Est. Priority Date: 08/31/2001
  • Status: Active Grant
First Claim
Patent Images

1. A system for clustering unstructured documents, comprising:

  • a selection module to select documents having terms with frequencies of occurrence that satisfy upper and lower edge conditions;

    a concept module to generate concepts for the selected documents; and

    a cluster module to group the selected documents into clusters of the documents, comprising;

    an evaluation module to evaluate a weight for each of the clusters;

    a determination module to determine a similarity value from the frequencies of occurrence for at least one of the terms from the concepts and the cluster weights for each selected document; and

    an assignment module to assign each selected document into one such cluster based on the similarity value of the selected document.

View all claims
  • 12 Assignments
Timeline View
Assignment View
    ×
    ×