Computer-implemented system and method for generating a reference set via clustering

  • US 9,336,496 B2
  • Filed: 12/16/2013
  • Issued: 05/10/2016
  • Est. Priority Date: 08/24/2009
  • Status: Active Grant
First Claim

1. A computer-implemented method for generating a reference set via clustering, comprising the steps of:

  • obtaining a collection of unclassified documents;

    grouping the unclassified documents into clusters;

    selecting n-documents from each cluster, comprising:

      building a hierarchical tree of the clusters; and

      traversing the hierarchical tree to identify the n-documents, wherein one of the n-documents from each cluster is located closest to a center of that cluster;

    combining the selected n-documents as reference set candidates;

    assigning a classification code to each of the reference set candidates; and

    grouping two or more of the reference set candidates as a reference set of classified documents, wherein the steps are performed by a suitably programmed computer.
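
The selection and combination steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes each document is already represented as a numeric feature vector and already grouped into clusters, and it substitutes a plain ranking by Euclidean distance to the cluster centroid for the hierarchical tree traversal that the claim recites. The function names, the toy data, and the classification codes are all hypothetical.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    count = len(vectors)
    return [sum(component) / count for component in zip(*vectors)]


def select_n_documents(cluster, n):
    """Return up to n document ids from one cluster, nearest the center first.

    Assumption: a simple distance ranking stands in for the claim's
    hierarchical tree traversal.
    """
    center = centroid([vector for _, vector in cluster])
    ranked = sorted(cluster, key=lambda doc: dist(doc[1], center))
    return [doc_id for doc_id, _ in ranked[:n]]


def reference_set_candidates(clusters, n):
    """Combine the n selected documents of every cluster into one list."""
    candidates = []
    for cluster in clusters:
        candidates.extend(select_n_documents(cluster, n))
    return candidates


# Hypothetical toy data: (document id, feature vector) pairs, already grouped.
clusters = [
    [("a", [0.0, 0.0]), ("b", [1.0, 0.0]), ("c", [5.0, 0.0])],
    [("d", [10.0, 10.0]), ("e", [10.0, 13.0]), ("f", [10.0, 11.0])],
]
candidates = reference_set_candidates(clusters, n=2)

# Hypothetical classification codes standing in for a reviewer's decisions.
codes = {
    "a": "non-responsive",
    "b": "responsive",
    "d": "non-responsive",
    "f": "responsive",
}
reference_set = {doc_id: codes[doc_id] for doc_id in candidates}
```

With `n=2`, the first document picked from each cluster is the one nearest that cluster's centroid, which mirrors the claim's requirement that one of the n-documents be located closest to the cluster center.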
