Methods and apparatus for interactive document clustering

  • US 20090287668A1
  • Filed: 05/16/2008
  • Published: 11/19/2009
  • Est. Priority Date: 05/16/2008
  • Status: Abandoned Application
First Claim

1. A computerized method for forming clusters of documents from among a set of documents, the method comprising:

  • (a) identifying a plurality of seed candidate documents;

    (b) generating candidate probes based upon the seed candidate documents, the candidate probes each comprising one or more features from the seed candidate documents;

    (c) displaying information regarding the candidate probes to a user;

    (d) receiving user input regarding the candidate probes and defining a set of probes from which to form clusters of documents based upon the user input regarding the candidate probes;

    (e) selecting a probe and forming a cluster of documents from among available documents of the set of documents using the probe, wherein forming the cluster of documents comprises finding documents that satisfy a similarity condition relative to the probe and associating some or all of the documents that satisfy the similarity condition with a particular cluster of documents; and

    (f) repeating step (e) using another probe as the probe and using another similarity condition as the similarity condition until a halting condition is satisfied to form at least one other cluster of documents, wherein those documents of the set of documents previously associated with a cluster of documents are not included among the available documents.
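
Steps (a) through (f) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation. The claim does not specify how features are chosen, how similarity is measured, or what the halting condition is. Here a probe is assumed to be the most frequent words of a seed document, similarity is assumed to be the fraction of probe features found in a document, and iteration halts when probes or available documents run out or an optional cluster limit is reached. The claim allows associating "some or all" of the matching documents with a cluster; this sketch associates all of them. All function and parameter names (`make_probe`, `select_probes`, `form_clusters`, `accept`, `max_clusters`) are hypothetical.

```python
from collections import Counter


def make_probe(seed_text, k=2):
    """Step (b): build a candidate probe from a seed document.

    Assumption: the probe's features are the k most frequent words.
    """
    counts = Counter(seed_text.lower().split())
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return frozenset(word for word, _ in ranked[:k])


def describe_probe(probe):
    """Step (c): text that could be displayed to the user for a probe."""
    return ", ".join(sorted(probe))


def select_probes(candidates, accept):
    """Step (d): keep the candidate probes the user approves.

    `accept` is a callable standing in for real user input.
    """
    return [probe for probe in candidates if accept(probe)]


def similarity(probe, text):
    """Assumed similarity: fraction of probe features present in the text."""
    if not probe:
        return 0.0
    words = set(text.lower().split())
    return len(probe & words) / len(probe)


def form_clusters(docs, probes, thresholds, max_clusters=None):
    """Steps (e) and (f): form one cluster per probe.

    docs maps a document id to its text. Each probe is paired with its own
    threshold, so the similarity condition can differ between iterations.
    Documents already placed in a cluster are removed from the available pool.
    """
    available = dict(docs)
    clusters = []
    for probe, threshold in zip(probes, thresholds):
        # Assumed halting condition: nothing left to cluster, or limit reached.
        if not available:
            break
        if max_clusters is not None and len(clusters) >= max_clusters:
            break
        members = [
            doc_id
            for doc_id, text in available.items()
            if similarity(probe, text) >= threshold
        ]
        clusters.append((probe, members))
        for doc_id in members:
            del available[doc_id]
    return clusters, sorted(available)
```

Because clustered documents are deleted from `available` before the next probe is applied, a document that matches several probes ends up only in the cluster formed by the first of them, which mirrors the final "wherein" clause of step (f).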
