×

Method and system for using keywords to merge document clusters

  • US 8,843,494 B1
  • Filed: 04/23/2013
  • Issued: 09/23/2014
  • Est. Priority Date: 03/28/2012
  • Status: Active Grant
First Claim
Patent Images

1. A system for using keywords to merge document clusters, the system comprising:

  • one or more processors; and

    a non-transitory computer readable medium storing a plurality of instructions, which when executed, cause the one or more processors to;

    distribute a plurality of documents into a plurality of document clusters, wherein the plurality of document clusters comprise a first document cluster comprising a first plurality of documents and a second document cluster comprising a second plurality of documents;

    create a template associated with the first document cluster, wherein the template comprises a plurality of keywords associated with at least most of the first plurality of documents;

    calculate a distance between keyword location information associated with the template and word location information associated with a document in the second document cluster, wherein the keyword location information comprises information indicating a location of a keyword in the template relative to other keywords in the template, and wherein the word location information comprises information indicating a location of a word in the document relative to other words in the document;

    determine whether the distance is less than a threshold value; and

    merge the second document cluster with the first document cluster in response to a determination that the distance is less than the threshold value.

View all claims
  • 12 Assignments
Timeline View
Assignment View
    ×
    ×