×

TECHNIQUES FOR COMPARING AND CLUSTERING DOCUMENTS

  • US 20130013612A1
  • Filed: 07/07/2011
  • Published: 01/10/2013
  • Est. Priority Date: 07/07/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method for analyzing documents, the method comprising:

  • importing into a database a plurality of documents and/or document portions, at least some of the documents and/or document portions being structured and at least some of the documents and/or document portions being unstructured;

    organizing the imported documents and/or document portions into one or more collections;

    receiving a selection of at least one of said one or more collections;

    building an index of words and/or groups of words based on each said document or document portion in each said selection;

    building a document-word matrix including a value indicative of a number of times each said word and/or group of words in the index of words and/or groups of words appears in each said document or document portion in each said selection; and

    generating, via at least one processor, one or more clusters of documents using the document-word matrix.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×