×

COMPOSITE LOCALITY SENSITIVE HASH BASED PROCESSING OF DOCUMENTS

  • US 20110087669A1
  • Filed: 05/21/2010
  • Published: 04/14/2011
  • Est. Priority Date: 10/09/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method of analyzing documents belonging to a corpus, the method comprising:

  • computing a composite hash value for a current document from the corpus;

    determining whether a previous document having the same composite hash value as the current document has been analyzed;

    in the event that a previous document having the same composite hash value as the current document has not been analyzed, analyzing the current document, wherein analyzing the current document includes determining one or more items of analytic metadata to be associated with the current document;

    in the event that a previous document having the same composite hash value as the current document has been analyzed, associating the current document with one or more items of analytic metadata determined from analyzing the previous document; and

    storing a representation of the association between the one or more items of analytic metadata and the current document.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×