×

Method and apparatus for a character-based comparison of documents

  • US 20050132197A1
  • Filed: 05/13/2004
  • Published: 06/16/2005
  • Est. Priority Date: 05/15/2003
  • Status: Abandoned Application
First Claim
Patent Images

1. A method comprising:

  • dividing a first document into a plurality of tokens, each token including a predefined number of sequential characters from the first document;

    calculating a plurality of hash values for the plurality of tokens; and

    creating, for the first document, a signature including a subset of hash values from the plurality of hash values and additional information pertaining to the plurality of tokens of the first document, the signature of the first document being subsequently compared with a signature of a second document to determine resemblance between the first document and the second document.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×