×

Method and apparatus for duplicate detection

  • US 7,899,825 B2
  • Filed: 04/04/2008
  • Issued: 03/01/2011
  • Est. Priority Date: 06/27/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer readable storage medium, comprising executable instructions to:

  • compute a first measure of similarity between a first document and a reference document, wherein the first measure of similarity is a scalar or a multi-dimensional indication of distance between the first document and the reference document;

    compute a second measure of similarity between a second document and the reference document, wherein the second measure of similarity is a scalar or a multi-dimensional indication of distance between the second document and the reference document;

    compare the first measure of similarity and the second measure of similarity through triangulation to identify a non-exact similarity match between the first document and the second document; and

    perform a direct comparison of the first document and the second document in response to the identified non-exact similarity match to compute a third measure of similarity, wherein the third measure of similarity is a scalar or a multi-dimensional indication of distance between the first document and the second document, and the third measure of similarity has a finer granularity than the first measure of similarity and the second measure of similarity.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×