Enterprise Data Duplication Identification

  • US 20120059827A1
  • Filed: 09/02/2010
  • Published: 03/08/2012
  • Est. Priority Date: 09/02/2010
  • Status: Active Grant
First Claim

1. A computer-implemented method for identifying duplicate data, the method comprising the steps, performed by a computer, of:

    identifying one or more reference fields that include one or more data values;

    retrieving the reference fields;

    generating one or more reference fingerprint patterns;

    transforming the reference fields into the one or more reference fingerprint patterns;

    identifying one or more target fields that include one or more data values;

    retrieving the target fields;

    transforming the target fields into the one or more target fingerprint patterns;

    comparing the one or more target fingerprint patterns with the one or more reference fingerprint patterns; and

    determining an overlap between the one or more target fingerprint patterns and the one or more reference fingerprint patterns.
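
The steps of claim 1 can be illustrated with a minimal sketch. The claim does not say how a fingerprint pattern is built or how overlap is measured, so everything below is an assumption made for illustration: a field's fingerprint pattern is taken to be the set of SHA-256 hashes of its normalized values, and overlap is the fraction of a target field's fingerprints that also appear in a reference field. The names fingerprint, field_to_pattern, overlap_ratio, and find_overlaps are hypothetical and do not come from the patent.

```python
import hashlib


def fingerprint(value):
    """Hash one data value after normalizing case and whitespace (assumed scheme)."""
    normalized = " ".join(str(value).lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def field_to_pattern(values):
    """Transform a field (a list of data values) into a fingerprint pattern.

    Assumption: the pattern is the set of fingerprints of the field's values.
    """
    return {fingerprint(v) for v in values}


def overlap_ratio(target_pattern, reference_pattern):
    """Fraction of target fingerprints that also occur in the reference pattern."""
    if not target_pattern:
        return 0.0
    return len(target_pattern & reference_pattern) / len(target_pattern)


def find_overlaps(reference_fields, target_fields):
    """Compare every target field against every reference field.

    Both arguments map a field name to its list of data values.
    Returns a dict keyed by (target_name, reference_name).
    """
    reference_patterns = {
        name: field_to_pattern(values) for name, values in reference_fields.items()
    }
    target_patterns = {
        name: field_to_pattern(values) for name, values in target_fields.items()
    }
    return {
        (t_name, r_name): overlap_ratio(t_pattern, r_pattern)
        for t_name, t_pattern in target_patterns.items()
        for r_name, r_pattern in reference_patterns.items()
    }


# Hypothetical sample data: one reference field and one target field.
reference_fields = {
    "customer_email": ["a@example.com", "B@Example.com ", "c@example.com"],
}
target_fields = {
    "contact": ["b@example.com", "d@example.com"],
}

overlaps = find_overlaps(reference_fields, target_fields)
print(overlaps)
```

In this sample, one of the two target values matches a reference value after normalization, so the reported overlap for the pair is 0.5. Hashing exact normalized values only detects exact duplicates. A pattern that tolerates near-duplicates would need a different fingerprint scheme, which the claim text leaves open.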
