Enterprise data duplication identification

  • US 8,429,137 B2
  • Filed: 09/02/2010
  • Issued: 04/23/2013
  • Est. Priority Date: 09/02/2010
  • Status: Active Grant
First Claim

1. A computer-implemented method for identifying duplicate data, the method comprising the steps, performed by a computer, of:

    identifying one or more reference fields that include one or more data values;

    retrieving the reference fields;

    generating one or more reference fingerprint patterns;

    transforming the reference fields into the one or more reference fingerprint patterns;

    identifying one or more target fields that include one or more data values;

    retrieving the target fields;

    generating one or more target fingerprint patterns;

    transforming the target fields into the one or more target fingerprint patterns;

    comparing the one or more target fingerprint patterns with the one or more reference fingerprint patterns; and

    determining an overlap between the one or more target fingerprint patterns and the one or more reference fingerprint patterns to identify duplicate data, wherein the one or more reference fingerprint patterns and one or more target fingerprint patterns include one or more letters and one or more numbers.
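The steps recited in claim 1 can be illustrated with a minimal sketch. The claim does not say how a fingerprint pattern is computed, so everything specific here is an assumption made for illustration: the use of a SHA-256 hex digest (a string of letters and numbers) as the fingerprint, the case and whitespace normalization, the overlap ratio, and the hypothetical names `fingerprint`, `fingerprint_field`, and `overlap`. It is not the patented implementation.

```python
import hashlib


def fingerprint(value: str) -> str:
    """Transform one data value into an alphanumeric fingerprint.

    Assumption: values are normalized (lowercased, whitespace collapsed)
    and hashed with SHA-256. The claim only requires that the pattern
    contain letters and numbers; it does not name a hash function.
    """
    normalized = " ".join(value.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def fingerprint_field(values):
    """Transform every data value of a field into a set of fingerprints."""
    return {fingerprint(v) for v in values}


def overlap(target_fps, reference_fps):
    """Fraction of target fingerprints that also occur in the reference.

    Assumption: overlap is measured relative to the target field; other
    measures (for example Jaccard similarity) would fit the claim too.
    """
    if not target_fps:
        return 0.0
    return len(target_fps & reference_fps) / len(target_fps)


# Hypothetical reference and target fields (invented sample data).
reference_values = ["Alice Smith", "Bob Jones", "Carol White"]
target_values = ["alice smith ", "BOB JONES", "Dave Black", "Erin Green"]

reference_fps = fingerprint_field(reference_values)
target_fps = fingerprint_field(target_values)

# Two of the four target values match a reference value after normalization.
score = overlap(target_fps, reference_fps)
print(score)  # 0.5
```

Comparing fingerprints rather than raw values means the two fields can be checked for duplication without exchanging the underlying data, and a threshold on `score` (again an assumption) could flag a target field as a likely duplicate of the reference field.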
