Apparatus and method for data matching and anonymization
Abstract
A method includes receiving a plurality of data sets. Each data set includes a customer identifier field specifying a unique customer identifier associated with each entry in each data set. The plurality of data sets includes a first group of data sets and a second group of data sets. The method further includes storing the plurality of data sets, and generating a key map including the customer identifier field including unique customer identifiers of the first group of data sets of the plurality of data sets, and an anonymous identifier field including unique anonymous identifiers. Each anonymous identifier corresponds to a customer identifier of the key map. The method further includes replacing each unique customer identifier in the second group of data sets with the corresponding anonymous identifier.
17 Claims
1. A method, comprising:
receiving a plurality of data sets, each data set including a customer identifier field, the customer identifier field specifying a unique customer identifier associated with each entry in each data set, the plurality of data sets including a first group of data sets and a second group of data sets, each data set associable with each other data set based on the customer identifier;
storing the plurality of data sets;
generating a key map including the customer identifier field and an anonymous identifier field, the customer identifier field for the key map including unique customer identifiers of the first group of data sets of the plurality of data sets, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a customer identifier of the key map, each anonymous identifier different from its corresponding customer identifier;
storing the key map;
rendering each data set of the second group of data sets as unassociable with each data set of the first group of data sets by replacing, based on the key map, each unique customer identifier in the second group of data sets with the corresponding anonymous identifier, further including rendering each data set of the second group of data sets associable with each other data set of the second group of data sets based on the anonymous identifier; and
deleting the key map.
Dependent claims: 2, 3, 4, 5
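The steps of claim 1 can be illustrated with a minimal Python sketch. It is not taken from the patent: the field name `customer_id`, the helper names, the in-memory dictionary used as the key map, and the use of `secrets.token_hex` to produce anonymous identifiers are all assumptions made for illustration. The sketch also assumes every customer identifier in the second group appears in the first group, and it returns anonymized copies rather than overwriting the stored data sets.

```python
import secrets

ID_FIELD = "customer_id"  # hypothetical name for the customer identifier field


def generate_key_map(first_group):
    """Map each unique customer identifier in the first group to a unique anonymous identifier."""
    key_map = {}
    for data_set in first_group:
        for entry in data_set:
            customer_id = entry[ID_FIELD]
            if customer_id in key_map:
                continue
            anonymous_id = secrets.token_hex(16)
            # Regenerate if the value equals the customer identifier or is already in use.
            while anonymous_id == customer_id or anonymous_id in key_map.values():
                anonymous_id = secrets.token_hex(16)
            key_map[customer_id] = anonymous_id
    return key_map


def replace_identifiers(data_set, key_map):
    """Return a copy of data_set with each customer identifier replaced via the key map."""
    return [{**entry, ID_FIELD: key_map[entry[ID_FIELD]]} for entry in data_set]


def anonymize_second_group(first_group, second_group):
    key_map = generate_key_map(first_group)  # held in memory here (assumption)
    anonymized = [replace_identifiers(ds, key_map) for ds in second_group]
    key_map.clear()  # delete the key map so the mapping cannot be reversed
    return anonymized


# Illustrative data only.
profiles = [{"customer_id": "C1", "segment": "a"}, {"customer_id": "C2", "segment": "b"}]
clicks = [{"customer_id": "C1", "page": "home"}]
visits = [{"customer_id": "C1", "store": 7}, {"customer_id": "C2", "store": 9}]

anon_clicks, anon_visits = anonymize_second_group([profiles], [clicks, visits])
```

After the call, `anon_clicks` and `anon_visits` share anonymous identifiers with each other, so they remain associable within the second group, while neither can be joined back to `profiles` once the key map is gone.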
6. A method, comprising:
receiving a reference data set including a customer identifier field, the customer identifier field of the reference data set specifying a unique customer identifier associated with each entry in the reference data set;
receiving a first data set including the customer identifier field, the customer identifier field for the first data set specifying a unique customer identifier associated with each entry in the first data set, the first data set associable with the reference data set based on the customer identifier field;
storing the reference data set and the first data set;
rendering the first data set unassociable with the reference data set by replacing each unique customer identifier of the second data set with a corresponding anonymous identifier based on a key map, the key map including the customer identifier field and an anonymous identifier field, the customer identifier field for the key map including the unique customer identifiers of the second data set, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a customer identifier of the key map, each anonymous identifier different from its corresponding customer identifier;
receiving one or more additional data sets, each additional data set including the customer identifier field, the one or more additional data sets associable with the reference data set based on the customer identifier field and unassociable with the first data set;
rendering the one or more additional data sets as unassociable with the reference data set by replacing each unique customer identifier in each additional data set with its corresponding anonymous identifier, based on the key map, the rendering including rendering the one or more additional data sets as associable with the first data set based on the anonymous identifier field; and
deleting the key map.
Dependent claims: 7, 8, 9, 10, 11, 12
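Claim 6 differs from claim 1 mainly in that the key map is kept while additional data sets arrive and is deleted only afterwards. A rough sketch of that lifecycle follows. The `KeyMapSession` class, its method names, and the `customer_id` field are hypothetical, and the sketch assumes the key map is built from the identifiers of the first data set and that later data sets contain only identifiers already in the key map.

```python
import secrets


class KeyMapSession:
    """Hypothetical helper: holds a key map while data sets arrive, then discards it."""

    def __init__(self, data_set, id_field="customer_id"):
        self.id_field = id_field
        self._key_map = {}
        for entry in data_set:
            self._assign(entry[id_field])

    def _assign(self, customer_id):
        if customer_id in self._key_map:
            return
        anonymous_id = secrets.token_hex(16)
        # Keep anonymous identifiers unique and different from the customer identifier.
        while anonymous_id == customer_id or anonymous_id in self._key_map.values():
            anonymous_id = secrets.token_hex(16)
        self._key_map[customer_id] = anonymous_id

    def replace(self, data_set):
        """Return a copy of data_set with customer identifiers swapped for anonymous ones."""
        if self._key_map is None:
            raise RuntimeError("key map has been deleted")
        f = self.id_field
        return [{**entry, f: self._key_map[entry[f]]} for entry in data_set]

    def delete(self):
        """Discard the key map; no further replacement is possible."""
        self._key_map = None


# Illustrative data only.
reference = [{"customer_id": "C1", "name": "Ada"}, {"customer_id": "C2", "name": "Bo"}]
first = [{"customer_id": "C1", "spend": 40}, {"customer_id": "C2", "spend": 15}]
later = [{"customer_id": "C2", "visits": 3}]  # an additional data set received afterwards

session = KeyMapSession(first)
first_anon = session.replace(first)
later_anon = session.replace(later)
session.delete()
```

Because both replacements use the same key map, `later_anon` can still be joined to `first_anon` on the anonymous identifier, while neither can be joined to `reference` after `session.delete()`.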
13. An apparatus, comprising:
a processor operatively coupled to a memory, the processor configured to execute a data module and a key module;
the data module configured to:
receive a plurality of data sets, each data set including a customer identifier field, the customer identifier field specifying a unique customer identifier associated with each entry in each data set, the plurality of data sets including a first group of data sets and a second group of data sets, each data set associable with each other data set based on the customer identifier; and
store the plurality of data sets; and
the key module configured to generate a key map including the customer identifier field and an anonymous identifier field, the customer identifier field for the key map including unique customer identifiers of the first group of data sets of the plurality of data sets, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a customer identifier of the key map, each anonymous identifier different from its corresponding customer identifier,
the data module further configured to store the key map,
the key module further configured to render each data set of the second group of data sets as unassociable with each data set of the first group of data sets by replacing, based on the key map, each unique customer identifier in the second group of data sets with the corresponding anonymous identifier, including rendering each data set of the second group of data sets associable with each other data set of the second group of data sets based on the anonymous identifier, and
the data module further configured to delete the key map.
Dependent claims: 14, 15, 16, 17
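The split of responsibilities between the two modules in claim 13 might look roughly like the following. This is only a sketch under stated assumptions: the class names mirror the claim language, but their methods, the naming of data sets by string keys, the `customer_id` field, and the use of `uuid.uuid4` for anonymous identifiers are all invented for illustration.

```python
import uuid


class DataModule:
    """Receives and stores data sets; stores and later deletes the key map."""

    def __init__(self):
        self.data_sets = {}
        self.key_map = None

    def receive(self, name, data_set):
        self.data_sets[name] = list(data_set)

    def store_key_map(self, key_map):
        self.key_map = dict(key_map)

    def delete_key_map(self):
        self.key_map = None


class KeyModule:
    """Generates the key map and applies it to the second group of data sets."""

    def generate_key_map(self, data_module, first_group):
        key_map = {}
        for name in first_group:
            for entry in data_module.data_sets[name]:
                cid = entry["customer_id"]
                while cid not in key_map:
                    candidate = uuid.uuid4().hex
                    # Accept only a value that is unused and differs from the customer identifier.
                    if candidate != cid and candidate not in key_map.values():
                        key_map[cid] = candidate
        return key_map

    def render(self, data_module, second_group):
        for name in second_group:
            data_module.data_sets[name] = [
                {**e, "customer_id": data_module.key_map[e["customer_id"]]}
                for e in data_module.data_sets[name]
            ]


# Illustrative data only.
data_module, key_module = DataModule(), KeyModule()
data_module.receive("profiles", [{"customer_id": "C1"}, {"customer_id": "C2"}])
data_module.receive("clicks", [{"customer_id": "C2", "page": "home"}])
data_module.receive("visits", [{"customer_id": "C2", "store": 9}])

data_module.store_key_map(key_module.generate_key_map(data_module, ["profiles"]))
key_module.render(data_module, ["clicks", "visits"])
data_module.delete_key_map()
```

Here the data module owns all storage, including the key map's lifetime, and the key module only computes and applies the mapping.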
Specification