Apparatus and method for data matching and anonymization
First Claim
1. A method for data matching and anonymization, comprising:
- executing instructions in a memory and processor of an apparatus in order to perform the following steps;
storing by the processor, in at least one of volatile or non-volatile storage, a plurality of data sets, each respective data set of the plurality of data sets including a respective identifier field, the plurality of data sets including a first data set and a second group of data sets, the first data set mapping unique identifiers in the respective identifier field to a first type of information comprising sensitive household information, the second group of data sets including a second data set mapping unique identifiers in the respective identifier field to a second type of information comprising transaction information and a third data set mapping unique identifiers in the respective identifier field to a third type of information comprising advertising information, wherein the first type of information is associable with the second type of information by matching unique identifiers in the first data set and second data set and associable with the third type of information by matching unique identifiers in the first data set and the third data set;
generating, by the processor, a key map including a key identifier field and an anonymous identifier field, the key identifier field for the key map including unique identifiers of the first data set of the plurality of data sets, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a unique identifier in the key identifier field of the key map, wherein each anonymous identifier is different from the unique identifier to which the anonymous identifier corresponds;
encrypting, by the processor, the key map with an encryption scheme;
storing, by the processor, the key map;
rendering, by the processor, each data set of the second group of data sets as unassociable with the first data set by replacing, based on the key map, each unique identifier in the respective identifier field of the second group of data sets with the anonymous identifier in the key map to which the unique identifier corresponds, wherein the second type of information is associable with the third type of information by matching anonymous identifiers in the respective identifier fields of the second data set and the third data set; and
executing, by the processor, an analytical query that correlates the second type of information in at least one entry of the second data set with the third type of information of at least one entry in the third data set by matching anonymous identifiers in respective identifier fields of the second data set and third data set, wherein the first type of information in the first dataset is unassociable, by the analytical query, with the second type of information and the third type of information based on the matching anonymous identifiers, wherein the first type of information, the second type of information, and the third type of information are distinct and different types of information.
2 Assignments
0 Petitions
Accused Products
Abstract
A method includes receiving a plurality of data sets. Each data set includes a customer identifier field specifying a unique customer identifier associated with each entry in each data set. The plurality of data sets includes a first group of data sets and a second group of data sets. The method further includes storing the plurality of data sets, and generating a key map including the customer identifier field including unique customer identifiers of the first group of data sets of the plurality of data sets, and an anonymous identifier field including unique anonymous identifiers. Each anonymous identifier corresponds to a customer identifier of the key map. The method further includes replacing each unique customer identifier in the second group of data sets with the corresponding anonymous identifier.
64 Citations
18 Claims
-
1. A method for data matching and anonymization, comprising:
-
executing instructions in a memory and processor of an apparatus in order to perform the following steps; storing by the processor, in at least one of volatile or non-volatile storage, a plurality of data sets, each respective data set of the plurality of data sets including a respective identifier field, the plurality of data sets including a first data set and a second group of data sets, the first data set mapping unique identifiers in the respective identifier field to a first type of information comprising sensitive household information, the second group of data sets including a second data set mapping unique identifiers in the respective identifier field to a second type of information comprising transaction information and a third data set mapping unique identifiers in the respective identifier field to a third type of information comprising advertising information, wherein the first type of information is associable with the second type of information by matching unique identifiers in the first data set and second data set and associable with the third type of information by matching unique identifiers in the first data set and the third data set; generating, by the processor, a key map including a key identifier field and an anonymous identifier field, the key identifier field for the key map including unique identifiers of the first data set of the plurality of data sets, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a unique identifier in the key identifier field of the key map, wherein each anonymous identifier is different from the unique identifier to which the anonymous identifier corresponds; encrypting, by the processor, the key map with an encryption scheme; storing, by the processor, the key map; rendering, by the processor, each data set of the second group of data sets as unassociable with the first data set by replacing, based on the key map, each unique identifier in the respective identifier field of the second group of data sets with the anonymous identifier in the key map to which the unique identifier corresponds, wherein the second type of information is associable with the third type of information by matching anonymous identifiers in the respective identifier fields of the second data set and the third data set; and executing, by the processor, an analytical query that correlates the second type of information in at least one entry of the second data set with the third type of information of at least one entry in the third data set by matching anonymous identifiers in respective identifier fields of the second data set and third data set, wherein the first type of information in the first dataset is unassociable, by the analytical query, with the second type of information and the third type of information based on the matching anonymous identifiers, wherein the first type of information, the second type of information, and the third type of information are distinct and different types of information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable medium, storing instructions, which when executed by one or more hardware processors, cause operations comprising:
-
storing, in at least one of volatile or non-volatile storage, a plurality of data sets, each respective data set of the plurality of data sets including a respective identifier field, the plurality of data sets including a first data set and a second group of data sets, the first data set mapping unique identifiers in the respective identifier field to a first type of information comprising sensitive household information, the second group of data sets including a second data set mapping unique identifiers in the respective identifier field to a second type of information comprising transaction information and a third data set mapping unique identifiers in the respective identifier field to a third type of information comprising advertising information, wherein the first type of information is associable with the second type of information by matching unique identifiers in the first data set and second data set and associable with the third type of information by matching unique identifiers in the first data set and the third data set; generating a key map including a key identifier field and an anonymous identifier field, the key identifier field for the key map including unique identifiers of the first data set of the plurality of data sets, the anonymous identifier field including unique anonymous identifiers, each anonymous identifier corresponding to a unique identifier in the key identifier field of the key map, wherein each anonymous identifier is different from the unique identifier to which the anonymous identifier corresponds; encrypting the key map with an encryption scheme; storing the key map; rendering each data set of the second group of data sets as unassociable with the first data set by replacing, based on the key map, each unique identifier in the respective identifier field of the second group of data sets with the anonymous identifier in the key map to which the unique identifier corresponds, wherein the second type of information is associable with the third type of information by matching anonymous identifiers in the respective identifier fields of the second data set and the third data set; and executing an analytical query that correlates the second type of information in at least one entry of the second data set with the third type of information of at least one entry in the third data set by matching anonymous identifiers in respective identifier fields of the second data set and third data set, wherein the first type of information in the first dataset is unassociable, by the analytical query, with the second type of information and the third type of information based on the matching anonymous identifiers, wherein the first type of information, the second type of information, and the third type of information are distinct and different types of information. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification