SYSTEMS AND METHODS FOR LINKING AND ANALYZING DATA FROM DISPARATE DATA SETS
First Claim
1. A method, comprising:
- receiving, by a processor, a plurality of disparate anonymized datasets originating from a plurality of different data sources, each anonymized dataset comprising de-identified data of individuals;
formatting, by the processor, the de-identified data of each of the plurality of the disparate anonymized datasets to provide a plurality of formatted anonymized datasets, each formatted anonymized dataset containing data entries for the de-identified individuals comprising a user unique identifier (UID), date data, time data, location data, and activity data;
linking, by the processor, the data entries of the de-identified individuals of the plurality of formatted datasets by matching at least the date data, time data, and location data;
analyzing the activity data of the linked data entries; and
generating, by the processor, at least one report based on the analysis.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for linking or matching data of disparate datasets and then performing business related data analysis. Consumer-related data of two or more disparate datasets are linked in a privacy-friendly manner, and then analyzed to provide business information and/or consumer information to clients. The linking and analysis is performed in a manner to protect personally identifiable information (PII) of the consumers. In an embodiment, a processor receives a plurality of disparate anonymized datasets originating from a plurality of different data sources, formats the de-identified data to provide a plurality of formatted anonymized datasets, and links the data entries of the de-identified individuals by matching at least date data, time data, and location data. The processor then analyzes the activity data of the linked data entries, and generates a report based on the analysis.
-
Citations
21 Claims
-
1. A method, comprising:
-
receiving, by a processor, a plurality of disparate anonymized datasets originating from a plurality of different data sources, each anonymized dataset comprising de-identified data of individuals; formatting, by the processor, the de-identified data of each of the plurality of the disparate anonymized datasets to provide a plurality of formatted anonymized datasets, each formatted anonymized dataset containing data entries for the de-identified individuals comprising a user unique identifier (UID), date data, time data, location data, and activity data; linking, by the processor, the data entries of the de-identified individuals of the plurality of formatted datasets by matching at least the date data, time data, and location data; analyzing the activity data of the linked data entries; and generating, by the processor, at least one report based on the analysis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An apparatus, comprising:
-
a processor; a communication device operably connected to the processor; and a storage device operably connected to the processor and storing instructions configured to cause the processor to; receive a plurality of disparate anonymized datasets originating from a plurality of different data sources, each anonymized dataset comprising de-identified data of individuals; format the de-identified data of each of the plurality of the disparate anonymized datasets to provide a plurality of formatted anonymized datasets, each formatted anonymized dataset containing data entries for the de-identified individuals comprising a user unique identifier (UID), date data, time data, location data, and activity data; link the data entries of the de-identified individuals of the plurality of formatted datasets by matching at least the date data, time data, and location data; analyze the activity data of the linked data entries; and generate at least one report based on the analysis. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A system, comprising:
-
a probabilistic engine; an anonymized data formatting engine operably connected to the probabilistic engine; and a reporting engine operably connected to the probabilistic engine; wherein the probabilistic engine comprises a processor and a storage device operably connected to the processor and configured to cause the processor to; receive, from the anonymized data formatting engine, a plurality of disparate anonymized datasets originating from a plurality of different data sources, each anonymized dataset comprising de-identified data of individuals; format the de-identified data of each of the plurality of the disparate anonymized datasets to provide a plurality of formatted anonymized datasets, each formatted anonymized dataset containing data entries for the de-identified individuals comprising a user unique identifier (UID), date data, time data, location data, and activity data; link the data entries of the de-identified individuals of the plurality of formatted datasets by matching at least the date data, time data, and location data; analyze the activity data of the linked data entries; and transmit the analysis to the reporting engine to generate at least one report. - View Dependent Claims (20, 21)
-
Specification