System and method for privacy management of infinite data streams
First Claim
1. An apparatus for preserving streaming data anonymity, the apparatus comprising:
- a physical memory configured to store data streams of user data; and
at least one hardware processor operably connected to the physical memory, the at least one hardware processor configured to;
continuously collect the data streams of the user data comprising participation data or sensor data from a plurality of user devices;
convert the user data into a data summary that represents the participation data or the sensor data for a certain time period;
transform the data summary to a distorted summary based on a difference level for preserving differential privacy of the user data, wherein to transform the data summary, the hardware processor is configured to;
determine whether the difference level between the data summary and a previous data summary exceeds a difference threshold;
transform the data summary to the distorted summary by adding a first noise when the difference level exceeds the difference threshold;
when the difference level does not exceed the difference threshold;
determine whether a retroactive count of the data summary is greater than a retroactive count threshold,transform the data summary to the distorted summary by adding a second noise when the retroactive count is greater than the retroactive count threshold, andtransform the data summary to the distorted summary by adding a third noise when the retroactive count is not greater than the retroactive count threshold; and
transmit the distorted summary to a data processing system for statistic based services.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus, method, and computer readable medium for management of infinite data streams. The apparatus includes a memory that stores streaming data with a data set and a processor operably connected to the memory. The processor transforms the data set to a second data set. To transform the data set, the processor determines whether a difference level exceeds a threshold, and transforms the data set by adding a noise when the difference level exceeds the threshold. When the difference level does not exceed the threshold, the processor determines whether a retroactive count is greater than a threshold, transforms the data set by adding a second noise when the retroactive count is greater than the threshold, and transforms the data set by adding a third noise when the retroactive count is not greater than the threshold. The processor transmits the second data set to a data processing system for further processing.
23 Citations
20 Claims
-
1. An apparatus for preserving streaming data anonymity, the apparatus comprising:
-
a physical memory configured to store data streams of user data; and at least one hardware processor operably connected to the physical memory, the at least one hardware processor configured to; continuously collect the data streams of the user data comprising participation data or sensor data from a plurality of user devices; convert the user data into a data summary that represents the participation data or the sensor data for a certain time period; transform the data summary to a distorted summary based on a difference level for preserving differential privacy of the user data, wherein to transform the data summary, the hardware processor is configured to; determine whether the difference level between the data summary and a previous data summary exceeds a difference threshold; transform the data summary to the distorted summary by adding a first noise when the difference level exceeds the difference threshold; when the difference level does not exceed the difference threshold; determine whether a retroactive count of the data summary is greater than a retroactive count threshold, transform the data summary to the distorted summary by adding a second noise when the retroactive count is greater than the retroactive count threshold, and transform the data summary to the distorted summary by adding a third noise when the retroactive count is not greater than the retroactive count threshold; and transmit the distorted summary to a data processing system for statistic based services. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
continuously collecting, by a first device, data streams of user data comprising participation data or sensor data from a plurality of user devices; convert the user data into a data summary that represents the participation data or the sensor data for a certain time period; transforming the data summary to a distorted summary based on a difference level for preserving differential privacy of the user data, wherein to transform the data summary comprises; determining whether the difference level between the data summary and a previous data summary exceeds a difference threshold; transforming the data summary to the distorted summary by adding a first noise when the difference level exceeds the difference threshold; when the difference level does not exceed the difference threshold; determining whether a retroactive count of the data summary is greater than a retroactive count threshold, transforming the data summary to the distorted summary by adding a second noise when the retroactive count is greater than the retroactive count threshold, and transforming the data summary to the distorted summary by adding a third noise when the retroactive count is not greater than the retroactive count threshold; and transmitting the distorted summary to a data processing system for statistic based services. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer readable medium embodying a computer program, the computer program comprising computer readable program code that when executed causes at least one processing device to:
-
continuously collect data streams of user data comprising participation data or sensor data from a plurality of user devices; convert the user data into a data summary that represents the participation data or the sensor data for a certain time period; transform the data summary to a distorted summary based on a difference level for preserving differential privacy of the user data, wherein to transform the data summary, the computer readable program code that when executed causes at least one processing device to; determine whether the difference level between the data summary and a previous data summary exceeds a difference threshold; transform the data summary to the distorted summary by adding a first noise when the difference level exceeds the difference threshold; when the difference level does not exceed the difference threshold; determine whether a retroactive count of the data summary is greater than a retroactive count threshold, transform the data summary to the distorted summary by adding a second noise when the retroactive count is greater than the retroactive count threshold, and transform the data summary to the distorted summary by adding a third noise when the retroactive count is not greater than the retroactive count threshold; and transmit the distorted summary to a data processing system for statistic based services. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification