Method and apparatus for variable privacy preservation in data mining
First Claim
1. A method for preserving privacy of data records for use in a data mining application, comprising the steps of:
- assigning different privacy levels to the data records;
constructing condensed groups of data records from the data records based on the privacy levels, wherein a data record having a first privacy level is condensed with a data record having a second privacy level different than the first privacy level;
maintaining summary statistics for each condensed group of data records, wherein the summary statistics for a given condensed group of data records includes information regarding a sum of privacy levels of data records in the given condensed group of data records; and
generating pseudo-data from the summary statistics, wherein the pseudo-data is available for use in the data mining application,wherein the assigning, constructing, maintaining and generating steps are performed by a data processing system.
0 Assignments
0 Petitions
Accused Products
Abstract
Improved privacy preservation techniques are disclosed for use in accordance with data mining. By way of example, a technique for preserving privacy of data records for use in a data mining application comprises the following steps/operations. Different privacy levels are assigned to the data records. Condensed groups are constructed from the data records based on the privacy levels, wherein summary statistics are maintained for each condensed group. Pseudo-data is generated from the summary statistics, wherein the pseudo-data is available for use in the data mining application. Principles of the invention are capable of handling both static and dynamic data sets.
3 Citations
20 Claims
-
1. A method for preserving privacy of data records for use in a data mining application, comprising the steps of:
-
assigning different privacy levels to the data records; constructing condensed groups of data records from the data records based on the privacy levels, wherein a data record having a first privacy level is condensed with a data record having a second privacy level different than the first privacy level; maintaining summary statistics for each condensed group of data records, wherein the summary statistics for a given condensed group of data records includes information regarding a sum of privacy levels of data records in the given condensed group of data records; and generating pseudo-data from the summary statistics, wherein the pseudo-data is available for use in the data mining application, wherein the assigning, constructing, maintaining and generating steps are performed by a data processing system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. Apparatus for preserving privacy of data records for use in a data mining application, comprising:
-
a memory; and a processor coupled to the memory and operative to; assign different privacy levels to the data records; construct condensed groups of data records from the data records based on the privacy levels, wherein a data record having a first privacy level is condensed with a data record having a second privacy level different than the first privacy level; maintain summary statistics for each condensed group of data records, wherein the summary statistics for a given condensed group of data records includes information regarding a sum of privacy levels of data records in the given condensed group of data records; and generate pseudo-data from the summary statistics, wherein the pseudo-data is available for use in the data mining application. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An article of manufacture for use in preserving privacy of data records for use in a data mining application, the article comprising a machine readable storage device containing one or more programs which when executed implement the steps of:
-
assigning different privacy levels to the data records; constructing condensed groups of data records from the data records based on the privacy levels, wherein a data record having a first privacy level is condensed with a data record having a second privacy level different than the first privacy level; maintaining summary statistics for each condensed group of data records, wherein the summary statistics for a given condensed group of data records includes information regarding a sum of privacy levels of data records in the given condensed group of data records; and generating pseudo-data from the summary statistics, wherein the pseudo-data is available for use in the data mining application. - View Dependent Claims (20)
-
Specification