Method, apparatus and computer program product for preserving privacy in data mining
First Claim
Patent Images
1. A method for preserving privacy in mining of sparse high dimensional data records, comprising:
- receiving the data records having high dimensionality; and
transforming the data records into anonymized data records for use in data mining by creating a sketch-based private representation of each data record, wherein each data record contains only a small number of non-zero attribute values in relation to the high dimensionality of the data records;
wherein the sketch of a record Xl . . . Xd is defined by the quantity sj such that;
SJ=Σ
i=1dxi·
rij where the random variable rij is drawn from {−
1, +1} with a mean of 0.
1 Assignment
0 Petitions
Accused Products
Abstract
Privacy in data mining of sparse high dimensional data records is preserved by transforming the data records into anonymized data records. This transformation involves creating a sketch-based private representation of each data record, each data record containing only a small number of non-zero attribute value in relation to the high dimensionality of the data records.
5 Citations
13 Claims
-
1. A method for preserving privacy in mining of sparse high dimensional data records, comprising:
-
receiving the data records having high dimensionality; and transforming the data records into anonymized data records for use in data mining by creating a sketch-based private representation of each data record, wherein each data record contains only a small number of non-zero attribute values in relation to the high dimensionality of the data records; wherein the sketch of a record Xl . . . Xd is defined by the quantity sj such that;
SJ=Σ
i=1dxi·
rijwhere the random variable rij is drawn from {−
1, +1} with a mean of 0. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product for preserving privacy in mining of sparse high dimensional data records, comprising a computer usable storage medium having a computer readable program, wherein the computer readable program, when executed on a computer, causes the computer to:
-
receive the data records having high dimensionality; and transform the data records having high dimensionality into anonymized data records for use in data mining by creating a sketch-based private representation of each data record, wherein each data record contains only a small number of non-zero attribute values in relation to the high dimensionality of the data records; wherein the sketch of a record X1 . . . Xd is defined by the quantity sj such that;
SJ=Σ
i=1dxi·
rijwhere the random variable rij is drawn from {−
1, +1} with a mean of 0. - View Dependent Claims (9, 10, 11, 12, 13)
-
Specification