Exponential noise distribution to optimize database privacy and output utility
First Claim
Patent Images
1. A method for producing a noisy output that reduces predictability of data inputs while increasing accuracy of said noisy output, the method comprising:
- formulating a query against data associated with a plurality of privacy principals and stored in a database;
evaluating the query to determine a query diameter measurement;
performing the query on one or more data inputs;
calculating by a computer processor an output of the query;
calculating by a computer processor a substantially exponential distribution of noise values as a function of the query diameter measurement and a privacy parameter, the substantially exponential distribution being calculated according to a Laplacian distribution;
combining by a computer processor said output with a noise value selected from the calculated substantially exponential distribution of noise values to produce the noisy output; and
disclosing the noisy output.
2 Assignments
0 Petitions
Accused Products
Abstract
An amount of noise to add to a query output may be selected to preserve privacy of inputs while maximizing utility of the released output. Noise values can be distributed according to a substantially symmetric exponential density function (“exponential distribution”). That is, the most likely noise value can be zero, and noise values of increasing absolute value can decrease in probability according to the exponential function.
17 Citations
15 Claims
-
1. A method for producing a noisy output that reduces predictability of data inputs while increasing accuracy of said noisy output, the method comprising:
-
formulating a query against data associated with a plurality of privacy principals and stored in a database; evaluating the query to determine a query diameter measurement; performing the query on one or more data inputs; calculating by a computer processor an output of the query; calculating by a computer processor a substantially exponential distribution of noise values as a function of the query diameter measurement and a privacy parameter, the substantially exponential distribution being calculated according to a Laplacian distribution; combining by a computer processor said output with a noise value selected from the calculated substantially exponential distribution of noise values to produce the noisy output; and disclosing the noisy output. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer storage medium having stored thereon computer executable instructions executed by a computer processor for producing a noisy output that reduces predictability of data inputs while increasing accuracy of said noisy output comprising:
-
formulating a query against data associated with a plurality of privacy principals and stored in a database; evaluating the query to determine a query diameter measurement; performing the query on one or more data inputs; calculating an output of the query; calculating a substantially exponential distribution of noise values as a function of the query diameter measurement and a privacy parameter, the substantially exponential distribution being calculated according to a Laplacian distribution; combining said output with a noise value selected from the calculated substantially exponential distribution of noise values to produce the noisy output; and disclosing the noisy output. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A system for producing a noisy output that reduces predictability of data inputs while increasing accuracy of said noisy output, the system comprising:
-
a computer processor; and a memory having stored therein instructions comprising; formulating a query against data associated with a plurality of privacy principals and stored in a database; evaluating the query to determine a query diameter measurement; performing the query on one or more data inputs; calculating an output of the query; calculating a substantially exponential distribution of noise values as a function of the query diameter measurement and a privacy parameter, the substantially exponential distribution being calculated according to a Laplacian distribution; combining said output with a noise value selected from the calculated substantially exponential distribution of noise values to produce the noisy output; and disclosing the noisy output. - View Dependent Claims (12, 13, 14, 15)
-
Specification