Probabilistic inference in differentially private systems
First Claim
Patent Images
1. A method comprising:
- generating a noisy result using a differentially private computation from a private data set by a computing device, wherein the differentially private computation adds noise to the private data set using an exponential mechanism;
determining a conditional distribution of the differentially private computation wherein the conditional distribution describes a probability distribution for the noisy result;
determining a posterior distribution for the differentially private computation by the computing device using preexisting knowledge about one or more records of the private data set and the conditional distribution, wherein the preexisting knowledge comprises one or more of information about a user whose data is part of the private data set or information about a total number of records of the private data set;
wherein determining the posterior distribution using the preexisting knowledge about the one or more records of the private data set and the conditional distribution comprises;
retrieving a plurality of results from previous executions of the differentially private computation that were generated in response to previously received queries; and
inferring the posterior distribution using the conditional distribution, the plurality of results, and the preexisting knowledge about the one or more records of the private data set using probabilistic inference; and
providing the posterior distribution by the computing device, wherein the posterior distribution includes the probability that the generated noisy result is a true result from the private data set.
2 Assignments
0 Petitions
Accused Products
Abstract
Given that a differentially private mechanism has a known conditional distribution, probabilistic inference techniques may be used along with the known conditional distribution, and generated results from previously computed queries on private data, to generate a posterior distribution for the differentially private mechanism used by the system. The generated posterior distribution may be used to describe the probability of every possible result being the correct result. The probability may then be used to qualify conclusions or calculations that may depend on the returned result.
35 Citations
16 Claims
-
1. A method comprising:
-
generating a noisy result using a differentially private computation from a private data set by a computing device, wherein the differentially private computation adds noise to the private data set using an exponential mechanism; determining a conditional distribution of the differentially private computation wherein the conditional distribution describes a probability distribution for the noisy result; determining a posterior distribution for the differentially private computation by the computing device using preexisting knowledge about one or more records of the private data set and the conditional distribution, wherein the preexisting knowledge comprises one or more of information about a user whose data is part of the private data set or information about a total number of records of the private data set; wherein determining the posterior distribution using the preexisting knowledge about the one or more records of the private data set and the conditional distribution comprises; retrieving a plurality of results from previous executions of the differentially private computation that were generated in response to previously received queries; and inferring the posterior distribution using the conditional distribution, the plurality of results, and the preexisting knowledge about the one or more records of the private data set using probabilistic inference; and providing the posterior distribution by the computing device, wherein the posterior distribution includes the probability that the generated noisy result is a true result from the private data set. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
receiving a noisy first result at a computing device through a network, wherein the noisy first result is generated from a second result of a private data set using a differentially private computation that adds noise to the private data set using an exponential mechanism; determining a conditional distribution of the differentially private computation by the computing device, wherein the conditional distribution of the differentially private computation describes a probability distribution for the noisy first result; retrieving a plurality of results from previous executions of the differentially private computation that were generated in response to previously received queries; probabilistically inferring a posterior distribution of the differentially private computation using the conditional distribution, the plurality of results, and preexisting knowledge about one or more records of the private data set by the computing device, wherein the preexisting knowledge comprises one or more of information about a user whose data is part of the private data set or information about a total number of records of the private data set; and providing the posterior distribution by the computing device through the network, wherein the posterior distribution includes the probability that the noisy first result is equal to the second result from the private data set. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A system comprising:
-
a computing device; a privacy integrated platform that generates a noisy first result from a second result of a private data set using a differentially private computation, wherein the differentially private computation adds noise to the private data set using an exponential mechanism; and an inference engine that; determines a conditional distribution of the differentially private computation, wherein the conditional distribution of the differentially private computation describes a probability distribution for the noisy first result; generates a posterior distribution for the differentially private computation using preexisting knowledge about one or more records of the private data set and the conditional distribution, wherein the preexisting knowledge comprises one or more of information about a user whose data is part of the private data set or information about a total number of records of the private data set, and wherein generating the posterior distribution using the preexisting knowledge about the one or more records of the private data set and the conditional distribution comprises; retrieving a plurality of results from previous executions of the differentially private computation that were generated in response to previously received queries; and inferring the posterior distribution using the conditional distribution, the plurality of results, and the preexisting knowledge about the one or more records of the private data set using probabilistic inference; receives the generated noisy first result; and provides the generated noisy first result and the generated posterior distribution, wherein the posterior distribution includes the probability that the generated noisy first result is equal to the second result from the private data set. - View Dependent Claims (15, 16)
-
Specification