×

Privacy preserving statistical analysis for distributed databases

  • US 8,893,292 B2
  • Filed: 11/14/2012
  • Issued: 11/18/2014
  • Est. Priority Date: 11/14/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for securely determining aggregate statistics on private data, comprising the steps of:

  • randomizing, in a client, firstly and independently data X and Y to obtain randomized data {circumflex over (X)} and Ŷ

    , respectively, wherein the randomizing firstly preserves a privacy of the data X and Y, wherein the randomizing operates directly on the data X and Y, wherein the data X are produced by a first data source, and the data Y are produced by a second data source, and the data X and Y are produced independently in a distributed manner;

    randomizing, in the client, secondly independently the randomized data {circumflex over (X)} and Ŷ

    to obtain randomized data {tilde over (X)} and {tilde over (Y)} for a server, and helper information T{tilde over (X)}|{circumflex over (X)} and T{tilde over (Y)}|Ŷ

    for the client, respectively, wherein T represents an empirical distribution, and wherein the randomizing secondly preserves the privacy of the aggregate statistics of the data X and Y;

    determining, at the server, T{tilde over (X)},{tilde over (Y)};

    applying, by the client, the helper information T{tilde over (X)}|{circumflex over (X)} and T{tilde over (Y)}|Ŷ

    to T{tilde over (X)},{tilde over (Y)} to obtain an estimated {dot over (T)}X,Y, wherein “

    |” and



    ,”

    between X and Y represent a conditional and joint distribution, respectively.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×