System and method for distributed privacy preserving data mining
First Claim
1. A method of data mining in a privacy-preserving manner in a distributed computing environment including a plurality of entities, comprising the steps of:
- a first entity of the plurality of entities exchanging summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity; and
the first entity mining data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol;
wherein the summary information exchanging step comprises;
the first and second entities each generating a random number;
in a first round, the first entity and the second entity adding their random numbers to a global count, in a given order, to generate an overall global count, wherein a current count value that an entity receives from a previous entity in the given order is the global count; and
in a second round, the first entity and second entity adding a first value and a second value, respectively, to the overall global count, in the given order, the first and second values representing private data of the first and second entities.
0 Assignments
0 Petitions
Accused Products
Abstract
Distributed privacy preserving data mining techniques are provided. A first entity of a plurality of entities in a distributed computing environment exchanges summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity. The first entity may then mine data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol. The first entity may obtain, from the second entity via the privacy-preserving data sharing protocol, information relating to the number of transactions in which a particular itemset occurs and/or information relating to the number of transactions in which a particular rule is satisfied.
-
Citations
17 Claims
-
1. A method of data mining in a privacy-preserving manner in a distributed computing environment including a plurality of entities, comprising the steps of:
-
a first entity of the plurality of entities exchanging summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity; and
the first entity mining data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol;wherein the summary information exchanging step comprises; the first and second entities each generating a random number; in a first round, the first entity and the second entity adding their random numbers to a global count, in a given order, to generate an overall global count, wherein a current count value that an entity receives from a previous entity in the given order is the global count; and in a second round, the first entity and second entity adding a first value and a second value, respectively, to the overall global count, in the given order, the first and second values representing private data of the first and second entities. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. Apparatus associated with a first entity in a distributed computing environment, including a plurality of entities, for data mining in a privacy-preserving manner, comprising:
-
a memory; and at least one processor coupled to the memory and operative to;
(i) exchange summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity; and
(ii) mine data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol;wherein the summary information exchanging operation comprises; generating a random number by the first entity; in a first round, the first entity adding its random number to a global count and transmitting the global count to the second entity, and the second entity adding a second random number to the global count, wherein the first and second entities add their random numbers to the global count, in a given order, to generate an overall global count, wherein a current count value that an entity receives from a previous entity in the given order is the global count; and in a second round, the first entity and second entity adding a first value and a second value, respectively, to the overall global count, in the given order, the first and second values representing private data of the first and second entities. - View Dependent Claims (13, 14, 15, 16)
-
-
17. An article of manufacture for use with a first entity in a distributed computing environment, including a plurality of entities, the article of manufacture comprising one or more programs which when executed by a computer, implement a method for data mining in a privacy-preserving manner comprising the steps of:
-
the first entity exchanging summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity; and the first entity mining data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol; wherein the summary information exchanging step comprises; the first and second entities each generating a random number; in a first round, the first entity and the second entity adding their random numbers to a global count, in a given order, to generate an overall global count, wherein a current count value that an entity receives from a previous entity in the given order is the global count; and in a second round, the first entity and second entity adding a first value and a second value, respectively, to the overall global count, in the given order, the first and second values representing private data of the first and second entities.
-
Specification