Apparatus and Methods For Anonymizing a Data Set
First Claim
1. A computerized method for anonymizing a dataset that associates each entity included in a set of entities with a set of two or more attribute values, wherein each said attribute value associated with the entity is associated with a different entity attribute, the method being performed by a computer system and comprising:
- receiving preference information;
for each entity included in the set of entities, transforming said set of two or more attribute values associated with the entity using the preference information, thereby creating for the entity a set of two or more transformed attribute values;
clustering the entities included in the set of entities using said transformed attribute values to form at least a first entity cluster consisting of a first subset of the entities and a second entity cluster consisting of a second subset of the entities, wherein no entity included in the first entity cluster is included in the second entity cluster;
anonymizing the first subset of entities; and
anonymizing the second subset of entities.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems are disclosed for anonymizing a dataset that correlates a set of entities with respective attributes. The method may include: for each entity included in a set of entities, transforming two or more attribute values associated with the entity using received preference information, thereby creating for the entity a set of two or more transformed attribute values; clustering the entities included in the set of entities using said transformed attribute values to form at least a first entity cluster consisting of a first subset of the entities and a second entity cluster consisting of a second subset of the entities, wherein no entity included in the first entity cluster is included in the second entity cluster; anonymizing the first subset of entities; and anonymizing the second subset of entities.
24 Citations
16 Claims
-
1. A computerized method for anonymizing a dataset that associates each entity included in a set of entities with a set of two or more attribute values, wherein each said attribute value associated with the entity is associated with a different entity attribute, the method being performed by a computer system and comprising:
-
receiving preference information; for each entity included in the set of entities, transforming said set of two or more attribute values associated with the entity using the preference information, thereby creating for the entity a set of two or more transformed attribute values; clustering the entities included in the set of entities using said transformed attribute values to form at least a first entity cluster consisting of a first subset of the entities and a second entity cluster consisting of a second subset of the entities, wherein no entity included in the first entity cluster is included in the second entity cluster; anonymizing the first subset of entities; and anonymizing the second subset of entities. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A database anonymizing apparatus, comprising
a data processing system; - and
a data storage system, wherein the data storage system stores a dataset that associates each entity included in a set of entities with a set of two or more attribute values, wherein each said attribute value associated with the entity is associated with a different entity attribute, the data storage system comprises a computer readable medium storing computer readable program code (CRPC), and the CRPC comprises; instructions for, for each entity included in the set of entities, transforming said set of two or more attribute values associated with the entity using received preference information, thereby creating for the entity a set of two or more transformed attribute values; instructions for clustering the entities included in the set of entities using said transformed attribute values to form at least a first entity cluster consisting of a first subset of the entities and a second entity cluster consisting of a second subset of the entities, wherein no entity included in the first entity cluster is included in the second entity cluster; instructions for anonymizing the first subset of entities; and instructions for anonymizing the second subset of entities. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
- and
Specification