×

K-anonymity and L-diversity data anonymization in an in-memory database

  • US 10,565,398 B2
  • Filed: 10/26/2017
  • Issued: 02/18/2020
  • Est. Priority Date: 10/26/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving an indication to perform data anonymization based on quasi attributes of a data set, wherein the data set includes both the quasi attributes and one or more sensitive attributes;

    recursively performing partitioning of the data set based on one or more of the quasi attributes until both a first anonymization threshold corresponding to the quasi attributes is satisfied, wherein the first anonymization threshold is based on K-anonymity and indicates from how many other records that each record in one of the sub-partitions of the resultant data set is indistinguishable and a second anonymization threshold corresponding to the one or more sensitive attributes is satisfied for each of a plurality of sub-partitions produced as a result of the partitioning, wherein the second anonymization threshold is based on L-diversity and indicates a minimum number of sensitive values that exist in each sub-partition of the resultant data set; and

    providing a resultant data set including a plurality of records of the data set corresponding to the plurality of sub-partitions that satisfy both the first anonymization threshold and the second anonymization threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×