×

System and method for data anonymization using hierarchical data clustering and perturbation

  • US 9,135,320 B2
  • Filed: 06/13/2013
  • Issued: 09/15/2015
  • Est. Priority Date: 06/13/2012
  • Status: Active Grant
First Claim
Patent Images

1. A system for data anonymization comprising:

  • a computer system for electronically receiving an original dataset and allowing a user to specify a relative importance of at least one attribute of the dataset; and

    an anonymization program executed by the computer system for producing an anonymized dataset from the original dataset, the anonymization program executing;

    a vector space mapping sub-process for converting each record of the original dataset to a normalized vector that can be compared to other vectors;

    a hierarchical clustering sub-process for dividing the normalized vectors into disjointed k-sized groups of similar records based on a hierarchical clustering technique;

    a perturbation sub-process for generating anonymized clusters from individual clusters generated by the hierarchical clustering sub-process; and

    an original domain mapping sub-process to combine and remap anonymized clusters back to an original domain of the original dataset.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×