×

Methods and apparatus for privacy preserving data mining using statistical condensing approach

  • US 7,302,420 B2
  • Filed: 08/14/2003
  • Issued: 11/27/2007
  • Est. Priority Date: 08/14/2003
  • Status: Active Grant
First Claim
Patent Images

1. A method of generating at least one output data set from at least one multidimensional static input data set for use in association with a data mining process, comprising the steps of:

  • generating data statistics from the at least one multidimensional static input data set in an iterative manner in accordance with one or more records from the at least one static input data set included in at least one condensed data group, and further comprising the steps of forming at least one condensed data group having a specific number of records from the static data set closest to a given record of the static data set, generating first order statistics and second order statistics for the at least one condensed data group, deleting records from the static data set that are included in the at least one condensed data group, determining if records remain in the static data set, and forming additional condensed data groups if records remain in the static data set;

    generating the at least one output data set from the data statistics, wherein the output data set differs from the static input data set but maintains one or more correlations from within the static input data set; and

    storing the at least one output data set in a storage device for use by a user.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×