×

Differentially private machine learning using a random forest classifier

  • US 10,726,153 B2
  • Filed: 09/27/2018
  • Issued: 07/28/2020
  • Est. Priority Date: 11/02/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • receiving a request from a client to generate a differentially private random forest classifier trained using a set of restricted data stored by a private database system, the request identifying a level of differential privacy corresponding to the request, the identified level of differential privacy comprising privacy parameters ε and

    δ

    , wherein ε

    describes a degree of information released about the set of restricted data due to the request and δ

    describes an improbability of the request satisfying (ε

    )-differential privacy;

    generating the differentially private random forest classifier in response to the request, generating the classifier comprising;

    determining a number of decision trees comprising the differentially private random forest classifier;

    generating the determined number of decision trees, wherein a decision tree comprises a plurality of leaf nodes representing classification categories, and generating the decision tree comprises;

    generating a set of splits based on features of the set of restricted data;

    determining an information gain for each split of the set of splits;

    selecting a split from the set of splits using an exponential mechanism based at least in part on the determined information gains of the splits in the set and at least one of the privacy parameters;

    adding the selected split to the decision tree at a node; and

    determining, for a certain leaf node of the plurality of leaf nodes representing a certain classification category of the classification categories, a differentially private count of entities in the set of restricted data in the certain classification category; and

    providing the differentially private random forest classifier to the client, the provided differentially private random forest classifier comprising the differentially private count of entities in the certain classification category represented by the certain leaf node.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×