×

Systems and techniques for determining the predictive value of a feature

  • US 10,366,346 B2
  • Filed: 10/21/2016
  • Issued: 07/30/2019
  • Est. Priority Date: 05/23/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for building a predictive model, comprising:

  • determining a multi-model predictive value of a feature of an initial dataset representing a prediction problem, wherein the initial dataset includes a plurality of observations and each observation includes respective values for a plurality of features, including;

    (a) performing one or more predictive modeling procedures, wherein each of the predictive modeling procedures is associated with a different type of predictive model, wherein performing each modeling procedure comprises fitting the associated predictive model to the initial dataset;

    (b) reducing the multi-model predictive value of the feature by shuffling values of the feature across respective observations included in the initial dataset, thereby generating a modified dataset;

    (c) for each of the fitted predictive models;

    (c1) determining a first accuracy score representing an accuracy with which the fitted model generates predictions for data in the initial dataset;

    (c2) determining a second accuracy score representing an accuracy with which the fitted model generates predictions for data in the modified dataset in which the multi-model predictive value of the feature has been reduced; and

    (c3) determining a model-specific predictive value of the feature based on the first and second accuracy scores of the fitted model; and

    (d) determining, based on the model-specific predictive values of the feature, that the multi-model predictive value of the feature is low;

    performing feature engineering on the initial dataset based on the multi-model predictive value of the feature, including pruning the feature having the low multi-model predictive value from the initial dataset, thereby generating a pruned dataset; and

    building a predictive model for the prediction problem, including;

    performing a plurality of predictive modeling procedures on the pruned dataset, selecting a fitted predictive model generated by the plurality of predictive modeling procedures, and deploying the selected predictive model to predict outcomes of the prediction problem without using the pruned feature.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×