×

System and method for automatic generation of features from datasets for use in an automated machine learning process

  • US 10,410,138 B2
  • Filed: 05/26/2016
  • Issued: 09/10/2019
  • Est. Priority Date: 07/16/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method for generating a statistical classifier, comprising:

  • receiving a designation of a training dataset comprising a plurality of raw data instances each including a set of data objects assigned at least one value;

    applying a function to each raw data instance to calculate a set of first results comprising a collection of complex objects having a plurality of parameters built from a plurality of primitive types, wherein the function is selected from a plurality of functions;

    extracting a plurality of values from the set of first results;

    generating a plurality of candidate classification features, each respective candidate classification feature including;

    (i) the function that outputs a complex object, selected from the plurality of functions,(ii) at least one condition selected from a plurality of conditions, and(iii) a value selected from the plurality of extracted values,wherein the respective candidate classification feature outputs an output value computed by the at least one condition that compares between the complex object output of the function and the value selected from the plurality of extracted values;

    selecting a subset of pivotal classification features from the generated candidate classification features according to a correlation requirement between at least one classification variable and each respective candidate classification feature;

    andgenerating a statistical classifier for classification of the at least one classification variable based on the selected subset of pivotal features applied to a new training dataset.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×