Predictive modeling accuracy
First Claim
1. A computer-implemented method, the method comprising:
- receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer;
generating a plurality of modified training data sets, wherein generating each of the modified training data sets comprises applying a respective filter or combination of filters to examples in the training data set to generate the examples of the modified training data set, the filter or combination of filters altering one or more of the examples;
training a plurality of predictive models, including training each of one or more predictive models using a different respective modified training data set of the plurality of modified training data sets;
determining a respective accuracy for each of the plurality of predictive models;
identifying a most accurate predictive model based on the determined respective accuracies; and
specifying an association between the training data set and the filter or combination of filters used to generate the modified training data set that was used to train the most accurate predictive model.
2 Assignments
0 Petitions
Accused Products
Abstract
In general, a method includes receiving a training data set that includes a plurality of examples, wherein each example includes one or more features and an answer, generating a plurality of modified training data sets by applying one or more filters to the training data set, each of the plurality of modified training data sets being based on a different combination of the one or more filters, training a plurality of predictive models, each of the plurality of predictive models being trained using a different modified training data set of the plurality of modified training data sets, determining a respective accuracy for each of the plurality of predictive models, identifying a most accurate predictive model based on the determined accuracies, and specifying an association between the training data set and the combination of filters used to generate the modified training data set that was used to train the most accurate predictive model.
78 Citations
38 Claims
-
1. A computer-implemented method, the method comprising:
-
receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer; generating a plurality of modified training data sets, wherein generating each of the modified training data sets comprises applying a respective filter or combination of filters to examples in the training data set to generate the examples of the modified training data set, the filter or combination of filters altering one or more of the examples; training a plurality of predictive models, including training each of one or more predictive models using a different respective modified training data set of the plurality of modified training data sets; determining a respective accuracy for each of the plurality of predictive models; identifying a most accurate predictive model based on the determined respective accuracies; and specifying an association between the training data set and the filter or combination of filters used to generate the modified training data set that was used to train the most accurate predictive model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. One or more non-transitory computer storage devices comprising instructions that, when executed by one or more processing devices, cause the one or more processing devices to perform operations comprising:
-
receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer; generating a plurality of modified training data sets, wherein generating each of the modified training data sets comprises applying a respective filter or combination of filters to examples in the training data set to generate the examples of the modified training data set, the filter or combination of filters altering one or more of the examples; training a plurality of predictive models, including training each of one or more predictive models using a different respective modified training data set of the plurality of modified training data sets; determining a respective accuracy for each of the plurality of predictive models; identifying a most accurate predictive model based on the determined respective accuracies; and specifying an association between the training data set and the filter or combination of filters used to generate the modified training data set that was used to train the most accurate predictive model. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A system comprising:
-
one or more processing devices; and one or more memory devices comprising instructions that, when executed by the one or more processing devices, cause the one or more processing devices to perform operations comprising; receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer; generating a plurality of modified training data sets, wherein generating each of the modified training data sets comprises applying a respective filter or combination of filters to examples in the training data set to generate the examples of the modified training data set, the filter or combination of filters altering one or more of the examples; training a plurality of predictive models, including training each of one or more predictive models using a different respective modified training data set of the plurality of modified training data sets; determining a respective accuracy for each of the plurality of predictive models; identifying a most accurate predictive model based on the determined respective accuracies; and specifying an association between the training data set and the filter or combination of filters used to generate the modified training data set that was used to train the most accurate predictive model. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
-
34. A computer-implemented method, the method comprising:
-
receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer; identifying one or more characteristics of the training data set; generating a plurality of modified training data sets by applying one or more filters to the training data set, each of the plurality of modified training data sets being based on a different combination of the one or more filters; training a plurality of predictive models, each of the plurality of predictive models being trained using a different modified training data set of the plurality of modified training data sets; determining a respective accuracy for each of the plurality of predictive models; identifying a most accurate predictive model based on the determined respective accuracies; and specifying an association between the training data set and the one or more filters used to generate the modified training data set that was used to train the most accurate predictive model, including specifying an association between the one or more filters and the one or more characteristics of the training data set. - View Dependent Claims (35, 36, 37)
-
-
38. A computer-implemented method, the method comprising:
-
receiving a training data set that comprises a plurality of examples, wherein each example comprises one or more features and an answer; generating a plurality of modified training data sets by applying one or more filters to the training data set, each of the plurality of modified training data sets being based on a different combination of the one or more filters; training a plurality of predictive models, each of the plurality of predictive models being trained using a different modified training data set of the plurality of modified training data sets; determining a respective accuracy for each of the plurality of predictive models; training an unfiltered predictive model based on the training data set, the unfiltered predictive model being trained without an application of the one or more filters; and determining a level of accuracy associated with the unfiltered predictive model identifying a most accurate predictive model based on the determined respective accuracies; and specifying an association between the training data set and the one or more filters used to generate the modified training data set that was used to train the most accurate predictive model.
-
Specification