Machine learning method
First Claim
1. A computer-executable method for using machine learning to predict an outcome, the method comprising:
- defining a first outcome associated with a first range of medical costs at least as great as a cost threshold;
defining a second outcome associated with a second range of medical costs less than the cost threshold, wherein the second outcome is more likely than the first outcome; and
processing training data with a machine learning system, wherein said training data is a subset of a data set and is recorded in a computer-readable medium, and wherein the act of processing the training data includes;
selecting a first subset of the training data, the first subset corresponding to the first outcome;
selecting a second subset of the training data, the second subset corresponding to the second outcome and consisting of a set of nearby neighbors to the first outcome; and
selecting a third subset of the training data, the third subset corresponding to the second outcome, wherein the third subset does not consist of nearby neighbors to the first outcome; and
using a plurality of software-based, computer-executable machine learners to develop from the first, second and third subsets one or more sets of computer-executable rules usable to predict the first outcome or the second outcome.
15 Assignments
0 Petitions
Accused Products
Abstract
A method for using machine learning to solve problems having either a “positive” result (the event occurred) or a “negative” result (the event did not occur), in which the probability of a positive result is very low and the consequences of the positive result are significant. Training data is obtained and a subset of that data is distilled for application to a machine learning system. The training data includes some records corresponding to the positive result, some nearest neighbors from the records corresponding to the negative result, and some other records corresponding to the negative result. The machine learning system uses a co-evolution approach to obtain a rule set for predicting results after a number of cycles. The machine system uses a fitness function derived for use with the type of problem, such as a fitness function based on the sensitivity and positive predictive value of the rules. The rules are validated using the entire set of training data.
-
Citations
26 Claims
-
1. A computer-executable method for using machine learning to predict an outcome, the method comprising:
-
defining a first outcome associated with a first range of medical costs at least as great as a cost threshold;
defining a second outcome associated with a second range of medical costs less than the cost threshold, wherein the second outcome is more likely than the first outcome; and
processing training data with a machine learning system, wherein said training data is a subset of a data set and is recorded in a computer-readable medium, and wherein the act of processing the training data includes;
selecting a first subset of the training data, the first subset corresponding to the first outcome;
selecting a second subset of the training data, the second subset corresponding to the second outcome and consisting of a set of nearby neighbors to the first outcome; and
selecting a third subset of the training data, the third subset corresponding to the second outcome, wherein the third subset does not consist of nearby neighbors to the first outcome; and
using a plurality of software-based, computer-executable machine learners to develop from the first, second and third subsets one or more sets of computer-executable rules usable to predict the first outcome or the second outcome. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 24, 25, 26)
-
-
16. A computer-executable method for using machine learning to predict results comprising the act of:
-
processing a representation of a subset of a data set with a machine learning system, the representation comprising;
first data corresponding to a first outcome, wherein the first outcome is associated with a range of medical costs at least as great as a predetermined threshold amount;
second data corresponding to a second outcome, wherein the second outcome is associated with a range of medical costs lower than the predetermined threshold amount, wherein the second data consists of a set of nearby neighbors to the first outcome, and wherein the second outcome is less likely than the first outcome; and
third data corresponding to the second outcome, wherein the third data is different than the second data;
repeating for a plurality of cycles;
using a plurality of software-based, computer-executable machine learners to develop a set of computer executable rules from the processed representation of the subset of the data set;
evaluating the set of computer-executable rules using a user-selectable fitness function; and
modifying the machine learning methods executed by a plurality of software-based, computer-executable machine learners by using the results of the evaluating act; and
presenting a final set of computer-executable rules usable to predict the first outcome or the second outcome. - View Dependent Claims (17, 18, 19, 20)
-
-
21. A computer-executable method for using machine learning to predict a positive or a negative outcome, where the positive outcome is less likely than the negative outcome, the method comprising:
-
defining a positive outcome associated with a range of medical costs equal to or greater than a cost threshold;
defining a negative outcome associated with a range of medical costs less than the cost threshold; and
processing training data with a machine learning system, wherein said training data is a subset of a data set and is recorded in a computer-readable medium, and wherein the act of processing the training data includes;
selecting a first subset of the training data, the first subset corresponding to the positive outcome;
selecting a second subset of the training data, the second subset corresponding to the negative outcome and consisting of a set of nearest neighbors to the positive outcome;
selecting a third subset of the training data, the third subset corresponding to the negative outcome, wherein the third subset does not consist of nearest neighbors to the positive outcome; and
using a plurality of software-based, computer-executable machine learners to develop from the first, second and third subsets of the training data one or more sets of computer-executable rules usable to predict either the positive outcome or the negative outcome. - View Dependent Claims (22, 23)
-
Specification