SYSTEMS AND METHODS FOR ANALYZING DATA TO PREDICT MEDICAL OUTCOMES
First Claim
1. A computer-executable method for analyzing data to predict medical outcomes, the method comprising:
- receiving data associating feature variables comprising demographic data of a plurality of patients with outcome variables corresponding to medical conditions of the plurality of patients, wherein the data comprises a first data set associated with a first outcome and a second data set associated with a second outcome, the second outcome being substantially less likely than the first outcome;
identifying within the first data set a third data set that consists essentially of nearby neighbors to the second data set; and
processing the first, second and third data sets to generate at least one set of computer-executable rules for predicting the likelihood of the first outcome or the second outcome.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for using machine learning to solve problems having either a “positive” result (the event occurred) or a “negative” result (the event did not occur), in which the probability of a positive result is very low and the consequences of the positive result are significant. Training data is obtained and a subset of that data is distilled for application to a machine learning system. The training data includes some records corresponding to the positive result, some nearest neighbors from the records corresponding to the negative result, and some other records corresponding to the negative result. The machine learning system uses a co-evolution approach to obtain a rule set for predicting results after a number of cycles. The machine system uses a fitness function derived for use with the type of problem, such as a fitness function based on the sensitivity and positive predictive value of the rules. The rules are validated using the entire set of training data.
43 Citations
20 Claims
-
1. A computer-executable method for analyzing data to predict medical outcomes, the method comprising:
-
receiving data associating feature variables comprising demographic data of a plurality of patients with outcome variables corresponding to medical conditions of the plurality of patients, wherein the data comprises a first data set associated with a first outcome and a second data set associated with a second outcome, the second outcome being substantially less likely than the first outcome;
identifying within the first data set a third data set that consists essentially of nearby neighbors to the second data set; and
processing the first, second and third data sets to generate at least one set of computer-executable rules for predicting the likelihood of the first outcome or the second outcome. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for using machine learning to predict a medical outcome, the system comprising:
-
medical data associating feature variables comprising demographic data with outcome variables, wherein the medical data comprises a first data set associated with a first outcome and a second data set associated with a second outcome substantially less likely than the first outcome;
a processing module configured to identify a first subset of the first data set consisting essentially of non-nearby neighbors to the second data set, a second subset within the first data set consisting essentially of nearby neighbors to the second data set, and a third subset of the second data set; and
a plurality of machine learners configured to develop from the first, second and third subsets at least one set of computer-executable rules usable to predict the first outcome or the second outcome. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer system for using machine learning to predict an outcome associated with a medical condition, the computer system comprising:
-
means for storing data associating feature variables comprising demographic data of a plurality of patients with outcome variables corresponding to medical conditions of the plurality of patients, wherein the data comprises a first data set associated with a first outcome and a second data set associated with a second outcome, the second outcome being substantially less likely than the first outcome;
means for identifying within the first data set a third data set that consists essentially of nearby neighbors to the second data set; and
means for processing the first, second and third data sets to generate at least one set of computer-executable rules for predicting the likelihood of the first outcome or the second outcome. - View Dependent Claims (20)
-
Specification