System and method for using classification trees to predict rare events
First Claim
Patent Images
1. A method, comprising:
- loading a plurality of data records, wherein each data record has one or more attributes, wherein the plurality of data records include a first group;
assigning a relevant event to be predicted;
selecting at least one of the one or more attributes;
creating a plurality of subgroups associated with the first group, wherein each data record associated with the first group is associated with at least one subgroup, wherein the associating for each record is based at least in part on a respective value associated with the selected attribute; and
repeating the selecting and creating until a concentration of positive outcomes for the relevant event is sufficient.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods are provided for predicting rare events, such as hospitalization events. A set of data records, each containing multiple attributes with one or more values (which may include an “unknown” value), may represent a root node of a decision tree. This root node may be partitioned based on one of the attributes, such that the concentration (e.g., “purity”) of a relevant outcome (e.g., the rare event) is increased in one node and decreased in another. This process may be repeated until a decision tree with sufficiently pure leaf nodes is created. This “purified” decision tree may then be used to predict one or more rare events.
-
Citations
19 Claims
-
1. A method, comprising:
-
loading a plurality of data records, wherein each data record has one or more attributes, wherein the plurality of data records include a first group; assigning a relevant event to be predicted; selecting at least one of the one or more attributes; creating a plurality of subgroups associated with the first group, wherein each data record associated with the first group is associated with at least one subgroup, wherein the associating for each record is based at least in part on a respective value associated with the selected attribute; and repeating the selecting and creating until a concentration of positive outcomes for the relevant event is sufficient. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, comprising:
-
a memory configured to load a plurality of data records, wherein each data record has one or more attributes, wherein the plurality of data records include a first group; a processor configured to assign a relevant event to be predicted; the processor configured to select at least one of the one or more attributes; the processor configured to create a plurality of subgroups associated with the first group, wherein each data record associated with the first group is associated with at least one subgroup, wherein the associating for each record is based at least in part on a respective value associated with the selected attribute; the processor further configured to repeat the selecting and creating until a concentration of positive outcomes for the relevant event is sufficient. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-readable storage medium encoded with instructions configured to be executed by a processor, the instructions which, when executed by the processor, cause the performance of a method, comprising:
-
loading a plurality of data records, wherein each data record has one or more attributes, wherein the plurality of data records include a first group; assigning a relevant event to be predicted; selecting at least one of the one or more attributes; creating a plurality of subgroups associated with the first group, wherein each data record associated with the first group is associated with at least one subgroup, wherein the associating for each record is based at least in part on a respective value associated with the selected attribute; and repeating the selecting and creating until a concentration of positive outcomes for the relevant event is sufficient.
-
Specification