Mahalanobis distance genetic algorithm (MDGA) method and system
First Claim
1. A computer-implemented method for identifying a desired variable subset, comprising:
- obtaining a set of data records corresponding to a plurality of variables;
defining the data records as normal data or abnormal data based on predetermined criteria;
initializing a genetic algorithm with a subset of variables from the plurality of variables;
calculating Mahalanobis distances of the normal data and the abnormal data based on the subset of variables; and
identifying a desired subset of the plurality of variables by performing the genetic algorithm based on the Mahalanobis distances.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method to provide a desired variable subset. The method may include obtaining a set of data records corresponding a plurality of variables and defining the data records as normal data or abnormal data based on predetermined criteria. The method may also include initializing a genetic algorithm with a subset of variables from the plurality of variables and calculating Mahalanobis distances of the normal data and the abnormal data based on the subset of variables. Further, the method may include identifying a desired subset of the plurality of variables by performing the genetic algorithm based on the Mahalanobis distances.
53 Citations
29 Claims
-
1. A computer-implemented method for identifying a desired variable subset, comprising:
-
obtaining a set of data records corresponding to a plurality of variables;
defining the data records as normal data or abnormal data based on predetermined criteria;
initializing a genetic algorithm with a subset of variables from the plurality of variables;
calculating Mahalanobis distances of the normal data and the abnormal data based on the subset of variables; and
identifying a desired subset of the plurality of variables by performing the genetic algorithm based on the Mahalanobis distances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method for defining normal data and abnormal data from a data set, comprising:
-
obtaining two or more clusters by applying a clustering algorithm to the data set;
determining a first cluster and a second cluster that have a largest difference in normalized means; and
defining the first cluster as normal data and the second cluster as abnormal data. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer system, comprising:
-
a console;
at least one input device; and
a central processing unit (CPU) configured to;
obtain a set of data records corresponding to a plurality of variables, wherein a total number of the data records is less than a total number of the plurality of variables;
define the data records as normal data or abnormal data based on predetermined criteria;
initialize a genetic algorithm with a subset of variables from the plurality of variables;
calculate Mahalanobis distances of the normal data and the abnormal data based on the subset of variables; and
identify a desired subset of the plurality of variables by performing the genetic algorithm based on the Mahalanobis distances. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. A computer-readable medium for use on a computer system configured to perform a variable reducing procedure, the computer-readable medium having computer-executable instructions for performing a method comprising:
-
obtaining a set of data records corresponding to a plurality of variables, wherein a total number of the data records is less than a total number of the plurality of variables;
defining the data records as normal data or abnormal data based on predetermined criteria;
initializing a genetic algorithm with a subset of variables from the plurality of variables;
calculating Mahalanobis distances of the normal data and the abnormal data based on the subset of variables; and
identifying a desired subset of the plurality of variables by performing the genetic algorithm based on the Mahalanobis distances. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29)
-
Specification