Methods of identifying patterns in biological systems and uses thereof
DC CAFCFirst Claim
Patent Images
1. A computer-implemented method for identifying patterns in data, the method comprising:
- (a) inputting into at least one support vector machine of a plurality of support vector machines a training set having known outcomes, the at least one support vector machine comprising a decision function having a plurality of weights, each having a weight value, wherein the training set comprises features corresponding to the data and wherein each feature has a corresponding weight;
(b) optimizing the plurality of weights so that classifier error is minimized;
(c) computing ranking criteria using the optimized plurality of weights;
(d) eliminating at least one feature corresponding to the smallest ranking criterion;
(e) repeating steps (a) through (d) for a plurality of iterations until a subset of features of pre-determined size remains; and
(f) inputting into the at least one support vector machine a live set of data wherein the features within the live set are selected according to the subset of features.
5 Assignments
Litigations
2 Petitions
Accused Products
Abstract
The methods, systems and devices of the present invention comprise use of Support Vector Machines and RFE (Recursive Feature Elimination) for the identification of patterns that are useful for medical diagnosis, prognosis and treatment. SVM-RFE can be used with varied data sets.
98 Citations
23 Claims
-
1. A computer-implemented method for identifying patterns in data, the method comprising:
-
(a) inputting into at least one support vector machine of a plurality of support vector machines a training set having known outcomes, the at least one support vector machine comprising a decision function having a plurality of weights, each having a weight value, wherein the training set comprises features corresponding to the data and wherein each feature has a corresponding weight; (b) optimizing the plurality of weights so that classifier error is minimized; (c) computing ranking criteria using the optimized plurality of weights; (d) eliminating at least one feature corresponding to the smallest ranking criterion; (e) repeating steps (a) through (d) for a plurality of iterations until a subset of features of pre-determined size remains; and (f) inputting into the at least one support vector machine a live set of data wherein the features within the live set are selected according to the subset of features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer-implemented method for identifying determinative genes for use in diagnosis, prognosis or treatment of a disease, the method comprising:
-
(a) inputting into a support vector machine a training data set of gene expression data having known outcomes with respect to the disease, the support vector machine comprising a decision function having a plurality of weights, each having a weight value, wherein the training set comprises features corresponding to the gene expression data and each feature has a corresponding weight; (b) optimizing the plurality of weights so that classifier error is minimized; (c) computing ranking criteria using the optimized plurality of weights; (d) eliminating at least one feature corresponding to the smallest ranking criterion; (e) repeating steps (a) through (d) for a plurality of iterations until an optimum subset of features remains; and (f) inputting into the support vector machine a live data set of gene expression data wherein the features within the live data set are selected according to the optimum subset of features. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer-implemented method for identifying patterns in biological data, the method comprising:
-
(a) inputting into at least some of a plurality of support vector machines a training data set, wherein the training data set comprises features corresponding to the biological data and each feature has a corresponding weight, and wherein each support vector machine comprises a decision function having a plurality of weights; (b) optimizing the plurality of weights so that classification confidence is optimized; (c) computing ranking criteria using the optimized plurality of weights; (d) eliminating at least one feature corresponding to the smallest ranking criteria; (e) repeating steps (a) through (d) for a plurality of iterations until an optimum subset of features remains; and (f) inputting into the plurality of support vector machines a live set of biological data wherein the features within the live set are selected according to the optimum subset of features. - View Dependent Claims (20, 21, 22, 23)
-
Specification