×

Systems and methods for generating biomarker signatures with integrated bias correction and class prediction

  • US 10,339,464 B2
  • Filed: 06/21/2013
  • Issued: 07/02/2019
  • Est. Priority Date: 06/21/2012
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of classifying a data set into two or more classes, comprising:

  • (a) receiving, by a biomarker generator, a training data set and a training class set, the training class set including a set of known labels, each known label identifying a class associated with each element in the training data set;

    (b) receiving, by the biomarker generator, a test data set;

    (c) generating, by the biomarker generator, a first classifier for the training data set by applying a first machine learning technique to the training data set and the training class set;

    (d) generating, by the biomarker generator, a first test class set by classifying the elements in the test data set according to the first classifier;

    (e) transforming, by the biomarker generator, the training data set by shifting the elements in the training data set by an amount corresponding to a center of a set of training class centroids, wherein each training class centroid is representative of a center of a subset of elements in the training data set;

    (f) for each of a plurality of iterations;

    (i) transforming, by the biomarker generator, the test data set by shifting the elements in the test data set by an amount corresponding to a center of a set of test class centroids, wherein each test class centroid is representative of a center of a subset of elements in the test data set;

    (ii) generating, by the biomarker generator, a second test class set by classifying the elements in the transformed test data set according to a second classifier, wherein the second classifier is generated by applying a second machine learning technique to the transformed training data set and the training class set; and

    (iii) storing, by the biomarker generator, when the first test class set and the second test class set differ, the second test class set as the first test class set and the transformed test data set as the test data set and returning to step (i); and

    (g) outputting, by the biomarker generator, when the first test class set is the same as the second test class set, the second test class set.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×