×

Feature selection for efficient epistasis modeling for phenotype prediction

  • US 10,108,775 B2
  • Filed: 09/18/2013
  • Issued: 10/23/2018
  • Est. Priority Date: 01/21/2013
  • Status: Active Grant
First Claim
Patent Images

1. An information processing system for reducing search time of features in sample data and reducing computation time when training a model for generating specialized data based on the features, the information processing system comprising:

  • a memory;

    a processor communicatively coupled to the memory; and

    a feature selection circuit coupled to the memory and the processor, wherein the feature selection circuit is configured to perform a plurality of operations comprising;

    receiving a set of genetic markers and a phenotype from at least one external information processing system;

    training an epistasis effect model based on the set of genetic markers and the phenotype;

    reducing computation time of the information processing system during the training of the epistasis effect model and further reducing feature selection time of the epistasis effect model, wherein reducing the computation time and the feature selection time comprises;

    determining, for each of the set of genetic markers, a relevance score with respect to the phenotype according to I(xjtraining;

    ctraining), where I is mutual information between a given genetic marker xj and a phenotype c, where mutual information I between two variables x and y is defined, based on their joint marginal probabilities p(x) and p(y) and probabilistic distribution p(x, y), as;

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×