×

Data analysis and predictive systems and related methodologies

  • US 9,195,949 B2
  • Filed: 03/30/2015
  • Issued: 11/24/2015
  • Est. Priority Date: 10/15/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method of decreasing a risk of disease in a person x, comprising:

  • (A) obtaining a single nucleotide polymorphism (SNP) transductive model Mx suitable for use in data analysis, wherein the risk of disease specific to the person x is represented as input vector x, which comprises a plurality of variable features in relation to the risk of disease for which there is a global dataset D of samples also having the same variable features relating to the risk of disease as input vector x, and for which an outcome is known,(B) optimizing the transductive model bya) determining what number and a subset Vx of variable features of input vector x will be used in assessing an outcome for the input vector x;

    b) determining what number Kx of samples from within the global data set D will form a neighborhood about input vector x;

    c) selecting suitable Kx samples from the global data set which have the variable features that most closely accord to the variable features of the person x to form the neighborhood Dx;

    d) ranking the Vx variable features within the neighborhood Dx in order of importance to the outcome and obtaining a weight vector Wx for all variable features Vx;

    e) creating a prognostic transductive model Mx for each input vector x, having a set of model parameters Px and the other parameters Vx and Kx from elements a)-d);

    f) testing an accuracy of the model Mx for each sample from Dx by a method selected from the group consisting of;

    (i) calculating Wx as normalized SNR (Signal-to-Noise Ratio) coefficients and sorting the variables in descending order;

    V1, V2, . . . , Vv, where;

    w1>

    =w2>

    = . . . >

    =wv, calculated as follows;


    wl=abs(Ml(class 1,x)

    M
    l(class 2,x))/(Stdl(class1)+Stdlclass2));

    (ii) testing for a plurality of variables Vx a plurality of possible combinations of values of their weights Wx tested through a search to increase the overall accuracy of a model built on the data Dx;

    (iii) applying a genetic statistical analysis procedure, if the number of variables prevents using method (ii) above;

    (iv) applying a quantum inspired evolutionary statistical analysis technique, to select the optimal variable set Vx for every new input vector x and to weigh the variables through a probability wave function;

    g) storing both the accuracy and the set of model parameters;

    h) repeating elements a) and/or b) while applying an optimization procedure to optimize Vx and/or Kx, to determine their optimal values, before repeating elements c)-h) until the accuracy is maximized, wherein a number and a subset Vx of variable features of input vector x, and a number Kx of samples from within the global data set D that form a neighborhood about input vector x are determined anew each time elements a) and b) are repeated while applying an optimization procedure to optimize Vx and/or Kx;

    (C) creating a SNP profile of sample x from patient x and a corresponding gene profile by mapping the SNPs from a final set Vx into genes;

    (D) determining the risk of disease specific to the patient x using the optimized transductive model Mx by;

    (I) forming a vector;

    Fx={Vx,Wx,Kx,Dx,Mx,Px,t}, where the variable t represents the time of the model Mx creation;

    (II) calculating the weighted distance D(Fx,Fd) as an aggregated indication of how much a person'"'"'s profile should change to reach an average desired profile Fd;


    D(Fx,Fd)=Σ

    l=1,vabs(Vlx

    V
    ld

    wl;

    (III) designing a vector of required variable changes, defined as;


    deltaFx,d=(deltaVlx,d),for l−

    1,v
    as follows;





    (20)
    deltaVlx,d=Vlx

    V
    ld, with an importance of;

    wl



    (21)(E) modifying variable features Vx in the patient x to be closer to Kx values associated with an improved outcome relative to a prognostic outcome y determined for the patient x so as to improve the prognostic outcome of the patient x;

    (F) repeating elements a) through h) to determine an improved prognostic outcome using re-optimized transductive model Mx; and

    (G) creating a scenario for treatment/drug design that includes a set of SNPs/genes and required changes for the person x to match in future, average profiles of control samples from Dx in order to decrease the risk of disease.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×