Methods for obtaining and using haplotype data
First Claim
1. A method of determining polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, comprising:
- (a) providing a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each individual polymorphism in the haplotypes for the degree to which it correlates with the outcome of interest and calculating a numerical measure of the degree of correlation;
(c) saving each individual polymorphism whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the numerical measure of the degree of correlation;
(f) saving each n-site sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) trough (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, computer program(s) and database(s) to analyze and make use of gene haplotype information. These include methods, program, and database to find and measure the frequency of haplotypes in the general population; methods, program, and database to find correlation'"'"'s between an individual'"'"'s haplotypes or genotypes and a clinical outcome; methods, program, and database to predict an individual'"'"'s haplotypes from the individual'"'"'s genotype for a gene; and methods, program, and database to predict an individual'"'"'s clinical response to a treatment based on the individual'"'"'s genotype or haplotype.
-
Citations
37 Claims
-
1. A method of determining polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, comprising:
-
(a) providing a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each individual polymorphism in the haplotypes for the degree to which it correlates with the outcome of interest and calculating a numerical measure of the degree of correlation;
(c) saving each individual polymorphism whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the numerical measure of the degree of correlation;
(f) saving each n-site sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) trough (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of determining polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, comprising:
-
(a) providing a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort, the locus comprising at least two polymorphic sites;
(b) statically analyzing each haplotype for the degree to which it correlates with the outcome of interest, and calculating a numerical measure of the degree of correlation;
(c) saving each haplotype whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) from each haplotype composed of m polymorphic sites, generating all possible sub-haplotypes having a single site masked, so as to provide a set of sub-haplotypes having (m-n) sites, where n=1;
(e) statistically analyzing each newly generated sub-haplotype for the degree to which it correlates with the outcome of interest, and calculating the numerical measure of the degree of correlation;
(f) saving each sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) from the saved sub-haplotypes, generating all possible sub-haplotypes having one additional site masked; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes or (ii) no further sub-haplotypes having more unmasked sites than a pre-selected limit can be generated. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to execute a method to determine polymorphisms or sub-haplotypes for a locus that correlate with an outcome of interest, the method comprising:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects, and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statically analyzing each individual polymorphism in the haplotypes for the degree to which it correlates with the outcome of interest and calculating a numerical measure of the degree of correlation;
(c) storing each individual polymorphism whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the numerical measure of the degree of correlation;
(f) storing each n-site sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated. - View Dependent Claims (16, 17, 18)
-
-
19. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to execute a method to determine polymorphic sits or sub-haplotypes for a locus that correlate with an outcome of interest, the method comprising:
-
(a) accessing a database comprising a haplotype from each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each individual polymorphism in the haplotypes for the degree to which it correlates with the outcome of interest and calculating the p-value for the correlation;
(c) storing each individual polymorphism whose p-value does not exceed a first cut-off value and its p-value;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the p-value for the correlation;
(f) storing each n-site sub haplotype whose p-value does not exceed the first cut-off value and its p-value;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new subhaplotypes with increased values of n; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated.
-
-
20. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, the method comprising:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each haplotype for the degree to which it correlates with the outcome of interest, and calculating a numerical measure of the degree of correlation;
(c) storing each haplotype whose numerical measure of the degree of correlation meets a first cut-off value criterion and its numerical measure of the degree of correlation;
(d) generating, from each haplotype composed of m polymorphic sites, all possible sub-haplotypes having a single site masked, so as to provide a set of m-n site sub-haplotypes where n=1;
(e) statistically analysing each newly generated sub-haplotype for the degree to which it correlates with the outcome of interest, and calculating the numerical measure of the degree of correlation;
(f) storing each sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) generating, from the stored sub-haplotypes, all possible sub-haplotypes having one additional site masked; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes, or (ii) no further sub-haplotypes having more unmasked sites than a pre-selected limit can be generated. - View Dependent Claims (21)
-
-
22. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, the method comprising:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each haplotype for the degree to which it correlates with the outcome of interest, and calculating the p-value for the correlation;
(c) storing each haplotype whose p-value does not exceed a first cut-off value and its p-value;
(d) generating, for each haplotype composed of m polymorphic sites, all possible sub-haplotypes having a single site masked, so as to provide a set of m-n site sub-haplotypes where n=1;
(e) statistically analyzing each newly generated sub-haplotype for the degree to which it correlates with the outcome of interest, and calculating the p-value for the correlation;
(f) storing those sub-haplotypes whose p-value does not exceed the first out-off value and its p-value;
(g) generating, from the stored sub-haplotypes, all possible sub-haplotypes having one additional site masked; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes, or (ii) no further sub-haplotypes having more unmasked sites than a pre selected limit can be generated. - View Dependent Claims (23, 24)
-
-
25. A computer programmed to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlates with an outcome of interest, the computer comprising a memory having at least one region for storing computer executable program code and a processor for executing the program code stored in memory wherein the method comprises:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each individual polymorphism in the haplotype for the degree to which it correlates with the outcome of interest and calculating a numerical measure of the degree of correlation;
(c) saving each individual polymorphism whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) generating all possible pair-wise combinations of the saved individual polymorphism so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the numerical measure of the degree of correlation;
(f) saving each n-site sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion ad its numerical measure of the degree of correlation;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated. - View Dependent Claims (26, 27, 28)
-
-
29. A computer programmed to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, the computer comprising a memory having at least one region for storing computer executable program code and a processor for executing the program code stored in memory, wherein the method comprises:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymeric sites;
(b) statistically analyzing each individual polymorphism in the haplotype for the degree to which it correlates with the outcome of interest and calculating a p-value for the correlation;
(c) saving each individual polymorphism whose p-value does not exceed a first cut-off value and its p-value;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statistically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the p-value for the correlation;
(f) saving each n-site sub-haplotype whose p-value does not exceed the first cut-off value and its p-value;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes can be generated, or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated.
-
-
30. A computer programmed to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, the computer comprising a memory having at least one region for storing computer executable program code and a processor for executing the program code stored in memory, wherein the method comprises:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each haplotype for the degree to which it correlates with the outcome of interest, and calculating a numerical measure of the degree of correlation;
(c) saving each haplotype whose numerical measure of the degree of correlation meets a first cut-off criterion and its numerical measure of the degree of correlation;
(d) generating, from each haplotype composed of m polymorphic sites, all possible sub-haplotypes having a single site masked, so as to provide a set of m-n site sub-haplotypes where n=1;
(e) statistically analyzing each newly generated sub-haplotype for the degree to which it correlates with the outcome of interest, and calculating the numerical measure of the degree of correlation;
(f) saving each sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion and its numerical measure of the degree of correlation;
(g) generating, form the saved sub-haplotypes, all possible sub-haplotypes having one additional site masked; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes, or (ii) no further sub-haplotypes having more unmasked sites than a pre-selected limit can be generated. - View Dependent Claims (31, 34)
-
-
32. A computer programmed to execute a method to determine polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, the computer comprising a memory having at least one region for storing computer executable program code and a processor for executing the program code stored in memory, wherein the method comprises:
-
(a) accessing a database comprising a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each haplotype for the degree to which it correlates with to outcome of interest, and calculating a p-value for the correlation;
(c) saving each haplotype whose p-value does not exceed a first cut-off value and its p-value;
(d) generating, from each haplotype composed of m polymorphic sites, all possible sub-haplotypes having a single site masked, so as to provide a set of m-n site sub-haplotypes where n=1;
(e) statistically analyzing each newly generated sub-haplotype for the degree to which it correlates with the outcome of interest, and calculating the p-value for the correlation;
(f) saving each sub-haplotype whose p-value does not exceed the first cut-off value and its p-value;
(g) generating, from the saved sub-haplotypes, all possible sub-haplotypes having one additional site masked; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes or (ii) no further sub-haplotypes having more unmasked sites than a pre-selected limit can be generated. - View Dependent Claims (33)
-
-
35. A method of determining polymorphic sites or sub-haplotypes for a locus that correlate with an outcome of interest, comprising:
-
(a) providing a haplotype for each allele of the locus for each subject in a cohort of subjects and an outcome value for the outcome of interest for each subject in the cohort of subjects, the locus comprising at least two polymorphic sites;
(b) statistically analyzing each individual polymorphism in the haplotypes for the degree to which it correlates with the outcome of interest and calculating a numerical measure of the degree of correlation;
(c) saving each individual polymorphism whose numerical measure of the degree of correlation meets a first cut-off criterion for statistical significance;
(d) generating all possible pair-wise combinations of the saved individual polymorphisms so as to provide a set of n-site sub-haplotypes where n=2;
(e) statically analyzing each newly generated n-site sub-haplotype for the degree to which it correlates with the outcome of interest and calculating the numerical measure of the degree of correlation;
(f) saving each n-site sub-haplotype whose numerical measure of the degree of correlation meets the first cut-off criterion;
(g) generating all possible pair-wise combinations among and between the saved individual polymorphisms and saved sub-haplotypes, to produce new sub-haplotypes with increased values of n; and
(h) repeating steps (e) through (g) until either (i) no new sub-haplotypes or (ii) no new sub-haplotypes having n less than a pre-selected limit can be generated. - View Dependent Claims (36, 37)
-
Specification