Methods for high throughput genotyping
First Claim
1. A method for calling the genotype of a sample at a selected polymorphism in a sample using a genotyping array, comprising:
(a) obtaining intensity measurements for allele A and for allele B for a plurality of polymorphisms in a plurality of training samples, wherein the genotype of each polymorphism in the plurality in each training sample is known, wherein the intensity measurements represent intensity of signal associated with one or more features on said genotyping array;
(b) making a genotype call for each of said polymorphisms in each of the training samples using the intensity measurements for allele A and for allele B obtained in (a);
(c) comparing the genotype call with the known genotype to identify individuals where the correct genotype call was made;
(d) using the intensity measurements from the individuals identified in (c) to calculate a ratio of intensity measurement for allele A to intensity measurement for allele B, for the training samples for each sub-group of AA, AB and BB to obtain an AA reference ratio, an AB reference ratio and a BB reference ratio for each of said polymorphisms;
(e) hybridizing a test sample to a second copy of the genotyping array to obtain hybridization intensity values for the A allele and for the B allele for each of said polymorphisms in the test sample;
(f) calculating a ratio of the intensity measurement for the A allele to the B allele for each of said polymorphisms in the test sample and comparing the ratio to the reference ratios for AA, AB and BB obtained for that polymorphism in (d) to obtain a p-value that the polymorphism is either AA, AB, or BB;
(g) identifying a subset of the polymorphisms in the test sample that are likely to be AB, wherein a polymorphism is identified as being likely to be AB if the p-value that the polymorphism is AB is greater than 0.4;
(h) adjusting the intensity measurement of the B allele by the reference ratio for the AB group for that polymorphism from the training set to obtain an adjusted intensity measurement for the B allele, for each polymorphism in the subset of polymorphisms identified in (g); and
(i) generating a genotype call for each polymorphism identified in (g) using the adjusted intensity measurement for the B allele.
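Steps (c), (d) and (f) through (h) of the claim can be illustrated with a minimal Python sketch. Several details here are assumptions, not taken from the claim: the per-group reference ratio is taken as the median of the per-sample A/B ratios, the "p-value" is replaced by a stand-in score (normalized Gaussian weights on log-ratio distance, with an arbitrary `sigma`), and "adjusting by the reference ratio" is taken to mean multiplying the B intensity by the AB reference ratio. The function names and the dictionary layout of a sample are hypothetical. Step (i) is omitted because the claim does not name the calling algorithm.

```python
import math
from statistics import median


def reference_ratios(training):
    """Steps (c)-(d): per-genotype reference A/B ratio for one polymorphism.

    `training` is a list of dicts with keys "known", "call", "a", "b"
    (hypothetical layout). Only samples whose array call matched the
    known genotype contribute. The median is an assumed summary statistic.
    """
    groups = {"AA": [], "AB": [], "BB": []}
    for sample in training:
        if sample["call"] == sample["known"]:  # step (c): concordant calls only
            groups[sample["known"]].append(sample["a"] / sample["b"])
    return {g: median(r) for g, r in groups.items() if r}


def genotype_scores(ratio, refs, sigma=0.5):
    """Stand-in for step (f): normalized Gaussian weights on log-ratio distance.

    This is NOT the p-value computation of the claim, which is not
    specified there; it only yields scores that sum to 1.
    """
    weights = {
        g: math.exp(-((math.log(ratio) - math.log(r)) ** 2) / (2 * sigma ** 2))
        for g, r in refs.items()
    }
    total = sum(weights.values())
    return {g: w / total for g, w in weights.items()}


def adjust_b_if_likely_ab(a, b, refs, threshold=0.4):
    """Steps (g)-(h): rescale B only when the AB score exceeds the threshold."""
    scores = genotype_scores(a / b, refs)
    if scores.get("AB", 0.0) > threshold:
        # Assumed form of the adjustment: multiply B by the AB reference
        # ratio, so that A / B_adjusted is near 1 for a typical heterozygote.
        return b * refs["AB"]
    return b
```

Under these assumptions, the adjusted B intensity returned by `adjust_b_if_likely_ab` would be passed, together with the unchanged A intensity, to whatever calling algorithm is used for step (i).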
Abstract
Methods for genotyping polymorphisms using allele specific probes are disclosed. A training set is used to generate a model for each polymorphism to be interrogated. The training set is used to obtain an estimate of the asymmetry between an intensity measurement for a first allele and an intensity measurement for a second allele of the same polymorphism. The intensity measurement obtained for a test sample is adjusted using the estimate of asymmetry prior to using the intensity measurements to make a genotyping call. In preferred embodiments the adjustment is applied to polymorphisms that have a likelihood of being heterozygous that is above a specified threshold.
13 Claims
7. A method for genotyping a polymorphism in a test individual comprising:
(a) obtaining allele specific intensity measurements for a first allele of the polymorphism and for a second allele of the polymorphism in a training set comprising a plurality of individuals of known genotype for said polymorphism;
(b) grouping the individuals in the training set into groups according to genotype, wherein each individual is assigned to one of the following groups: homozygous for the first allele, homozygous for the second allele or heterozygous;
(c) calculating a summary ratio of the intensity measurements for said first allele to the intensity measurements for said second allele for each group to obtain a first summary ratio for the group that are homozygous for said first allele, a second summary ratio for the group that are homozygous for said second allele and a third summary ratio for the group that are heterozygous;
(d) obtaining an allele specific intensity for said first allele and for said second allele in said test individual and calculating a test ratio for said test individual;
(e) adjusting the allele specific intensity value for said first allele according to the third summary ratio obtained in (c) if said test ratio is closer to the third summary ratio than to the first or second summary ratios; and
(f) making a genotype call for said polymorphism using the adjusted value obtained in (e) for said first allele and the intensity value obtained in (d) for the second allele, wherein steps (b), (c) and (e) are performed by a computer and wherein the computer outputs the genotype call for said polymorphism in a readable format.
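The grouping, summary-ratio and conditional-adjustment steps of claim 7 can be sketched in Python as follows. This is an illustration under stated assumptions: the summary ratio is taken as the median of per-individual ratios, "closer" is measured on a log scale, and the adjustment divides the first-allele intensity by the heterozygous summary ratio. None of these choices is specified in the claim, and the group labels `hom1`, `hom2` and `het` and the function names are hypothetical.

```python
import math
from statistics import median


def summary_ratios(training):
    """Group by known genotype, then summarize first/second allele ratios.

    `training` is a list of (genotype, first_intensity, second_intensity)
    tuples, with genotype one of "hom1", "hom2", "het" (hypothetical labels).
    The median is an assumed summary statistic.
    """
    groups = {}
    for genotype, first, second in training:
        groups.setdefault(genotype, []).append(first / second)
    return {g: median(r) for g, r in groups.items()}


def adjust_first_allele(first, second, summaries):
    """Adjust the first-allele intensity only if the test ratio is closest
    to the heterozygous summary ratio; otherwise return it unchanged."""
    test_ratio = first / second
    # Assumed distance: absolute difference of log ratios.
    nearest = min(
        summaries,
        key=lambda g: abs(math.log(test_ratio) - math.log(summaries[g])),
    )
    if nearest == "het":
        # Assumed form of the adjustment: divide by the heterozygous summary
        # ratio, so adjusted_first / second is near 1 for a typical heterozygote.
        return first / summaries["het"]
    return first
```

The returned value and the unadjusted second-allele intensity would then be the inputs to the final genotype call, whose algorithm the claim leaves open.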
9. A method for calling the genotype of a sample at a selected polymorphism in a sample using a genotyping array, comprising:
(a) obtaining hybridization intensity values for said genotyping array for each of a set of training samples comprising a plurality of training samples of known genotype;
(b) making a genotype call for each of a plurality of SNPs in each of the training samples using the hybridization intensity values from individual probe quartets;
(c) comparing the genotype call with the known genotype for each probe quartet to identify a plurality of K best probe quartets for each SNP, where K is at least 1, wherein probe quartets are selected as best probe quartets if the genotype call made using said quartet has high concordance with the known genotype for that SNP;
(d) calculating a distribution of (intensity A)/(intensity B) for the training samples for each sub-group of AA, AB and BB to obtain an AA reference distribution, an AB reference distribution and a BB reference distribution;
(e) hybridizing a test sample to the genotyping array to obtain hybridization intensity values for said K best probe quartets for the selected polymorphism;
(f) calculating (intensity A)/(intensity B) for each of the K best probe quartets for the selected polymorphism to obtain a ratio value for each and comparing each ratio value with the AA reference distribution, the AB reference distribution and the BB reference distribution for the polymorphism to obtain a p-value that the selected polymorphism is AB;
(g) adjusting intensity B by the (intensity A)/(intensity B) ratio from the AB group from the reference set to obtain an adjusted allele B intensity if the p-value that the selected polymorphism is AB is greater than 0.4; and
(h) using the adjusted intensity B value to generate a genotype call for the selected polymorphism using a selected algorithm.
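Two parts of claim 9 differ from claim 1 and lend themselves to a short Python sketch: ranking probe quartets by concordance with the known genotypes, and scoring a test ratio against a reference distribution. Both functions below are illustrative assumptions. The claim does not define "high concordance" numerically, so the sketch simply keeps the top K quartets by fraction of matching calls, and the empirical tail fraction used as a p-value is one plausible choice, not the one stated in the claim. All names and data layouts are hypothetical.

```python
from statistics import median


def k_best_quartets(calls, known, k=1):
    """Rank probe quartets by concordance with known genotypes.

    `calls` maps quartet id -> {sample id: called genotype};
    `known` maps sample id -> known genotype.
    Returns the ids of the k quartets with the highest concordance.
    """
    def concordance(quartet_calls):
        matches = sum(1 for s, g in known.items() if quartet_calls.get(s) == g)
        return matches / len(known)

    ranked = sorted(calls, key=lambda q: concordance(calls[q]), reverse=True)
    return ranked[:k]


def empirical_p(ratio, reference):
    """Assumed empirical p-value: the fraction of reference ratios lying at
    least as far from the reference median as the test ratio does."""
    center = median(reference)
    distance = abs(ratio - center)
    return sum(1 for r in reference if abs(r - center) >= distance) / len(reference)
```

With an AB reference distribution in hand, a test ratio whose `empirical_p` against that distribution exceeds 0.4 would trigger the intensity B adjustment of step (g).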
Specification