Systems and methods for probe design to detect the presence of simple and complex indels
First Claim
Patent Images
1. A method of providing a genotyping array for genotyping a plurality of complex or multi-base indels within one or more sample genomes, the method comprising:
- accessing a data set of one or more genome databases, said data set comprising information regarding genomic locations of the plurality of complex or multi-base indels (insertion/deletion) and sequences flanking at least one direction of the indels;
designing a plurality of indel probes to generate a plurality of designed indel probes for inclusion on a genotyping array by using one or more computer processors coupled to one or more memories configured to provide the one or more computer processors with instructions to, for each complex or multi-base indel of the plurality of complex or multi-base indels;
(a) access or generate a probe sequence that is complementary to a target sequence flanking the complex or multi-base indel;
(b) compare a deletion allele interrogation base to an insertion allele interrogation base in an indel identification sequence, the deletion allele interrogation base and the insertion allele interrogation base being referred to collectively as the interrogation bases, and,(c) in response to the interrogation bases being different and being in different detection channels, create and output a single indel probe adapted for using the interrogation bases to differentiate the deletion allele and the insertion allele; and
(d) in response to the compared interrogation bases being a same interrogation base, incorporate the same interrogation base into the probe sequence and set new interrogation bases to be compared based on a next downstream base in a deletion allele sequence and a next downstream base in an insertion allele sequence and repeat (b)-(c) until either an interrogation distance threshold would be exceeded or the interrogation bases are different and are in different detection channels, and, when the interrogation bases are different and are in different detection channels, create and output a single indel probe adapted for using the interrogation bases to differentiate the deletion allele and the insertion allele;
synthesizing a plurality of indel probes of the genotyping array according to the plurality of designed indel probes; and
providing, with the genotyping array, labelled nucleotide bases corresponding to the interrogation bases for adding to a corresponding probe of the plurality of synthesized probes after the corresponding probe is hybridized with a sample during a genotyping assay, interrogation bases corresponding to some of the plurality of synthesized indel probes having different interrogation distances than do interrogation bases corresponding to others of the plurality of synthesized indel probes, a complex or multi-base indel of the plurality of complex or multi-base indels being genotyped by detecting labels corresponding to the labelled nucleotide bases.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for the determination of a collection of relevant single nucleotide polymorphisms (SNP) probe compatible insertion/deletion probes across a genome to determine probes that can detect a variety of insertions and deletions.
45 Citations
15 Claims
-
1. A method of providing a genotyping array for genotyping a plurality of complex or multi-base indels within one or more sample genomes, the method comprising:
-
accessing a data set of one or more genome databases, said data set comprising information regarding genomic locations of the plurality of complex or multi-base indels (insertion/deletion) and sequences flanking at least one direction of the indels; designing a plurality of indel probes to generate a plurality of designed indel probes for inclusion on a genotyping array by using one or more computer processors coupled to one or more memories configured to provide the one or more computer processors with instructions to, for each complex or multi-base indel of the plurality of complex or multi-base indels; (a) access or generate a probe sequence that is complementary to a target sequence flanking the complex or multi-base indel; (b) compare a deletion allele interrogation base to an insertion allele interrogation base in an indel identification sequence, the deletion allele interrogation base and the insertion allele interrogation base being referred to collectively as the interrogation bases, and, (c) in response to the interrogation bases being different and being in different detection channels, create and output a single indel probe adapted for using the interrogation bases to differentiate the deletion allele and the insertion allele; and (d) in response to the compared interrogation bases being a same interrogation base, incorporate the same interrogation base into the probe sequence and set new interrogation bases to be compared based on a next downstream base in a deletion allele sequence and a next downstream base in an insertion allele sequence and repeat (b)-(c) until either an interrogation distance threshold would be exceeded or the interrogation bases are different and are in different detection channels, and, when the interrogation bases are different and are in different detection channels, create and output a single indel probe adapted for using the interrogation bases to differentiate the deletion allele and the insertion allele; synthesizing a plurality of indel probes of the genotyping array according to the plurality of designed indel probes; and providing, with the genotyping array, labelled nucleotide bases corresponding to the interrogation bases for adding to a corresponding probe of the plurality of synthesized probes after the corresponding probe is hybridized with a sample during a genotyping assay, interrogation bases corresponding to some of the plurality of synthesized indel probes having different interrogation distances than do interrogation bases corresponding to others of the plurality of synthesized indel probes, a complex or multi-base indel of the plurality of complex or multi-base indels being genotyped by detecting labels corresponding to the labelled nucleotide bases. - View Dependent Claims (2, 3)
-
-
4. A method for genotyping a plurality of complex or multi-base indels within one or more sample genomes comprising:
-
for each synthesized indel probe of a plurality of synthesized indel probes, adding one or more nucleotides or oligonucleotides including a label to the synthesized indel probe after hybridization with a sample, the one or more nucleotides or oligonucleotides being added corresponding to an interrogation base; and genotyping a complex or multi-base indel of the plurality of complex or multi-base indels by detecting a respective label corresponding to a respective one or more nucleotides or oligonucleotides added to a respective synthesized indel probe of the plurality of synthesized indel probes; wherein the plurality of synthesized probes have been designed by steps comprising; accessing a data set of one or more genome databases, said data set comprising information regarding genomic locations of the one or more complex or multi-base indels (insertion/deletion) and sequences flanking at least one direction of the indels; for each of the plurality of indel probes to be designed, designing the indel probe by; (a) generating a probe sequence that is complementary to a target sequence flanking the complex or multi-base indel, and comparing interrogation bases in an indel identification sequence for a deletion allele probe and for an insertion allele probe; and (b) if the compared interrogation bases are different and are in different detection channels, creating and outputting a single indel probe using the interrogation bases to differentiate the indel alleles and END; (c) if the compared interrogation bases are different and in the same detection channel, EXIT (a)-(c); (d) if the compared interrogation bases are the same, repeating beginning at step (a), but setting the interrogation base one base downstream of the previously compared interrogation base as a current compared interrogation base and increasing an interrogation distance counter by one until a base after the indel identification sequence differs from a prior base of the indel and when the interrogation bases, after the downstream setting, are different and are in different detection channels, creating and outputting a single indel probe using the interrogation bases to differentiate the deletion allele and the insertion allele; and (e) repeating steps a-c until (b OR c) is TRUE or until an interrogation distance threshold is exceeded; wherein designing results in a plurality of designed indel probes such that interrogation bases corresponding to some of the plurality of designed indel probes have different interrogation distances than do interrogation bases corresponding to others of the plurality of designed indel probes. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification