Accelerating identification of single nucleotide polymorphisms and alignment of clones in genomic sequencing
Abstract
The present invention is directed to a method of assembling genomic maps of an organism's DNA or portions thereof. A library of an organism's DNA is provided where the individual genomic segments or sequences are found on more than one clone in the library. Representations of the genome are created, and nucleic acid sequence information is generated from the representations. The sequence information is analyzed to determine clone overlap from a representation. The clone overlap and sequence information from different representations are combined to assemble a genomic map of the organism. Once the genomic map is obtained, genomic sequence information from multiple individuals can be applied to the map and compared with one another to identify single nucleotide polymorphisms. These single nucleotide polymorphisms can be detected, and alleles quantified, by conducting (1) a global PCR amplification which creates a genome representation, and (2) a ligation detection reaction process whose ligation products are captured by hybridization to a support.
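The comparison step mentioned in the abstract (sequence information from multiple individuals applied to the map and compared to find single nucleotide polymorphisms) can be illustrated with a minimal sketch. It assumes the sequences have already been placed on the same map coordinates and are of equal length; the function name `candidate_snps`, the individual labels, and the use of `N` for an uncalled base are illustrative assumptions, not part of the patent. It does not model the PCR or ligation detection reaction chemistry.

```python
def candidate_snps(aligned):
    """Report positions where aligned sequences disagree.

    `aligned` maps a (hypothetical) individual label to a sequence.
    All sequences are assumed to share the same map coordinates.
    """
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("sequences must be aligned to the same length")

    snps = {}
    for pos, column in enumerate(zip(*aligned.values())):
        # Ignore uncalled bases (assumed to be written as "N").
        alleles = set(column) - {"N"}
        if len(alleles) > 1:
            snps[pos] = sorted(alleles)
    return snps


example = {
    "individual_1": "ACGTAC",
    "individual_2": "ACGTAC",
    "individual_3": "ACATAC",
}
result = candidate_snps(example)
print(result)
```

A position is reported only when at least two distinct called bases are seen, so a column containing one base plus uncalled positions is not flagged.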
20 Claims
1. A method of assembling a map of an organism's genome or portions thereof comprising:
providing a library of an organism's DNA, wherein individual genomic segments are found in more than one clone in the library with at least some of the said individual genomic segments have overlapping DNA sequences;
creating representations of the genomic segments in individual clones by selecting a subpopulation of genomic segments out of a larger set of the genomic segments in each of the individual clones, wherein said selecting a subpopulation of genomic segments comprises;
subjecting more than one individual clone to a restriction endonuclease, wherein the restriction endonuclease is effective in recognizing a restriction site recognition sequence and cleaving the DNA in said more than one individual clone at a restriction site of the restriction site recognition sequence thereby creating a plurality of clone fragments having 2 base overhangs and adding 1 to 12 linker-adapters, each of which is non-palindromic, to the overhangs in the presence of ligase and the restriction endonuclease and producing a plurality of clone fragments comprising said 1 to 12 linker-adapters, wherein said plurality of clones fragments comprising said 1 to 12 linker-adapters are selected as the representations, wherein the linker-adapters contain single stranded overhangs of a formula NN/N′N′ where said NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC;
generating DNA sequence information from the representations;
analyzing the DNA sequence information thereby determining clone overlap from the representations; and
combining said clone overlap and DNA sequence information from the representations thereby assembling a map of the organism's genome or portions thereof.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
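The group of overhangs recited in claim 1 can be checked with a short sketch. Read as an overhang and its reverse complement, the six NN/N′N′ pairs appear to be exactly the two-base overhangs that are not their own reverse complement, each paired with its partner. This reading is an observation about the listed pairs, not a statement taken from the claim, and the function names below are illustrative.

```python
from itertools import product

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def non_palindromic_overhang_pairs():
    """Enumerate two-base overhangs paired with their reverse complement.

    Overhangs equal to their own reverse complement (AT, TA, CG, GC)
    are palindromic and are skipped.
    """
    pairs = set()
    for first, second in product("ACGT", repeat=2):
        overhang = first + second
        partner = reverse_complement(overhang)
        if overhang == partner:
            continue
        # Store each pair once, in alphabetical order.
        pairs.add(tuple(sorted((overhang, partner))))
    return sorted(pairs)


PAIRS = non_palindromic_overhang_pairs()
for overhang, partner in PAIRS:
    print(f"{overhang}/{partner}")
```

Of the 16 possible dinucleotides, 4 are palindromic and the remaining 12 collapse into 6 pairs, which matches the count of the group in the claim (with GG/CC printed here as CC/GG).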
13. A method of assembling a map of an organism's genome or portions thereof comprising:
providing a library of an organism's DNA, wherein individual genomic segments are found in more than one clone in the library;
creating representations of the genomic segments in individual clones by selecting a subpopulation of genomic segments out of a larger set of the genomic segments in each of the individual clones, wherein said selecting a subpopulation of genomic segments comprises;
subjecting more than one individual clone to a restriction endonuclease, wherein the restriction endonuclease is effective in recognizing a restriction site recognition sequence and cleaving the DNA in said more than one individual clone at a restriction site of the restriction site recognition sequence thereby creating a plurality of clone fragments having 2 base overhangs and adding linker-adapters, each of which is non-palindromic, to the overhangs in the presence of ligase and the first restriction endonuclease and producing a plurality of clone fragments comprising said linker-adapters, wherein said plurality of clone fragments comprising said linker-adapters are selected as the representations, wherein the linker-adapters contain single stranded overhangs of a formula NN/N′N′ where said NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC;
generating DNA sequence information from the representations;
analyzing the DNA sequence information thereby determining clone overlap from the representations, wherein said analyzing the DNA sequence information comprises;
analyzing sequencing data generated by deconvoluting one or more singlet, doublet and/or triplet sequences contained in the representations; and
combining said clone overlap and DNA sequence information from the representations thereby assembling a map of the organism's genome or portions thereof.
Dependent claims: 14, 15, 16, 17, 18, 19
20. A method of assembling a map of an organism's genome or portions thereof comprising:
providing a library of an organism's DNA, wherein individual genomic segments are found in more than one clone in the library with at least some of the said individual genomic segments have overlapping DNA sequences;
creating representations of the genomic segments in individual clones by selecting a subpopulation of genomic segments out of a larger set of the genomic segments in each of the individual clones, wherein said selecting a subpopulation of genomic segments comprises;
subjecting more than one individual clone to a restriction endonuclease, wherein the restriction endonuclease is effective in recognizing a restriction site recognition sequence and cleaving the DNA in said more than one individual clone at a restriction site of the restriction site recognition sequence thereby creating a plurality of clone fragments having an overhang; and
adding a linker-adapter which is non-palindromic to the overhang in the presence of ligase and the restriction endonuclease and producing a plurality clone fragments comprising said linker-adapters, wherein said plurality of clone fragments comprising said linker-adapters are selected as the representations, wherein the linker-adapters contain single stranded overhangs of a formula NN/N′N′ where NN/N′N′ is selected from the group consisting of AA/TT, AC/GT, AG/CT, CA/TG, GA/TC, and GG/CC;
generating DNA sequence information from the representations;
analyzing the DNA sequence information thereby determining clone overlap from the representations; and
combining said clone overlap and DNA sequence information from the representations thereby assembling a map of the organism's genome or portions thereof, wherein said combining said clone overlap and DNA sequence information comprises;
comparing the DNA sequence information in a pair of said plurality of clone fragments comprising said linker-adaptors in the representations which have identical contiguous portions.
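The final comparing step of claim 20 (finding a pair of fragment sequences with identical contiguous portions) can be sketched as a longest-common-substring search. This is only one possible way to do such a comparison, not the procedure the patent specifies. The function names and the `min_run` threshold are hypothetical, and the example sequences are made up.

```python
def longest_shared_run(seq_a, seq_b):
    """Return the longest contiguous substring present in both sequences.

    Uses the standard dynamic-programming recurrence: the cell for
    (i, j) holds the length of the shared run ending at seq_a[i-1]
    and seq_b[j-1].
    """
    best_len, best_end = 0, 0
    prev = [0] * (len(seq_b) + 1)
    for i in range(1, len(seq_a) + 1):
        curr = [0] * (len(seq_b) + 1)
        for j in range(1, len(seq_b) + 1):
            if seq_a[i - 1] == seq_b[j - 1]:
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len, best_end = curr[j], i
        prev = curr
    return seq_a[best_end - best_len:best_end]


def fragments_overlap(seq_a, seq_b, min_run=8):
    """Treat two fragments as overlapping if they share a long enough run.

    `min_run` is an arbitrary illustrative threshold.
    """
    return len(longest_shared_run(seq_a, seq_b)) >= min_run


shared = longest_shared_run("TTGACCATGGCA", "ACCATGGTT")
print(shared)
```

In practice a threshold would have to be long enough that a shared run is unlikely to occur by chance; the value used here is a placeholder.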
Specification