Selecting tag nucleic acids
Abstract
Methods of selecting tag nucleic acids and VLSIPS™ arrays and the arrays made by the methods are used to label and track compositions, including cells and viruses, e.g., in libraries of cells or viruses. In addition to providing a way of tracking compositions in mixtures, the tags facilitate analysis of cell and viral phenotypes.
10 Claims
1. A method for selecting a set of tag nucleic acids, said method comprising:
(a) providing a plurality of candidate tag nucleic acid sequences, wherein the total number of nucleotides in each tag nucleic acid sequence is identical, the number of G+C nucleotides in each tag nucleic acid sequence is identical and the overall number of A+G nucleotides in each of the tag nucleic acids sequences is even;
(b) aligning a first candidate tag with each of the other candidate tags in the plurality;
(c) excluding the first candidate tag from the plurality if it can be aligned with less than two errors to another candidate tag, wherein an error is either a mismatch hybridization or an overlapping nucleotide;
(d) repeating (b) and (c) for each of the remaining candidate tags in the plurality, thereby providing a set of nucleic acid tags; and
(e) synthesizing at least 100 members of the set of nucleic acid tags.
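Steps (a) through (d) of claim 1 can be illustrated computationally. The following is a minimal sketch, not the patented procedure itself. It assumes an ungapped alignment in which one tag is slid past the other, each overhanging position counts as one error, and each mismatched position in the overlap counts as one error. The function names (`composition_ok`, `min_alignment_errors`, `select_tags`) are hypothetical, and step (e), synthesis, is outside the scope of code.

```python
def composition_ok(tag, length, gc_count):
    """Step (a): fixed length, fixed G+C count, even A+G count."""
    return (len(tag) == length
            and sum(base in "GC" for base in tag) == gc_count
            and sum(base in "AG" for base in tag) % 2 == 0)


def min_alignment_errors(a, b):
    """Fewest errors over all ungapped shifts of two equal-length tags.

    Assumption: errors = overhanging positions + mismatches in the overlap.
    """
    n = len(a)
    best = None
    for shift in range(-(n - 1), n):
        if shift >= 0:
            pairs = zip(a[shift:], b)    # a slid left by `shift` positions
        else:
            pairs = zip(a, b[-shift:])   # b slid left by `-shift` positions
        errors = abs(shift) + sum(x != y for x, y in pairs)
        if best is None or errors < best:
            best = errors
    return best


def select_tags(candidates, length, gc_count, min_errors=2):
    """Steps (b)-(d): drop each tag that aligns too closely to another."""
    remaining = [t for t in candidates if composition_ok(t, length, gc_count)]
    i = 0
    while i < len(remaining):
        first = remaining[i]
        clash = any(min_alignment_errors(first, other) < min_errors
                    for j, other in enumerate(remaining) if j != i)
        if clash:
            del remaining[i]   # excluded; the next tag moves into slot i
        else:
            i += 1
    return remaining


if __name__ == "__main__":
    pool = ["ACAC", "CACA", "ACGT", "AGCT", "ACGA"]
    print(select_tags(pool, length=4, gc_count=2))
```

In this toy pool, "ACGA" fails the even A+G condition, and "ACAC" is excluded because shifting it by one position aligns it with "CACA" with a single overhang error. Real tags would be far longer than four nucleotides; the short sequences are only for illustration.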
2. A method for selecting a set of tag nucleic acids, wherein each tag nucleic acid sequence comprises a constant region and a variable region, said method comprising:
(a) providing a plurality of candidate tag nucleic acid sequences;
(b) aligning a selected candidate tag with each of the other candidate tags in the plurality;
(c) excluding the selected candidate tag if it can be aligned with less than two errors in the variable region to another candidate tag, wherein an error is either a mismatch hybridization or an overlapping nucleotide;
(d) repeating (b) and (c) for each of the remaining candidate tags in the plurality, thereby providing a set of nucleic acid tags; and
(e) synthesizing members of the set of nucleic acid tags.
3. A composition comprising a set of tag nucleic acids to individually detect members of a plurality of tagged components in a mixture of tagged components, which set of tag nucleic acid sequences comprises a plurality of at least 100 different tag nucleic acid sequences, which tag nucleic acid sequences comprise a variable region and a constant region;
which variable region for each tag nucleic acid sequence in the set of tag nucleic acid sequences has the same Tm, the same G+C to A+T ratio, the same length, does not cross-hybridize under stringent conditions to any other tag nucleic acid sequence variable region, and does not cross-hybridize under stringent conditions to the complement of any other tag nucleic acid sequence variable region;
wherein the variable regions of the tag nucleic acid sequence in the set of tag nucleic acid sequences cannot be aligned with less than two differences between any two of the tag nucleic acid sequences in the set of tag nucleic acid sequences, and further wherein each member of the set of nucleic acid tags is comprised of a single nucleic acid sequence from the set of nucleic acid tags.
Dependent claims: 4, 5
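The uniformity conditions of claim 3 (same length, same G+C to A+T ratio, same Tm) can be checked with a short script. The claim does not say how Tm is to be computed, so the sketch below assumes the rough Wallace rule for short oligonucleotides, Tm ≈ 2 °C × (A+T) + 4 °C × (G+C). The function names are hypothetical, and the cross-hybridization conditions are not modeled here.

```python
def wallace_tm(seq):
    """Rough melting temperature in degrees C (Wallace rule, an assumption)."""
    gc = sum(base in "GC" for base in seq)
    at = sum(base in "AT" for base in seq)
    return 2 * at + 4 * gc


def uniform_properties(variable_regions):
    """True if all regions share length, G+C count, and estimated Tm."""
    profiles = {
        (len(region), sum(base in "GC" for base in region), wallace_tm(region))
        for region in variable_regions
    }
    return len(profiles) == 1


if __name__ == "__main__":
    print(wallace_tm("ACGT"))
    print(uniform_properties(["ACGT", "AGCT", "TCGA"]))
    print(uniform_properties(["ACGT", "AAAT"]))
```

Under this particular estimate, regions of equal length and equal G+C count necessarily have equal Tm, so the Tm comparison adds no information of its own. A nearest-neighbor Tm model would depend on base order as well as composition.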
6. A set of at least 100 different tag nucleic acids selected according to the method of 104.
7. A set of at least 100 different tag nucleic acids wherein each member of the set:
(a) does not have a region of complementarity with any other member of the set greater than four nucleotides;
(b) does not have a selected length of contiguous nucleotides in common with any other tag nucleic acid sequence in the set, wherein the length is selected from the following: 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15;
(c) has greater than 5 differences out of 20 nucleotides when aligned for maximal sequence correspondence with any other tag nucleic acid sequence in the set;
(d) does not have four contiguous nucleotides selected from the group consisting of four X residues, four Y residues and four Z residues, wherein X is G or C, Y is G or A, and Z is A or T; and
(e) each member of the set of tag nucleic acids is comprised of a single nucleic acid sequence from the set of nucleic acid tags.
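Conditions (a), (b) and (d) of claim 7 lend themselves to simple string checks. The sketch below is one possible reading. It treats "a region of complementarity greater than four nucleotides" as a stretch of five or more bases in one tag whose reverse complement occurs in another, and it treats the run condition as a regular expression over the X, Y and Z classes. All function names are hypothetical, and condition (c), which requires alignment for maximal sequence correspondence, is deliberately left out.

```python
import re
from itertools import combinations

# Condition (d): four contiguous X (G/C), Y (G/A) or Z (A/T) residues.
RUN_OF_FOUR = re.compile(r"[GC]{4}|[GA]{4}|[AT]{4}")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def kmers(seq, k):
    """All contiguous substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def has_forbidden_run(tag):
    return RUN_OF_FOUR.search(tag) is not None


def share_kmer(a, b, k):
    """Condition (b): any common stretch of k contiguous nucleotides."""
    return not kmers(a, k).isdisjoint(kmers(b, k))


def complementary_stretch(a, b, k=5):
    """Condition (a): complementarity of more than four nucleotides."""
    return share_kmer(a, reverse_complement(b), k)


def violations(tags, shared_len=5):
    """List every violation of conditions (a), (b) and (d) in a tag set."""
    problems = [("run", tag) for tag in tags if has_forbidden_run(tag)]
    for a, b in combinations(tags, 2):
        if share_kmer(a, b, shared_len):
            problems.append(("shared", a, b))
        if complementary_stretch(a, b):
            problems.append(("complement", a, b))
    return problems


if __name__ == "__main__":
    print(violations(["ACGTACGT", "ACGGCCTA"]))
```

The `shared_len` parameter corresponds to the selected length in condition (b); the claim allows any value from 5 to 15, and a smaller value gives a stricter filter.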
8. A set of at least 100 different tag nucleic acids wherein:
(a) no two members of the set can be aligned with less than two errors, wherein an error is either a mismatch hybridization or an overhanging nucleotide;
(b) the total number of nucleotides in each tag nucleic acid sequence is identical;
(c) the number of G+C nucleotides in each tag nucleic acid sequence is identical;
(d) the overall number of A+G nucleotides in each of the tag nucleic acid sequences is even; and
(e) each member of the set of tag nucleic acids is comprised of a single nucleic acid sequence from the set of nucleic acid tags.
Dependent claims: 9, 10
(a) the total number of nucleotides in each tag nucleic acid sequence is identical;
(b) the number of G+C nucleotides in each tag nucleic acid sequence is identical;
(c) the overall number of A+G nucleotides in each of the variable regions of the tag nucleic acid sequences is even; and
(d) each member of the set of tag nucleic acids is comprised of a single nucleic acid sequence from the set of nucleic acid tags.
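The composition conditions recited in claim 8 (identical length, identical G+C count, even A+G count) interact with the two-error requirement in a way that a brute-force check makes visible. Any single-base substitution either changes the G+C count (when it crosses between {G, C} and {A, T}) or flips the parity of the A+G count (G to C, C to G, A to T, T to A). Two distinct tags that satisfy all three conditions therefore differ in at least two positions when aligned without a shift. The sketch below verifies this exhaustively for a small length; the function names are hypothetical, and the check does not cover shifted alignments, which condition (a) addresses separately.

```python
from itertools import product


def signature(tag):
    """(length, G+C count, parity of A+G count) for one tag."""
    return (len(tag),
            sum(base in "GC" for base in tag),
            sum(base in "AG" for base in tag) % 2)


def single_substitutions(tag):
    """Every sequence that differs from `tag` at exactly one position."""
    for i, old in enumerate(tag):
        for new in "ACGT":
            if new != old:
                yield tag[:i] + new + tag[i + 1:]


def substitution_preserves_signature(length):
    """True if some tag of this length has a one-mismatch neighbor
    with the same signature; exhaustive, so keep `length` small."""
    for bases in product("ACGT", repeat=length):
        tag = "".join(bases)
        if any(signature(mutant) == signature(tag)
               for mutant in single_substitutions(tag)):
            return True
    return False


if __name__ == "__main__":
    print(substitution_preserves_signature(4))
```

The exhaustive search enumerates 4^length sequences, so it is only practical for short lengths; the argument about substitutions holds for any length because it applies position by position.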
Specification