Efficient methods and apparatus for high-throughput processing of gene sequence data
First Claim
1. A method of processing gene sequence data with use of one or more computers, the method comprising:
reading, by the computer, gene sequence data corresponding to a gene sequence and coding sequence data corresponding to a plurality of coding sequences within the gene sequence;
identifying, by the computer following a set of primer selection rules, primer pair data within the gene sequence data, the primer pair data corresponding to a pair of primer sequences for one of the coding sequences, the set of primer selection rules including a first rule specifying that the primer pair data for the coding sequence be obtained for a predetermined annealing temperature;
the set of primer selection rules including a second rule specifying that, based on a comparison of the primer pair data and gene family data, wherein the gene family data represents a gene family member of the gene sequence other than the gene sequence, stored in a file, the primer pair data for the coding sequence must fail to match the gene family data;
storing the primer pair data;
repeating the acts of identifying and storing such that primer pair data are obtained for each coding sequence of the plurality of coding sequences at the predetermined annealing temperature; and
simultaneously amplifying the plurality of coding sequences in gene sequences from three or more individuals at the predetermined annealing temperature using the identified pairs of primer sequences, such that a plurality of amplified coding sequences from the three or more individuals are obtained.
Abstract
One disclosed method of processing gene sequence data includes the steps of: reading gene sequence data corresponding to a gene sequence and coding sequence data corresponding to a plurality of coding sequences within the gene sequence; identifying and storing, by following a set of primer selection rules, primer pair data within the gene sequence data for one of the coding sequences; repeating the acts of identifying and storing such that primer pair data are obtained for each sequence of the plurality of coding sequences; and simultaneously amplifying the plurality of coding sequences in gene sequences from three or more individuals using the identified pairs of primer sequences. The set of primer selection rules includes a rule specifying that all of the primer pair data for the plurality of coding sequences be obtained for a predetermined annealing temperature, which allows for the subsequent simultaneous amplification of sequences from hundreds of individuals in a single amplification run.
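The identify, store, and repeat loop described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the text here does not say how a primer is judged suitable for the predetermined annealing temperature, so the sketch substitutes the simple Wallace rule (2 °C per A/T, 4 °C per G/C) as a melting-temperature estimate. The function names, the flank size, the tolerance, and the primer length range are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in deg C (Wallace rule, an assumed stand-in)."""
    p = primer.upper()
    return 2 * (p.count("A") + p.count("T")) + 4 * (p.count("G") + p.count("C"))


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_primer(region: str, target_tm: int, tol: int = 2,
                min_len: int = 18, max_len: int = 24) -> Optional[str]:
    """Return the first window in region whose estimated Tm is within tol of target_tm."""
    for length in range(min_len, max_len + 1):
        for i in range(len(region) - length + 1):
            candidate = region[i:i + length]
            if abs(wallace_tm(candidate) - target_tm) <= tol:
                return candidate
    return None


def select_primer_pairs(gene: str, coding_regions: List[Tuple[int, int]],
                        target_tm: int, flank: int = 60
                        ) -> Dict[Tuple[int, int], Tuple[str, str]]:
    """Pick one primer pair per coding region, all at the same target temperature.

    coding_regions holds (start, end) offsets into gene, end exclusive.
    """
    pairs: Dict[Tuple[int, int], Tuple[str, str]] = {}
    for start, end in coding_regions:          # the "repeating" act
        upstream = gene[max(0, start - flank):start]
        downstream = gene[end:end + flank]
        forward = find_primer(upstream, target_tm)
        reverse_site = find_primer(downstream, target_tm)
        if forward and reverse_site:
            # the "storing" act: keep the pair keyed by its coding region
            pairs[(start, end)] = (forward, reverse_complement(reverse_site))
    return pairs
```

Because every pair is chosen against the same `target_tm`, all of the stored primers are candidates for a single shared amplification run, which is the point the abstract makes about the predetermined annealing temperature.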
26 Claims
1. A method of processing gene sequence data with use of one or more computers, the method comprising:
reading, by the computer, gene sequence data corresponding to a gene sequence and coding sequence data corresponding to a plurality of coding sequences within the gene sequence;
identifying, by the computer following a set of primer selection rules, primer pair data within the gene sequence data, the primer pair data corresponding to a pair of primer sequences for one of the coding sequences, the set of primer selection rules including a first rule specifying that the primer pair data for the coding sequence be obtained for a predetermined annealing temperature;
the set of primer selection rules including a second rule specifying that, based on a comparison of the primer pair data and gene family data, wherein the gene family data represents a gene family member of the gene sequence other than the gene sequence, stored in a file, the primer pair data for the coding sequence must fail to match the gene family data;
storing the primer pair data;
repeating the acts of identifying and storing such that primer pair data are obtained for each coding sequence of the plurality of coding sequences at the predetermined annealing temperature; and
simultaneously amplifying the plurality of coding sequences in gene sequences from three or more individuals at the predetermined annealing temperature using the identified pairs of primer sequences, such that a plurality of amplified coding sequences from the three or more individuals are obtained.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A computer program product comprising:
a computer-usable storage medium;
computer-readable program code embodied on said computer-usable storage medium; and
the computer-readable program code for effecting the following acts on a computer:
reading gene sequence data corresponding to a gene sequence and coding sequence data corresponding to a plurality of coding sequences within the gene sequence;
identifying primer pair data within the gene sequence data by following a set of primer selection rules, the primer pair data corresponding to a pair of primer sequences for one of the coding sequences, the set of primer selection rules including a first rule specifying that the primer pair data for the coding sequence be obtained for a predetermined annealing temperature;
the set of primer selection rules including a second rule specifying that, based on a comparison of the primer pair data and gene family data, wherein the gene family data represents a gene family member of the gene sequence other than the gene sequence, stored in a file, the primer pair data for the coding sequence must fail to match the gene family data;
storing the primer pair data; and
repeating the acts of identifying and storing such that primer pair data are obtained for each coding sequence of the plurality of coding sequences at the predetermined annealing temperature, so that the plurality of coding sequences can be simultaneously amplified in gene sequences from three or more individuals at the predetermined annealing temperature using the identified pairs of primer sequences to produce a plurality of amplified coding sequences from the three or more individuals.
Dependent claims: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26
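The second rule recited in the claims, that primer pair data must fail to match gene family data stored in a file, might be sketched as below. The claims as quoted here do not define what counts as a "match", so this sketch assumes the simplest reading: an exact occurrence of the primer, on either strand, inside any gene family member. It also assumes the file is in FASTA format. Both are assumptions for illustration, and all function names are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into {record name: uppercase sequence}."""
    records: Dict[str, str] = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def matches_family(primer: str, family_seqs: Iterable[str]) -> bool:
    """True if the primer occurs exactly, on either strand, in any family member."""
    p = primer.upper()
    rc = reverse_complement(p)
    return any(p in s or rc in s for s in (f.upper() for f in family_seqs))


def filter_pairs(candidates: List[Tuple[str, str]],
                 family_seqs: List[str]) -> List[Tuple[str, str]]:
    """Keep only pairs in which neither primer matches the gene family data."""
    return [
        (fwd, rev) for fwd, rev in candidates
        if not matches_family(fwd, family_seqs)
        and not matches_family(rev, family_seqs)
    ]
```

With a file on disk, the family sequences could be loaded with `read_fasta(open(path))` and passed as `list(records.values())`. A production check would more likely tolerate mismatches through an alignment search; exact substring matching is used here only to keep the example short.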
Specification