Computer-aided nucleic acid sequencing
First Claim
Patent Images
1. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
- inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
the computer selecting nucleic acid probes from the plurality of nucleic acid probes, wherein the step of selecting nucleic acid probes includes selecting nucleic acid probes with highest numbers of single base mismatch neighbors among the probes, a single base mismatch neighbor being another probe that has the same sequence except for a single base that is different; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes.
5 Assignments
0 Petitions
Accused Products
Abstract
A computer system for sequencing nucleic acids is provided. The computer system may be used for de novo sequencing of a nucleic acid sequence by analyzing the fluorescence intensities of hybridized nucleic acid probes on biological chips. The probes with the highest intensities are utilized to sequence the nucleic acid and related probes are analyzed to increase the accuracy of nucleic acid sequencing. The sequence of the nucleic acid sequence may be determined from hybridization intensities that do not allow identification of perfectly complementary probes.
83 Citations
42 Claims
-
1. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
the computer selecting nucleic acid probes from the plurality of nucleic acid probes, wherein the step of selecting nucleic acid probes includes selecting nucleic acid probes with highest numbers of single base mismatch neighbors among the probes, a single base mismatch neighbor being another probe that has the same sequence except for a single base that is different; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes.
-
-
2. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
the computer selecting nucleic acid probes from the plurality of nucleic acid probes, wherein the step of selecting nucleic acid probes includes selecting nucleic acid probes that have fewer than a predetermined number of base mismatches with another probe; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes.
-
-
3. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the-nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
the computer selecting nucleic acid probes from the plurality of nucleic acid probes, wherein the step of selecting nucleic acid probes includes selecting nucleic acid probes with highest hybridization intensities and the nucleic acid probes with the highest hybridization intensities are identified as having hybridization intensities above an intensity threshold; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes.
-
-
4. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
the computer selecting nucleic acid probes from the plurality of nucleic acid probes; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes, wherein the sequencing step includes the steps of;
aligning the selected nucleic acid probes so that bases that are common overlap; and
sequencing the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes. - View Dependent Claims (5)
-
-
6. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence;
the computer selecting nucleic acid probes with highest numbers of single base mismatch neighbors among the probes, a single base mismatch neighbor being another probe that has the same sequence except for a single base that is different; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes. - View Dependent Claims (7, 8, 9, 10, 11, 12)
aligning the selected nucleic acid probes so that bases that are common overlap; and
sequencing the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes.
-
-
12. The method of claim 11, wherein the nucleic acid sequence is sequenced as complementary to the bases that occur most often.
-
13. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence;
code that selects nucleic acid probes with highest numbers of single base mismatch neighbors among the probes, a single base mismatch neighbor being another probe that has the same sequence except for a single base that is different;
code that sequences the nucleic acid sequence according to the selected nucleic acid probes; and
a computer readable storage medium that stores the codes. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
code that aligns the selected nucleic acid probes so that bases that are common overlap; and
code that sequences the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes.
-
-
20. The computer program product of claim 19, wherein the nucleic acid sequence is sequenced as complementary to the bases that occur most often.
-
21. A computer implemented method of sequencing a nucleic acid sequence, the method comprising the steps of:
-
inputting hybridization intensities into a computer for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence;
the computer selecting nucleic acid probes that have fewer than a predetermined number of base mismatches with another probe; and
the computer sequencing the nucleic acid sequence according to the selected nucleic acid probes. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28)
comparing bases at positions 1 through n of a first nucleic acid probe to bases at positions 1 through n of a second nucleic acid probe; and
counting base mismatches between the first and second nucleic acid probe.
-
-
26. The method of claim 24, further comprising the steps of:
-
comparing bases at positions 1 through n−
1 of a first nucleic acid probe to bases at positions 2 through n of a second nucleic acid probe; and
counting base mismatches between the first and second nucleic acid probe.
-
-
27. The method of claim 21, wherein the sequencing step includes the steps of:
-
aligning the selected nucleic acid probes so that bases that are common overlap; and
sequencing the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes.
-
-
28. The method of claim 27, wherein the nucleic acid sequence is sequenced as complementary to the bases that occur most often.
-
29. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence;
code that selects nucleic acid probes that have fewer than a predetermined number of base mismatches with another probe;
code that sequences the nucleic acid sequence according to the selected nucleic acid probes; and
a computer readable storage medium that stores the codes. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37)
code that compares bases at positions 1 through n of a first nucleic acid probe to bases at positions 1 through n of a second nucleic acid probe; and
code that counts base mismatches between the first and second nucleic acid probe.
-
-
35. The computer program product of claim 33, further comprising:
-
code that compares bases at positions 1 through n−
1 of a first nucleic acid probe to bases at positions 2 through n of a second nucleic acid probe; and
code that counts base mismatches between the first and second nucleic acid probe.
-
-
36. The computer program product of claim 29, wherein the code that sequences includes:
-
code that aligns the selected nucleic acid probes so that bases that are common overlap; and
code that sequences the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes.
-
-
37. The computer program product of claim 36, wherein the nucleic acid sequence is sequenced as complementary to the bases that occur most often.
-
38. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
code that selects nucleic acid probes from the plurality of nucleic acid probes, wherein the code that selects nucleic acid probes includes code that selects nucleic acid probes with highest numbers of single base mismatch neighbors among the probes, a single base mismatch neighbor being another probe that has the same sequence except for a single base that is different;
code that sequences and nucleic acid sequence according to the selected nucleic acid probes; and
a computer readable storage medium that stores the codes.
-
-
39. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
code that selects nucleic acid probes from the plurality of nucleic acid probes, wherein the code that selects nucleic acid probes includes code that selects nucleic acid probes that have fewer than a predetermined number of base mismatches with another probe;
code that sequences and nucleic acid sequence according to the selected nucleic acid probes; and
a computer readable storage medium that stores the codes.
-
-
40. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
code that selects nucleic acid probes from the plurality of nucleic acid probes, wherein the code that selects nucleic acid probes includes code that selects nucleic acid probes with highest hybridization intensities and the nucleic acid probes with the highest hybridization intensities are identified as having hybridization intensities above an intensity threshold;
code that sequences and nucleic acid sequence according to the selected nucleic acid probes; and
a computer readable storage medium that stores the codes.
-
-
41. A computer program product that sequences a nucleic acid sequence, comprising:
-
code that receives as input hybridization intensities for a plurality of nucleic acid probes, the nucleic acid probes hybridizing with the nucleic acid sequence under conditions that do not allow identification of all nucleic acid probes of the plurality of nucleic acid probes that are perfectly complementary to part of the nucleic acid sequence;
code that selects nucleic acid probes from the plurality of nucleic acid probes;
code that sequences and nucleic acid sequence according to the selected nucleic acid probes, wherein the code that sequences includes;
code that aligns the selected nucleic acid probes so that bases that are common overlap; and
code that sequences the nucleic acid sequence according to bases that occur most often at each position in the aligned nucleic acid probes; and
a computer readable storage medium that stores the codes. - View Dependent Claims (42)
-
Specification