Modifications to polynucleotides for sequencing
First Claim
1. A method comprising:
- receiving a plurality of polynucleotide sequences, wherein each of the plurality of polynucleotide sequences includes a calibration region, the calibration region read by a sequencing machine during a calibration process;
performing an analysis of sequences of the plurality of polynucleotide sequences;
determining, based at least partly on the analysis, a number of individual polynucleotide sequences of the plurality of polynucleotide sequences having a same nucleotide located in a same position of the calibration region of each of the plurality of polynucleotide sequences;
determining that the number is greater than a threshold number;
modifying a nucleotide sequence of the calibration region of at least one of the plurality of polynucleotide sequences to produce a modified polynucleotide sequence, wherein, after the modifying, the number is less than the threshold number; and
providing the modified polynucleotide sequence to a polynucleotide synthesizer.
2 Assignments
0 Petitions
Accused Products
Abstract
This disclosure describes techniques to improve the sequencing of polynucleotides by decreasing the likelihood of errors occurring during a sequencing calibration process. In implementations, regions of polynucleotides that are used for the calibration process can be modified to reduce a number of polynucleotides that have a same nucleotide at one or more positions of the calibration regions. In some cases, the calibration regions can be modified by adding a sequence to the polynucleotides that replaces the original calibration regions. Also, the calibration regions can be modified by rearranging the nucleotides at the different positions of the calibration regions. Additionally, the calibration regions can be modified by adding sequences of varying length to the polynucleotides being sequenced to produce polynucleotides having varying length with different calibration regions.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving a plurality of polynucleotide sequences, wherein each of the plurality of polynucleotide sequences includes a calibration region, the calibration region read by a sequencing machine during a calibration process; performing an analysis of sequences of the plurality of polynucleotide sequences; determining, based at least partly on the analysis, a number of individual polynucleotide sequences of the plurality of polynucleotide sequences having a same nucleotide located in a same position of the calibration region of each of the plurality of polynucleotide sequences; determining that the number is greater than a threshold number; modifying a nucleotide sequence of the calibration region of at least one of the plurality of polynucleotide sequences to produce a modified polynucleotide sequence, wherein, after the modifying, the number is less than the threshold number; and providing the modified polynucleotide sequence to a polynucleotide synthesizer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising:
-
generating polynucleotide data indicating a polynucleotide sequence including a payload and a file identifier, the payload encoding a segment of digital data and the file identifier being associated with the digital data; determining that a position located in a calibration region of the polynucleotide sequence has a same nucleotide as the same position located in additional calibration regions of at least a threshold number of additional polynucleotide sequences, the calibration region and the additional calibration regions including a sequence of nucleotides used by a sequencing machine during a calibration process of the sequencing machine; modifying a portion of the polynucleotide data corresponding to the calibration region of the polynucleotide to produce a modified polynucleotide sequence having a modified calibration region such that less than the threshold number of the calibration regions include the same nucleotide at the same position; and providing the modified polynucleotide sequence to a polynucleotide synthesizer. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system comprising:
-
a processor; a memory in communication with the processing unit, the memory storing computer-readable instructions that when executed by the processor perform operations comprising; performing an analysis of nucleotide sequence data of a plurality of polynucleotide sequences, wherein individual ones of the plurality of polynucleotide sequences include calibration regions comprising a sequence of nucleotides used by a sequencing machine during a calibration process of the sequencing machine and payload regions that encode a segment of digital data; determining, based at least partly on the analysis, that a likelihood of an error occurring during the calibration process of the sequencing machine while sequencing polynucleotides with nucleotide sequences indicated by the plurality of polynucleotide sequences is at least a threshold probability; modifying the calibration regions of at least a portion of the plurality of polynucleotide sequences to produce a modified plurality of polynucleotide sequences having modified calibration regions such that the likelihood of an error occurring during the calibration process of the sequencing machine while sequencing polynucleotides with nucleotide sequences indicated by the modified plurality of polynucleotide sequences is below the threshold probability; and providing the modified plurality of polynucleotide sequences to a polynucleotide synthesizer. - View Dependent Claims (17, 18, 19, 20)
-
Specification