Methods and computer program products for compression of sequencing data
First Claim
1. A compression method, comprising:
- measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises a plurality of measured values and the chemical event is indicative of a number of nucleotide incorporations in a genetic sequencing reaction; and
applying a first compression process to the waveform using a processor, the first compression process including a truncating of data corresponding to a portion of the waveform that is not related to nucleotide incorporations in the genetic sequencing reaction thereby forming a compressed data structure andstoring the compressed data structure in a memory.
2 Assignments
0 Petitions
Accused Products
Abstract
A compression method includes: measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises a plurality of measured values and the chemical event is indicative of a number of nucleotide incorporations in a genetic sequencing reaction; applying a first compression process to the waveform, the first compression process including a truncating of data corresponding to a portion of the waveform that is not related to nucleotide incorporations in the genetic sequencing reaction; and applying a second compression process to the waveform, the second compression process including a data substitution process that replaces at least a portion of the waveform with a plurality of coefficients representative of the portion of the waveform.
32 Citations
20 Claims
-
1. A compression method, comprising:
-
measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises a plurality of measured values and the chemical event is indicative of a number of nucleotide incorporations in a genetic sequencing reaction; and applying a first compression process to the waveform using a processor, the first compression process including a truncating of data corresponding to a portion of the waveform that is not related to nucleotide incorporations in the genetic sequencing reaction thereby forming a compressed data structure and storing the compressed data structure in a memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer program product comprising a non-transitory computer-usable medium having computer program logic recorded thereon that, when executed by one or more processors, samples and compresses data from a sensor array, the computer program logic comprising:
-
first computer readable program code that enables a processor to measure a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises a plurality of measured values and the chemical event is indicative of a number of nucleotide incorporations in a genetic sequencing reaction; and second computer readable program code that enables a processor to apply a first compression process to the waveform, the first compression process including a truncating of data corresponding to a portion of the waveform that is not related to nucleotide incorporations in the genetic sequencing reaction thereby forming a compressed data structure and storing the compressed data structure in a memory. - View Dependent Claims (17)
-
-
18. A method for compressing nucleic acid sequencing data, comprising:
-
obtaining raw data from a semiconductor-based sequencing sensor array comprising a plurality of sensors during a data acquisition time period, the raw data comprising at least a non-informative portion corresponding to a subinterval of the data acquisition time period having a location within the data acquisition time period that varies for different sensors according to a position of the sensor in the sensor array; and transforming the raw data into compressed data using a lossy compression process including a data truncation process, the data truncation process being related for each sensor to the position of the sensor in the sensor array and configured to discard the non-informative portion of the raw data thereby forming a compressed data structure and storing the compressed data structure in a memory. - View Dependent Claims (19, 20)
-
Specification