Enhanced data compression for sparse multidimensional ordered series data
First Claim
Patent Images
1. A computer-implemented method of compressing a sparse multidimensional ordered series of spectroscopic data, the method comprising:
- a) receiving the sparse multidimensional ordered series data containing values that fall within a dynamic range of less than 10 orders of magnitude, wherein the data comprise indexed data sets, each indexed data set comprising an index (n), a first variable (xn) representing a mass to charge ratio (m/z), and a second variable (yn) representing signal intensity;
b) defining a predictor that calculates each first variable (xn);
c) assigning an amplitude code word to each yn;
d) calculating a hop offset value (Δ
n) for each yn;
e) assigning a hop code word to each Δ
n based on the value of the Δ
n; and
f) generating a compressed output, said compressed output comprising;
i) a decoder legend comprising;
a reverse amplitude code word dictionary associated with yn; and
a reverse hop code word dictionary associated with Δ
n; and
ii) code word data comprising an amplitude code word and a hop code word for each yn and each Δ
n.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are methods and systems for significantly compressing sparse multidimensional ordered series data comprised of indexed data sets, wherein each data set comprises an index, a first variable and a second variable. The methods and systems are particularly suited for compression of data recorded in double precision floating point format.
67 Citations
19 Claims
-
1. A computer-implemented method of compressing a sparse multidimensional ordered series of spectroscopic data, the method comprising:
-
a) receiving the sparse multidimensional ordered series data containing values that fall within a dynamic range of less than 10 orders of magnitude, wherein the data comprise indexed data sets, each indexed data set comprising an index (n), a first variable (xn) representing a mass to charge ratio (m/z), and a second variable (yn) representing signal intensity; b) defining a predictor that calculates each first variable (xn); c) assigning an amplitude code word to each yn; d) calculating a hop offset value (Δ
n) for each yn;e) assigning a hop code word to each Δ
n based on the value of the Δ
n; andf) generating a compressed output, said compressed output comprising; i) a decoder legend comprising; a reverse amplitude code word dictionary associated with yn; and a reverse hop code word dictionary associated with Δ
n; andii) code word data comprising an amplitude code word and a hop code word for each yn and each Δ
n. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
Specification