Feature list extraction from data sets such as spectra
Abstract
A component list extraction method improves the quality of data extracted from a series of spectra, images, or other data sets, resulting in more accurate analysis and data mining. A series of spectra, such as mass spectra, are obtained and thresholded to distinguish peaks from noise. Conventionally, all data below the noise threshold are recorded as having zero intensity, which introduces an artificial discontinuity in the data. Instead, a composite peak list is constructed containing peaks occurring in at least a minimum number of spectra, and intensity values are recorded for corresponding peak locations in all spectra, even those having intensities below the noise threshold. The resulting intensities serve as inputs to a data mining or analysis method. The method can also be used as a peak detection method to determine components characterizing a sample type or patient population. The method is particularly useful for biological marker discovery and image processing.
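The steps summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation; it assumes that all spectra are sampled on a shared axis, that a "peak" is a local maximum above a fixed noise threshold, and that peaks in different spectra correspond only when they fall at the same index. The function names (`find_peaks`, `composite_peak_list`, `intensity_matrix`) and the sample values are hypothetical and chosen for illustration only.

```python
def find_peaks(spectrum, noise_threshold):
    """Return indices of local maxima above the noise threshold.

    Assumption: a simple local-maximum criterion stands in for whatever
    peak criterion is actually applied to each data set.
    """
    peaks = []
    for i in range(1, len(spectrum) - 1):
        y = spectrum[i]
        if y > noise_threshold and y > spectrum[i - 1] and y >= spectrum[i + 1]:
            peaks.append(i)
    return peaks


def composite_peak_list(spectra, noise_threshold, min_occurrences):
    """Keep peak locations found in at least `min_occurrences` spectra."""
    counts = {}
    for spectrum in spectra:
        # set() ensures each spectrum contributes at most once per location
        for i in set(find_peaks(spectrum, noise_threshold)):
            counts[i] = counts.get(i, 0) + 1
    return sorted(i for i, n in counts.items() if n >= min_occurrences)


def intensity_matrix(spectra, locations):
    """Record the intensity at every retained location in every spectrum,
    including values that fall below the noise threshold (no zeroing)."""
    return [[spectrum[i] for i in locations] for spectrum in spectra]


# Hypothetical data: three spectra on a shared nine-point axis.
spectra = [
    [0, 1, 9, 1, 0, 2, 0, 5, 0],
    [0, 1, 8, 1, 0, 6, 0, 0, 0],
    [0, 1, 2, 1, 0, 7, 1, 0, 0],
]
locations = composite_peak_list(spectra, noise_threshold=3, min_occurrences=2)
features = intensity_matrix(spectra, locations)
print(locations)  # [2, 5]
print(features)   # [[9, 2], [8, 6], [2, 7]]
```

In this example the peak at index 7 appears in only one spectrum and is dropped, while the sub-threshold intensities at indices 5 (first spectrum) and 2 (third spectrum) are kept as measured rather than recorded as zero. Real spectra would normally need a tolerance window when matching peak locations across data sets, which this sketch omits.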
26 Claims
1. A data processing method comprising:
obtaining a plurality of data sets;
applying a criterion to each data set to identify at least one feature in said data set;
retaining features present in at least an occurrence threshold number of said data sets; and
defining a location corresponding to each retained feature.
Dependent claims: 2, 3, 4, 5, 6.

7. A method for analyzing a set of spectra, comprising:
in each spectrum, identifying candidate peaks;
retaining candidate peaks present in at least an occurrence threshold number of said spectra; and
defining a spectral region corresponding to each retained peak.
Dependent claims: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20.

21. A program storage device accessible by a processor, tangibly embodying a program of instructions executable by said processor to perform method steps for a data processing method, said method steps comprising:
obtaining a plurality of data sets;
applying a criterion to each data set to identify at least one feature in said data set;
retaining features present in at least an occurrence threshold number of said data sets; and
defining a location corresponding to each retained feature.
Dependent claims: 22, 23, 24, 25, 26.

Specification