Kernels and kernel methods for spectral data
First Claim
1. A method for analysis of data contained in a plurality of spectra generated from mass spectrographic measurement of protein samples corresponding to different biological conditions, wherein the different biological conditions have associated different levels of protein expression, the method comprising:
- downloading the plurality of spectra into a computer system comprising a processor and a storage device, wherein the processor is programmed to execute at least one support vector machine and performs the steps of;
aligning the plurality of spectra, comprising;
selecting a first example spectrum as a baseline example;
sliding each spectral peak of a second example spectrum one at a time along a plurality of peaks within the baseline example;
applying a scoring function to obtain a similarity score between each spectral peak of the second example spectrum and the peaks within the baseline example, the similarity score being determined according to the relationship
3 Assignments
0 Petitions
Accused Products
Abstract
Support vector machines are used to classify data contained within a structured dataset such as a plurality of signals generated by a spectral analyzer. The signals are pre-processed to ensure alignment of peaks across the spectra. Similarity measures are constructed to provide a basis for comparison of pairs of samples of the signal. A support vector machine is trained to discriminate between different classes of the samples. to identify the most predictive features within the spectra. In a preferred embodiment feature selection is performed to reduce the number of features that must be considered.
54 Citations
25 Claims
-
1. A method for analysis of data contained in a plurality of spectra generated from mass spectrographic measurement of protein samples corresponding to different biological conditions, wherein the different biological conditions have associated different levels of protein expression, the method comprising:
-
downloading the plurality of spectra into a computer system comprising a processor and a storage device, wherein the processor is programmed to execute at least one support vector machine and performs the steps of; aligning the plurality of spectra, comprising; selecting a first example spectrum as a baseline example; sliding each spectral peak of a second example spectrum one at a time along a plurality of peaks within the baseline example; applying a scoring function to obtain a similarity score between each spectral peak of the second example spectrum and the peaks within the baseline example, the similarity score being determined according to the relationship - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for analysis of mass spectrographic data generated from protein samples comprising different biological conditions, the method comprising:
-
downloading a plurality of spectra comprising the mass spectrographic data into a computer system comprising a processor and a storage device, wherein the processor is programmed to execute at least one support vector machine and performs the steps of; aligning spectra within the mass spectrographic data, comprising; selecting a first example spectrum as a baseline example; sliding each spectral peak of a second example spectrum one at a time along a plurality of peaks within the baseline example; applying a scoring function to obtain a similarity score between the second example spectrum and the baseline example, the similarity score being determined according to the relationship - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method for analysis of protein mass spectrographic data contained in a plurality of spectra generated from a plurality of protein samples comprising a plurality of different medical conditions, the data residing in a storage device, the method comprising:
aligning the plurality of spectra using a processor by constructing a similarity measure for comparing pairs of spectra with a baseline spectrum, the similarity measure being determined according to the relationship - View Dependent Claims (24, 25)
Specification