Method for reducing cross-talk within DNA data
First Claim
1. A method for enhancing DeoxyriboNucleic Acid (DNA) raw data comprising the steps of:
- (a) providing an apparatus for collecting DNA data from dye-labeled DNA fragments, the DNA data being divided between a plurality of channels;
(b) passing the DNA data contained in each of the plurality of channels through a filter to reduce cross-talk between DNA data contained in each of the channels; and
(c) adjusting the baseline of the DNA data contained in each of the channels.
1 Assignment
0 Petitions
Accused Products
Abstract
Raw DNA data is filtered with a multi-component analysis that is applied to the difference of the signal intensity on each of the raw DNA data signals to remove cross talk between the signals. The analysis is done before any baseline adjustment of the raw DNA data. Instead, the baseline adjustment occurs after the raw DNA data has been filtered. Additionally, an additional processing step is applied to the data to account for the non-linear nature of cross talk filtering. The additional processing step involves combining the signal with its derivative to account for the correlation of each of the data signals with the other three data signals.
41 Citations
19 Claims
-
1. A method for enhancing DeoxyriboNucleic Acid (DNA) raw data comprising the steps of:
-
(a) providing an apparatus for collecting DNA data from dye-labeled DNA fragments, the DNA data being divided between a plurality of channels;
(b) passing the DNA data contained in each of the plurality of channels through a filter to reduce cross-talk between DNA data contained in each of the channels; and
(c) adjusting the baseline of the DNA data contained in each of the channels. - View Dependent Claims (2, 3, 4, 5, 6, 7)
(b1) determining difference values for the signals in each channel that correspond to the DNA data by subtracting the magnitudes of the signals in each of the channels at two consecutive sampling instants;
(b2) applying a multi-component analysis to the difference values obtained in step (b1) to deconvolute the DNA data contained in the signals; and
(b3) recombining the deconvoluted difference data with the corresponding signals at the specific sampling instant to obtain the signal intensity.
-
-
3. The method according to claim 2 wherein the multi-component analysis in step (b2) includes multiplying the signals corresponding to the DNA data by a constant coefficient transformation matrix M.
-
4. The method according to claim 3 wherein the multi-component analysis in step (b2) includes the following operation:
-
where Δ
sj represents the variation of the measured signal sj in each channel that corresponds to the DNA data between two consecutive signal measurements and Δ
sj represents filtered signal variation with crosstalk removed; and
mi,j is a constant coefficient indicating the cross talk between measured signal varation sj and the filtered signal variation Δ
sj.
-
-
5. The method according to claim 4 wherein, prior to step (b), the signal corresponding to the DNA data is passed through a low pass filter.
-
6. The method according to claim 5 further including, before adjusting the baseline of the DNA data in step (c), passing the signal corresponding to the DNA data in each of the channels through a high pass filter.
-
7. The method according to claim 6 further including, subsequent to the baseline adjustment in step (c), locating peak values in each channel and reading a DNA sequence from a combination of the DNA data contained in each of the channels.
-
8. A method for enhancing DeoxyriboNucleic Acid (DNA) raw data comprising the steps of:
-
(a) providing an apparatus for collecting DNA data from dye-labeled DNA fragments, said data divided between a plurality of channels;
(b) passing the DNA data in each channel through a first filter to reduce cross-talk between DNA data contained in each of the channels;
(c) passing the filtered DNA data in each channel from step (b) through a second filter to reduce any non-linearity remaining after the first filtering process in step (b);
(d) recombining the filtered DNA data in each channel from step (c) with corresponding signals at a specific sampling instant to obtain a filtered signal intensity for each channel; and
(e) adjusting the baseline of the DNA data contained in each of the channels. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17)
(c1) determining derivative values for the filtered DNA data obtained from the first filter in step (b); and
(c2) applying a multi-component analysis to the derivative values obtained in step (c1) to remove non-linear effects remaining after the first filtering process.
-
-
10. The method according to claim 9 wherein the multi-component analysis in step (c2) includes multiplying the derivative values obtained in step (c1) by a constant coefficient transformation matrix T.
-
11. The method according to claim 10 wherein the multi-component analysis in step (c2) includes the following operation:
-
where Δ
si represents the variation of the DNA data after the second filtering process in step (c);
Δ
si represents the variation of the DNA data after the first filtering process in step (b);
Δ
sj′
represents the time derivative of Δ
si; and
ti,j is a constant coefficient indicating an approximated linear relationship between intensity Δ
si and Δ
sj′
.
-
-
12. The method according to claim 11 wherein the reduction of cross talk between each of the channels in step (b) includes the steps of:
-
(b1) determining difference values for the DNA data in each channel by subtracting the magnitudes of the DNA data in each channel at two consecutive sampling instants; and
(b2) applying a multi-component analysis to the difference values obtained in step (b1) to deconvolute the DNA data contained in each channel.
-
-
13. The method according to claim 12 wherein the multi-component analysis in step (b2) includes multiplying the DNA data by a constant coefficient transformation matrix M.
-
14. The method according to claim 13 wherein the multi-component analysis in step (b2) includes the following operation:
-
where Δ
sj represents the variation of the measured DNA data sj between two consecutive DNA data measurements and Δ
sj represents filtered DNA data variation with crosstalk removed; and
mi,j is a constant coefficient indicating the cross talk between measured DNA data varation sj and the filtered signal variation Δ
sj.
-
-
15. The method according to claim 14 wherein, prior to step (b), the DNA data is passed through a low pass filter.
-
16. The method according to claim 15 further including, before adjusting the baseline of the DNA data in step (e), passing the DNA data through a high pass filter.
-
17. The method according to claim 16 further including, subsequent to the baseline adjustment in step (e), locating peak values in the DNA data in each channel and reading a DNA sequence from a combination of the DNA data contained in each of the channels.
-
18. An algorithm for processing DeoxyriboNucleic Acid (DNA) data, the DNA data being divided between a plurality of channels, the algorithm comprising the steps of:
-
(a) measuring a signal in each channel that corresponds to the DNA data associated with the channel;
(b) determining difference values for the measured signals in each channel by subtracting the magnitudes of the measured signals in each channel at two consecutive sampling instants;
(c) passing the difference values for the measured signals in each of the channels through a first filter to reduce cross-talk between the measured signals contained in the channels that includes the following operation;
where Δ
sj represents the variation of the measured signal sj between two consecutive signal measurements and Δ
sj represents filtered measured signal variation with crosstalk removed; and
mi,j is a constant coefficient indicating the cross talk between measured signal varation sj and the filtered signal variation Δ
sj;
(d) passing the filtered measured signals from step (c) in each of the channels through a second filter to reduce any non-linearity remaining after the first filtering process in step (c) that includes the following operation;
where Δ
si represents the variation of the data signal after the second filtering process;
Δ
si represents the variation of the measured signal after the first filtering process in step (c);
Δ
sj′
represents the time derivative of Δ
si; and
ti,j is a constant coefficient indicating the non-linear relationship between Δ
si and Δ
sj′
;
(e) reconstructing the measured signals in each of the channels; and
(f) adjusting the baseline of the measured signal contained in each of the channels.
-
-
19. An algorithm for processing DeoxyriboNucleic Acid (DNA) data, the DNA data being divided between a plurality of channels, the algorithm comprising the steps of:
-
(a) passing the DNA data in each channel through a first filter to reduce cross-talk between DNA data contained in each of the channels that includes the following operation;
where sj represents a signal corresponding to the filtered DNA data and fj represents fluorescence intensity; and
mi,j is a constant coefficient indicating the cross talk between intensity signals i and j;
(b) passing the filtered DNA data signal in each of the channels through a second filter to reduce any non-linearity introduced by the first filtering process in step (a) that includes the following operation;
where si represents the variation of the DNA data signal after the second filtering process in step (b);
si represents the measured signal after the first filtering process in step (a);
sj′
represents the time derivative of si; and
ti,j is a constant coefficient indicating the non-linear relationship between intensity Δ
si and Δ
sj′
;
(c) reconstructing the DNA data signals in each of the channels; and
(d) adjusting the baseline of the DNA data contained in each of the channels.
-
Specification