Method and system for pattern analysis, such as for analyzing oligonucleotide primer extension assay products
First Claim
1. A system for generating information based on biological samples, comprising:
- a biological assay device that analyzes a two dimensionally spaced set of biological samples and produces a set of data therefrom, wherein the biological samples include at least one control sample, wherein the set of data includes x-axis and y-axis spatial coordinate values and a quantitative value for each biological sample, and wherein the set of data may include unwanted values; and
a computer having a processor and memory, wherein the computer receives the set of data from the biological assay device, stores the set of data in the memory, and wherein the processor;
determines a two dimensional location of the at least one control sample based on the set of data;
identifies x-axis and y-axis spatial coordinates for the biological samples in the set of data based on the determined two dimensional location of the at least one control sample;
filters the quantitative values of for at least some of the biological samples, from the unwanted values, in the set of data, based on the identified x-axis and y-axis spatial coordinates; and
analyzes the filtered quantitative values of each biological sample, respectively, by comparing the quantitative value of a first portion of the filtered quantitative values of each respective biological sample at a first pair of x-axis and y-axis spatial coordinates with a sum of the quantitative value of the first portion and the quantitative value of a second portion of the filtered quantitative values of each respective biological sample at a second pair of x-axis and y-axis spatial coordinates to produce a displayable report of the analyzed quantitative values.
0 Assignments
0 Petitions
Accused Products
Abstract
A system and corresponding method analyzes data for patterns, such as data produced from DNA sequencers. An ASCII text file, generated by DNA sequencer software, is used as the input source for the pattern analysis software program. This text file contains peak intensity data, identified by gel-lane and mobility, for bands corresponding to 2 sequences of differing length. The software uses gel-mobility values of control DNA samples to identify bands of interest from test samples in the dataset. Spurious data point or artifacts are filtered out in this selection process. The selected data is then imported to a second software program that performs algorithms such as linear progression and curve fitting.
-
Citations
35 Claims
-
1. A system for generating information based on biological samples, comprising:
-
a biological assay device that analyzes a two dimensionally spaced set of biological samples and produces a set of data therefrom, wherein the biological samples include at least one control sample, wherein the set of data includes x-axis and y-axis spatial coordinate values and a quantitative value for each biological sample, and wherein the set of data may include unwanted values; and
a computer having a processor and memory, wherein the computer receives the set of data from the biological assay device, stores the set of data in the memory, and wherein the processor;
determines a two dimensional location of the at least one control sample based on the set of data;
identifies x-axis and y-axis spatial coordinates for the biological samples in the set of data based on the determined two dimensional location of the at least one control sample;
filters the quantitative values of for at least some of the biological samples, from the unwanted values, in the set of data, based on the identified x-axis and y-axis spatial coordinates; and
analyzes the filtered quantitative values of each biological sample, respectively, by comparing the quantitative value of a first portion of the filtered quantitative values of each respective biological sample at a first pair of x-axis and y-axis spatial coordinates with a sum of the quantitative value of the first portion and the quantitative value of a second portion of the filtered quantitative values of each respective biological sample at a second pair of x-axis and y-axis spatial coordinates to produce a displayable report of the analyzed quantitative values. - View Dependent Claims (2, 3, 4, 5, 6, 8, 9)
identifies the x-axis and y-axis spatial coordinates for the biological samples in the set of data based on determined lane and scan numbers of the at least one control sample and, identifies scan numbers for the biological samples that are within a predetermined stringency amount of the scan number of the at least one control sample.
-
-
5. The system of claim 1 wherein the biological assay device includes a DNA sequencer for analyzing an electrophoresis gel having lanes and rows of mitochondrial and extramitochondrial samples, and wherein the processor determines a percent mitochondrial load of the biological samples based on the filtered quantitative values.
-
6. The system of claim 1 wherein the computer includes a network interface, and wherein the computer is coupled to the biological assay device through the network interface.
-
8. The system of claim 1 wherein the computer includes a spreadsheet application, and wherein the processor executes at least one macro for the spreadsheet application that performs the identifying x-axis and y-axis values and the filtering of the quantitative values.
-
9. The system of claim 1 wherein the processor provides slope correction and stringency adjustments for filtering the quantitative values.
-
7. The system of claim wherein the computer includes a web browser, and wherein the displayable report is provided to the web browser for transmission over the Internet.
-
10. A method for generating information based on biological samples, comprising:
-
receiving a set of data produced from testing a set of biological samples, wherein the biological samples include test samples and at least one control sample related to at least some of the test samples by space, time or frequency, wherein the set of data includes first and second values and a quantitative value for each test sample;
identifying first or second values corresponding to the control sample in the set of data;
identifying first or second values for at least some of the test samples in the set of data based on the identified first or second values of the control sample and the spatial, temporal or frequency relationship between the control sample and the at least some of the test samples;
filtering the quantitative values for the at least some of the test samples in the set of data based on the identified first or second values; and
analyzing the filtered quantitative values of a selected test sample by comparing the quantitative value of a first portion of the filtered quantitative values of the selected test sample at a first pair of x-axis and y-axis spatial coordinates with a sum of the quantitative value of the first portion and the quantitative value of a second portion of the filtered quantitative values of the selected test sample at a second pair of x-axis and y-axis spatial coordinates. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
identifying a lane for the test samples in the set of data based on determined lane and scan numbers of the control sample; and
, identifying scan numbers for the test samples that are within a predetermined stringency amount of the scan number of the control sample.
-
-
18. The method of claim 10 wherein the first and second values in the set of data correspond respectively to x-axis and y-axis spatial coordinates and wherein identifying first or second values for at least some of the test samples in the set of data identifies x- or y-axis coordinates for the test samples that are within a predetermined stringency amount of x- or y-axis coordinates of the control sample.
-
19. A computer-readable medium for storing computer readable instructions, the instructions being capable of programming a computer to perform a method, the method comprising:
-
receiving a set of data produced from testing a set of biological samples, wherein the biological samples include test samples and at least one control sample related to at least some of the test samples by space, time or frequency, wherein the set of data includes first and second values and a quantitative value for each test sample;
identifying first or second values corresponding to the control sample in the set of data;
identifying first or second values for at least some of the test samples in the set of data based on the identified first or second values of the control sample and the spatial, temporal or frequency relationship between the control sample and the at least some of the test samples;
filtering the quantitative values for the at least some of the test samples in the set of data based on the identified first or second values; and
analyzing the filtered quantitative values of a selected one of the test samples by comparing the quantitative value of a first portion of the filtered quantitative values of the selected test sample at a first pair of x-axis and y-axis spatial coordinates with a sum of the quantitative value of the first portion and the quantitative value of a second portion of the filtered quantitative values of the selected test sample at a second pair of x-axis and y-axis spatial coordinates. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
identifying a lane for the test samples in the set of data based on determined lane and scan numbers of the control sample; and
,identifying scan numbers for the test samples that are within a predetermined stringency amount of the scan number of the control sample.
-
-
27. The computer-readable medium of claim 19 wherein the first and second values in the set of data correspond respectively to x-axis and y-axis spatial coordinates and wherein identifying first or second values for at least some of the test samples in the set of data includes identifying x- or y-axis coordinates for the test samples that are within a predetermined amount of the x- or y-axis coordinates of the control sample.
-
28. A method for displaying to a user information from biological samples, comprising:
-
receiving a set of data produced from testing a set of biological samples, wherein the biological samples include test samples and at least one control sample related to at least some of the test samples by space, time or frequency, and wherein the set of data includes first and second values and a quantitative value for each test sample;
identifying first or second values corresponding to the control sample in the set of data;
identifying first or second values for at least some of the test samples in the set of data based on the identified first or second values of the control sample and the spatial, temporal or frequency relationship between the control sample and the at least some of the test samples;
filtering the quantitative values for the at least some of the test samples in the set of data based on the identified first or second values;
analyzing each of the test samples by calculating a ratio of a quantitative value of a first component of the first or second value for each test sample to the sum of the quantitative value of the first component and a quantitative value of a second component of the first or second value for each respective test sample; and
displaying the filtered quantitative values, so that only quantitative values for test samples of interest are displayed. - View Dependent Claims (29, 30, 31)
-
-
32. A method for generating information based on biological samples, comprising:
-
receiving a set of data produced from testing a set of biological samples, wherein the biological samples includes a test sample;
filtering the test sample to separate the test sample into at least first and second components;
analyzing the test sample by calculating a ratio of the quantitative value of the first component to the quantitative value of a sum of the first and second components; and
displaying results of the analysis. - View Dependent Claims (33, 34, 35)
filtering the control sample in conjunction with filtering the test sample wherein the control sample is related to the first or second components of the test sample by space, time or frequency; and
identifying the first or second components related to the control sample in the set of data, whereby the analysis is performed on a selected one of the first and second identified components.
-
Specification