Biological data set comparison method
Abstract
A method of identifying a relationship between a set of one or more candidate biomolecules and a set of one or more reference biomolecules, the method including: inputting to a computer a query set describing the one or more candidate biomolecules; comparing the query set with a target database describing the one or more reference biomolecules, wherein the one or more reference biomolecules are grouped into one or more buckets and wherein the one or more reference biomolecules of each bucket share a common property; counting a number of matches between each query set and each bucket of the target database; and statistically analyzing the number of matches to each bucket, wherein the presence of a statistically significant match identifies a relationship between the query set and a bucket of the target database.
66 Claims
-
1. A method of identifying a relationship between one or more candidate biomolecules and one or more reference biomolecules, the method comprising:
-
(a) inputting to a computer a query set describing the one or more candidate biomolecules;
(b) comparing the query set with a target database describing the one or more reference biomolecules, wherein the one or more reference biomolecules are grouped into one or more buckets, and wherein the one or more reference biomolecules of each bucket share a common property;
(c) counting a number of matches between each query set and each bucket of the target database; and
(d) statistically analyzing each match, wherein the presence of a statistically significant match identifies a relationship between the query set and a bucket of the target database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
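Steps (a) through (d) of claim 1 can be illustrated with a minimal sketch. The claim does not name a particular statistic, so the one-sided hypergeometric test used here is an assumption, as are the function names `hypergeom_tail` and `compare_query_to_buckets`, the `universe_size` parameter, and the 0.05 threshold. The sketch also assumes that every query member belongs to the same universe of biomolecules from which the buckets were drawn.

```python
from math import comb


def hypergeom_tail(k, universe, bucket_size, query_size):
    """P(X >= k) for X ~ Hypergeometric: the chance of seeing at least k
    bucket members when query_size items are drawn without replacement
    from `universe` items, bucket_size of which belong to the bucket."""
    upper = min(bucket_size, query_size)
    total = comb(universe, query_size)
    favourable = sum(
        comb(bucket_size, i) * comb(universe - bucket_size, query_size - i)
        for i in range(k, upper + 1)
    )
    return favourable / total


def compare_query_to_buckets(query_set, buckets, universe_size, alpha=0.05):
    """Return {bucket_name: (matches, p_value, is_significant)}.

    query_set     -- set of candidate biomolecule identifiers (step a)
    buckets       -- dict mapping bucket name to a set of reference
                     biomolecule identifiers sharing a property (step b)
    universe_size -- hypothetical total number of distinct biomolecules
    alpha         -- hypothetical significance threshold
    """
    results = {}
    for name, members in buckets.items():
        matches = len(query_set & members)            # step (c): count matches
        p_value = hypergeom_tail(                     # step (d): assumed test
            matches, universe_size, len(members), len(query_set)
        )
        results[name] = (matches, p_value, p_value < alpha)
    return results
```

With a query of three identifiers that all fall in a three-member bucket, out of a universe of 20, the sketch reports three matches and a p-value of 1/1140, which falls below the assumed threshold. A bucket with no matches receives a p-value of 1.0.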
-
-
12. A computer-readable storage device embodying a program of instructions executable by a computer to perform method steps for identifying a relationship between one or more candidate biomolecules and one or more reference biomolecules, the method steps comprising:
-
(a) inputting to a computer a query set describing one or more candidate biomolecules;
(b) comparing the query set with a target database describing one or more reference biomolecules, the one or more reference biomolecules of the target database grouped into one or more buckets, wherein the one or more reference biomolecules of each bucket share a common property;
(c) counting a number of matches between each query set and each bucket of the target database; and
(d) statistically analyzing each match, wherein the presence of a statistically significant match identifies a relationship between a query set and one or more buckets of a target database. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method of identifying a relationship between two or more region sets, each region set describing one or more candidate biomolecules, and a target database describing one or more reference biomolecules grouped into one or more buckets, the method comprising:
-
(a) providing a query set describing two or more region sets, each region set comprising one or more candidate biomolecule sequences extracted from one region;
(b) comparing the query set with target database sequences describing one or more reference biomolecule sequences, wherein the target database sequences are grouped into one or more buckets, and wherein the one or more reference biomolecules of each bucket share a common property;
(c) counting a number of matches between each query set and each bucket of the target database; and
(d) statistically analyzing each match, wherein the presence of a statistically significant match identifies a relationship between the query set and the bucket of the target database. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
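The counting step of claim 23 differs from claim 1 in that the query set holds several region sets, so counts are kept per region and per bucket. The sketch below is illustrative only: exact string identity stands in for whatever sequence comparison an implementation would actually use, and the name `count_region_matches` and the dictionary layout are assumptions.

```python
from collections import Counter


def count_region_matches(region_sets, buckets):
    """Count matches for every (region, bucket) pair.

    region_sets -- dict mapping a region name to a list of candidate
                   sequences extracted from that region
    buckets     -- dict mapping a bucket name to a set of reference
                   sequences sharing a common property
    Exact string identity is used as a stand-in for sequence comparison.
    """
    counts = Counter()
    for region, sequences in region_sets.items():
        for bucket, references in buckets.items():
            counts[(region, bucket)] = sum(
                1 for seq in sequences if seq in references
            )
    return counts
```

Each entry of the resulting table could then be passed to a significance test of the implementer's choosing, one region set at a time.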
-
-
33. A computer-readable storage device embodying a program of instructions executable by a computer to perform method steps for identifying a relationship between two or more region sets, each region set describing one or more candidate biomolecules, and a target database describing one or more reference biomolecules grouped into one or more buckets, the method steps comprising:
-
(a) providing a query set describing two or more region sets, each region set comprising one or more candidate biomolecule sequences extracted from one genetic region;
(b) comparing the query set with target database sequences describing one or more reference biomolecule sequences, wherein the target database sequences are grouped into one or more buckets, and wherein the one or more reference biomolecules of each bucket share a common property;
(c) counting a number of matches between each query set and each bucket of the target database; and
(d) statistically analyzing each match, wherein the presence of a statistically significant match identifies a relationship between the query set and the bucket of the target database. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41)
-
-
42. A computer-readable medium having stored thereon a data structure having multiple data fields comprising:
-
(a) a first data field containing data representing a bucket;
(b) a second data field containing data representing a name for the bucket; and
(c) a third data field containing data representing a list of members of the bucket, wherein the members have a common property. - View Dependent Claims (43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
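The three data fields of claim 42 map naturally onto a small record type. The following is a minimal sketch, not the claimed structure itself: the class name `Bucket`, the field names, and the choice of an integer identifier for the first field are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Bucket:
    bucket_id: int                      # (a) data representing the bucket (assumed: integer key)
    name: str                           # (b) name for the bucket
    members: list[str] = field(default_factory=list)  # (c) members sharing a common property


# Hypothetical example: identifiers and names are illustrative only.
kinases = Bucket(bucket_id=1, name="protein kinases", members=["P1", "P2"])
```

Using `default_factory` gives each bucket its own member list, so adding a member to one bucket cannot leak into another.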
-
-
55. A method of making a target database, the method comprising:
-
(a) identifying a source of informative content;
(b) arranging informative content from the source of informative content into a set of buckets, wherein the buckets are given names;
(c) gathering the names of the buckets and a list of biomolecules present in each bucket; and
(d) creating and loading into a database data fields containing data representing:
(i) the set of buckets;
(ii) the list of biomolecules present in each bucket; and
(iii) a description for each biomolecule present in each bucket. - View Dependent Claims (56, 57, 58, 59, 60)
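The database-building steps of claim 55 can be sketched with Python's standard `sqlite3` module. Everything specific here is an assumption made for illustration: the input format (an iterable of `(biomolecule, description, bucket_name)` tuples standing in for the "source of informative content"), the table and column names, and the use of an in-memory SQLite database.

```python
import sqlite3
from collections import defaultdict


def build_target_database(records):
    """Build an in-memory target database from (biomolecule, description,
    bucket_name) tuples and return the open connection."""
    # (b) arrange the content into named buckets
    buckets = defaultdict(list)
    descriptions = {}
    for biomolecule, description, bucket_name in records:
        buckets[bucket_name].append(biomolecule)
        descriptions[biomolecule] = description

    # (d) create the data fields and load them
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE bucket (name TEXT PRIMARY KEY)")
    conn.execute(
        "CREATE TABLE biomolecule (id TEXT PRIMARY KEY, description TEXT)"
    )
    conn.execute(
        "CREATE TABLE bucket_member ("
        "bucket TEXT REFERENCES bucket(name), "
        "biomolecule TEXT REFERENCES biomolecule(id))"
    )
    # (c) the gathered bucket names and member lists drive the inserts
    conn.executemany("INSERT INTO bucket VALUES (?)", [(n,) for n in buckets])
    conn.executemany(
        "INSERT INTO biomolecule VALUES (?, ?)", list(descriptions.items())
    )
    conn.executemany(
        "INSERT INTO bucket_member VALUES (?, ?)",
        [(n, m) for n, ms in buckets.items() for m in ms],
    )
    conn.commit()
    return conn
```

Keeping membership in its own table lets one biomolecule appear in several buckets while its description is stored only once.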
-
-
61. A computer-readable storage device embodying a program of instructions executable by a computer to perform method steps for making a target database, the method steps comprising:
-
(a) identifying a source of informative content;
(b) arranging informative content from the source of informative content into a set of buckets, wherein the buckets are given names;
(c) gathering the names of the buckets and a list of biomolecules present in each bucket; and
(d) creating and loading into a database data fields containing data representing:
(i) the set of buckets;
(ii) the list of biomolecules present in each bucket; and
(iii) a description for each biomolecule present in each bucket. - View Dependent Claims (62, 63, 64, 65, 66)
-
Specification