System and method for scientific information knowledge management
Abstract
The present invention relates to methods, systems and apparatus for capturing, integrating, organizing, navigating and querying large-scale data from high-throughput biological and chemical assay platforms. It provides a highly efficient meta-analysis infrastructure for performing research queries across a large number of studies and experiments from different biological and chemical assays, data types and organisms, as well as systems to build and add to such an infrastructure.
46 Citations
39 Claims
1. A computer-implemented method of providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and the pre-existing feature groups each including a list of features related by structure or function, wherein the features are biological or chemical entities or units of biological or chemical information, the method comprising:
(a) correlating by one or more processors of a computer system an input feature set against a plurality or all of the pre-existing feature sets in the knowledge base, the input feature set including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;
(b) correlating by one or more processors of the computer system the input feature set against one or more pre-existing feature groups in the knowledge base;
(c) storing on one or more storage devices correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets; and
prior to (a), mapping by one or more processors of the computer system each feature in the input feature set to one or more mapping identifiers in the knowledge base, wherein each mapping identifier represents a globally unique feature in the knowledge base.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 18, 19, 36
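Correlation steps (a) and (b) of claim 1 can be illustrated with a minimal Python sketch. The claim does not name a correlation statistic, so the Jaccard overlap used here is an assumption chosen for brevity, and the class names, function names, and identifiers such as "G1" and "pathway_X" are hypothetical. The sketch assumes the features have already been mapped to unique identifiers.

```python
from dataclasses import dataclass


@dataclass
class FeatureSet:
    """A list of features, each with an associated statistic (e.g. fold change)."""
    name: str
    stats: dict  # feature identifier -> statistic


@dataclass
class FeatureGroup:
    """Features related by structure or function; no statistics attached."""
    name: str
    members: frozenset


def jaccard(a, b):
    """Overlap score in [0, 1]. ASSUMPTION: stands in for the unspecified measure."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def correlate(input_set, feature_sets, feature_groups):
    """Score the input feature set against every stored set (a) and group (b)."""
    scores = {}
    for fs in feature_sets:
        scores[("set", fs.name)] = jaccard(input_set.stats, fs.stats)
    for fg in feature_groups:
        scores[("group", fg.name)] = jaccard(input_set.stats, fg.members)
    return scores


# Hypothetical data for illustration only.
query = FeatureSet("input", {"G1": 2.1, "G2": -1.4, "G3": 0.9})
known_sets = [FeatureSet("study_A", {"G1": 1.8, "G2": -2.0, "G4": 1.1})]
known_groups = [FeatureGroup("pathway_X", frozenset({"G2", "G3"}))]
result = correlate(query, known_sets, known_groups)
```

This overlap score ignores the statistics themselves. A measure that also uses their rank or direction would fit the same interface.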
14. A computer program product comprising a machine readable non-transitory medium on which is provided program instructions for providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, wherein the features are biological or chemical entities or units of biological or chemical information, the program instructions comprising:
(a) code for receiving an input feature set, the input feature set including a list of features and associated statistics indicating one or more of: differential expression, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;
(b) code for mapping each feature in the input feature set to one or more mapping identifiers in the knowledge base, wherein each mapping identifier represents a globally unique feature in the knowledge base;
(c) code for correlating the input feature set against a plurality or all of the pre-existing feature sets in the knowledge base;
(d) code for correlating the input feature set against one or more pre-existing feature groups in the knowledge base, wherein the feature groups provide collections of features having structural and/or functional characteristics in common; and
(e) code for storing correlation information generated in (c) and (d) for use in responding to queries involving feature groups or feature sets.
Dependent claims: 15, 16, 17, 38
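The mapping step, in which each input feature is mapped to one or more mapping identifiers, might look like the following Python sketch. The lookup table, the identifier format ("GENE:0001"), and the rule for collapsing several input features onto one identifier are all assumptions made for illustration. The claims do not specify them.

```python
# Hypothetical lookup table: platform-specific identifier -> canonical identifiers.
MAPPING_TABLE = {
    "probe_101": ["GENE:0001"],
    "probe_102": ["GENE:0002", "GENE:0003"],  # one feature, two mapping identifiers
    "alias_A": ["GENE:0001"],                 # a synonym of probe_101's target
}


def map_features(stats, table):
    """Translate {input feature: statistic} into {canonical identifier: statistic}.

    Returns the mapped dictionary and the list of features with no mapping.
    ASSUMPTION: when several input features collapse onto the same identifier,
    the statistic with the largest magnitude is kept.
    """
    mapped, unmapped = {}, []
    for feature, value in stats.items():
        identifiers = table.get(feature)
        if not identifiers:
            unmapped.append(feature)
            continue
        for canonical in identifiers:
            if canonical not in mapped or abs(value) > abs(mapped[canonical]):
                mapped[canonical] = value
    return mapped, unmapped


raw = {"probe_101": 1.5, "alias_A": -2.0, "probe_102": 0.7, "probe_999": 3.0}
mapped, unmapped = map_features(raw, MAPPING_TABLE)
```

Mapping before correlation lets feature sets from different platforms or naming schemes be compared on the same identifiers.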
20. A computer-implemented method of providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and each feature group providing a list of related features without associated statistics, wherein the features are biological or chemical entities or units of biological or chemical information, the method comprising:
(a) correlating by one or more processors of a computer system an input feature set against a plurality of pre-existing feature sets in the knowledge base, the input feature set including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;
(b) correlating by one or more processors of a computer system the input feature set against a plurality of pre-existing feature groups in the knowledge base, and
(c) storing on one or more storage devices correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 37
28. A computer program product comprising a machine readable non-transitory medium on which is provided program instructions for providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and each feature group providing a list of related features without associated statistics, wherein the features are biological or chemical entities or units of biological or chemical information, the program instructions comprising:
(a) code for correlating an input feature set against a plurality of pre-existing feature sets in the knowledge base, the input feature set including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;
(b) code for correlating the input feature set against a plurality of pre-existing feature groups in the knowledge base, and
(c) code for storing correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets.
Dependent claims: 29, 30, 31, 32, 33, 34, 35, 39
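The storing step shared by the independent claims, which keeps correlation information for use in responding to later queries, could be sketched as below. The table layout, the function names, and the use of an in-memory SQLite database are assumptions for illustration, not details taken from the claims.

```python
import sqlite3


def open_store():
    """Create an in-memory store for precomputed correlation scores."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE correlation ("
        " input_set TEXT, target_kind TEXT, target_name TEXT, score REAL,"
        " PRIMARY KEY (input_set, target_kind, target_name))"
    )
    return conn


def store_scores(conn, input_set, scores):
    """Persist {(kind, name): score}, where kind is 'set' or 'group'."""
    rows = [(input_set, kind, name, score) for (kind, name), score in scores.items()]
    conn.executemany("INSERT OR REPLACE INTO correlation VALUES (?, ?, ?, ?)", rows)
    conn.commit()


def top_matches(conn, input_set, kind, limit=5):
    """Answer a query from stored scores alone, without recomputing correlations."""
    cursor = conn.execute(
        "SELECT target_name, score FROM correlation"
        " WHERE input_set = ? AND target_kind = ?"
        " ORDER BY score DESC LIMIT ?",
        (input_set, kind, limit),
    )
    return cursor.fetchall()


# Hypothetical scores for illustration only.
conn = open_store()
store_scores(
    conn,
    "input",
    {("set", "study_A"): 0.5, ("set", "study_B"): 0.8, ("group", "pathway_X"): 0.25},
)
best_sets = top_matches(conn, "input", "set")
```

Because the scores are computed when a feature set is imported, a later query is a sorted lookup and does not need to scan every study again.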
Specification