System and method for deriving a hierarchical event based database optimized for analysis of biological systems
First Claim
1. A computer implemented method for inferring a probability of an Ith inference relating to a biological system, wherein I is an integer reflecting how many times a recursion process has been conducted, the computer implemented method comprising:
- receiving a Ith query at a database, on a data processing system, regarding an Ith fact related to the biological system, wherein the Ith fact becomes a compound fact that includes multiple sub-facts on a subsequent iteration of the recursion process, wherein the Ith inference is absent from the database, wherein the database comprises a plurality of divergent data, wherein the plurality of divergent data includes a plurality of cohort data, wherein each datum of the database is conformed to the dimensions of the database, wherein each datum of the plurality of data has associated metadata and an associated key, wherein the associated metadata comprises data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum;
establishing the Ith fact as a frame of reference for the Ith query, by a processing unit of the data processing system;
mathematically refocusing the database such that the fact is modeled as a first center of an inverted star schema, and modeling each datum of the plurality of data in the inverted star schema around the fact;
applying a Ith set of rules to the Ith query, by the processing unit, wherein the Ith set of rules are determined for the Ith query according to a Jth set of rules, wherein J is equal to I-1, wherein the set of rules determine how the plurality of data are to be compared to the Ith fact, wherein the Ith set of rules is prioritized, and wherein the set of rules determine a search space of for the Ith query including the associated metadata and associated key, wherein the Jth set of rules is a rule set used in a previous iteration of the recursive process;
executing the Ith query, by the processing unit, to create the probability of the inference, wherein the probability of the inference is determined from comparing the Ith search space according to the Ith set of rules;
automatically generating cohort data for the Ith fact; and
storing the probability of the Ith inference and the cohort data for the Ith fact by the processing unit in a memory element of the data processing system, wherein the Ith inference and the cohort data are stored in the database at an atomic level;
wherein the first inference relating to a biological system is selected from the group consisting of an interaction between the biological system and an environmental factor, monitoring the biological system, monitoring the environmental factor, a relationship between a biological pathway and a drug, a relationship between the biological pathway and a food, a relationship between the biological pathway and a substance interacting with the biological pathway, a relationship between the biological pathway and a gene, a relationship between the biological pathway and the environmental factor, and combinations thereof.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method, apparatus, and computer usable program code for inferring a probability of a first inference absent from a database at which a query regarding the inference is received. Each datum of the database is conformed to the dimensions of the database. Each datum of the plurality of data has associated metadata and an associated key. The associated metadata includes data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum. The query is used as a frame of reference for the search. The database returns a probability of the correctness of the first inference based on the query and on the data.
118 Citations
11 Claims
-
1. A computer implemented method for inferring a probability of an Ith inference relating to a biological system, wherein I is an integer reflecting how many times a recursion process has been conducted, the computer implemented method comprising:
-
receiving a Ith query at a database, on a data processing system, regarding an Ith fact related to the biological system, wherein the Ith fact becomes a compound fact that includes multiple sub-facts on a subsequent iteration of the recursion process, wherein the Ith inference is absent from the database, wherein the database comprises a plurality of divergent data, wherein the plurality of divergent data includes a plurality of cohort data, wherein each datum of the database is conformed to the dimensions of the database, wherein each datum of the plurality of data has associated metadata and an associated key, wherein the associated metadata comprises data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum; establishing the Ith fact as a frame of reference for the Ith query, by a processing unit of the data processing system; mathematically refocusing the database such that the fact is modeled as a first center of an inverted star schema, and modeling each datum of the plurality of data in the inverted star schema around the fact; applying a Ith set of rules to the Ith query, by the processing unit, wherein the Ith set of rules are determined for the Ith query according to a Jth set of rules, wherein J is equal to I-1, wherein the set of rules determine how the plurality of data are to be compared to the Ith fact, wherein the Ith set of rules is prioritized, and wherein the set of rules determine a search space of for the Ith query including the associated metadata and associated key, wherein the Jth set of rules is a rule set used in a previous iteration of the recursive process; executing the Ith query, by the processing unit, to create the probability of the inference, wherein the probability of the inference is determined from comparing the Ith search space according to the Ith set of rules; automatically generating cohort data for the Ith fact; and storing the probability of the Ith inference and the cohort data for the Ith fact by the processing unit in a memory element of the data processing system, wherein the Ith inference and the cohort data are stored in the database at an atomic level; wherein the first inference relating to a biological system is selected from the group consisting of an interaction between the biological system and an environmental factor, monitoring the biological system, monitoring the environmental factor, a relationship between a biological pathway and a drug, a relationship between the biological pathway and a food, a relationship between the biological pathway and a substance interacting with the biological pathway, a relationship between the biological pathway and a gene, a relationship between the biological pathway and the environmental factor, and combinations thereof. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer implemented method for building a database capable of inferring a probability of an Ith inference relating to a biological system, wherein I is an integer reflecting how many times a recursion process has been conducted, the computer implemented method comprising:
-
establishing a database structure in a memory element of a data processing system, wherein the Ith fact becomes a compound fact that includes multiple sub-facts on a subsequent iteration of the recursion process, wherein the database structure is adapted to receive a plurality of divergent data, wherein in the database the plurality of divergent data includes a plurality of cohort data, wherein the database is adapted such that each datum of the database is conformed to the dimensions of the database, wherein the database is further adapted such that each datum of the plurality of data has an associated metadata and an associated key, wherein the associated metadata comprises data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum; mathematically refocusing the database such that the fact is modeled as a first center of an inverted star schema, and modeling each datum of the plurality of data in the inverted star schema around the fact; establishing a Jth set of rules, in the memory element of the data processing system for the database structure, the Jth set of rules comprising rules for determining a Ith set of rules to be applied to an Ith query submitted to the database, wherein the Ith query is related to the clinical application, wherein the Ith set of rules determines that a fact submitted with the Ith query will serve as a frame of reference when executing the Ith query, wherein the Ith set of rules determines an Ith first search space for the Ith query, including the associated metadata and associated keys, wherein the Jth set of rules is a rule set used in a previous iteration of a recursive process, wherein the Ith set of rules is prioritized, and wherein the Ith set of rules are adapted to create the probability of the Ith inference, wherein the probability of the Ith inference is determined from comparing the Ith search space according to the Ith set of rules using the Ith fact as the frame of reference by a processing unit of the data processing system, wherein the frame of reference is used to determine data to be searched and rules to apply to the Ith query; receiving a plurality of divergent data in the database by a processing unit on the data processing system; conforming the plurality of divergent data to the dimensions of the database, by the processing unit, to form a plurality of conformed data; associating the metadata and the key with each datum in the plurality of conformed data by the processing unit; and storing the database structure in the memory element of the data processing system; wherein the first inference relating to a biological system is selected from the group consisting of an interaction between the biological system and an environmental factor, monitoring the biological system, monitoring the environmental factor, a relationship between a biological pathway and a drug, a relationship between the biological pathway and a food, a relationship between the biological pathway and a substance interacting with the biological pathway, a relationship between the biological pathway and a gene, a relationship between the biological pathway and the environmental factor, and combinations thereof.
-
-
11. A database stored in a computer-usable medium, the database comprising:
-
a plurality of divergent data stored in a data structure on the computer-usable medium, wherein the computer usable medium comprises memory elements, wherein the Ith fact becomes a compound fact that includes multiple sub-facts on a subsequent iteration of the recursion process, wherein the plurality of divergent data includes a plurality of cohort data, wherein each datum of the database is conformed to the dimensions of the database, wherein each datum of the plurality of data has an associated metadata and an associated key, wherein the associated metadata comprises data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum; computer usable program code stored in the computer-readable storage medium for establishing an Ith fact relating to a clinical application, received in an Ith query relating to the clinical application, as a frame of reference for the Ith query; computer usable program code stored in the computer-usable medium for mathematically refocusing the database such that the fact is modeled as a first center of an inverted star schema, and modeling each datum of the plurality of data in the inverted star schema around the fact; computer usable program code stored in the computer-readable storage medium for applying an Ith set of rules to the Ith query, wherein the Ith set of rules are determined for the Ith query according to a Jth set of rules, wherein J is equal to I-1, wherein the Ith set of rules determine how the plurality of data are to be compared to the Ith fact, and wherein the Ith set of rules determine a Ith search space of the inverted star schema for the Ith query, wherein the Jth set of rules is a rule set used in a previous iteration of a recursive process; computer usable program code stored in the computer-readable storage medium for executing the Ith query to create a probability of an Ith inference, wherein the probability of the Ith inference is determined from comparing the Ith search space according to the Ith set of rules; computer usable program code stored in the computer-readable storage medium for storing the probability of Ith first inference in the database; wherein the first inference relating to a biological system is selected from the group consisting of an interaction between the biological system and an environmental factor, monitoring the biological system, monitoring the environmental factor, a relationship between a biological pathway and a drug, a relationship between the biological pathway and a food, a relationship between the biological pathway and a substance interacting with the biological pathway, a relationship between the biological pathway and a gene, a relationship between the biological pathway and the environmental factor, and combinations thereof.
-
Specification