Clinical data aggregation architecture and platform
Abstract
A clinical data aggregation system and method, comprising ingesting, transforming and storing data in a clinical data lake. The present invention uses an Extract, Load and Transform (ELT) design principle rather than the traditional Extract, Transform and Load (ETL) design principle. The data hub platform leverages modern noSQL databases, which makes the platform highly flexible and allows studies of any design complexity to be configured with relative ease.
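The difference between ELT and ETL described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the specification: the in-memory `raw_store`, the function names, and the sample field names (`SUBJ`, `WT_LB`) are all hypothetical stand-ins. The point it shows is that source rows are loaded unchanged, so several target shapes can later be derived from the same stored data without extracting again.

```python
from typing import Callable

# Hypothetical in-memory stand-in for the data hub's store (assumption).
raw_store: list[dict] = []

def extract(source_rows: list[dict]) -> list[dict]:
    """Extract: copy rows out of a (hypothetical) external source."""
    return [dict(row) for row in source_rows]

def load(rows: list[dict]) -> None:
    """Load: persist rows exactly as received, with no transformation."""
    raw_store.extend(rows)

def transform(rule: Callable[[dict], dict]) -> list[dict]:
    """Transform: applied afterwards, over the rows already in the store."""
    return [rule(row) for row in raw_store]

# Illustrative source rows; the field names are made up for this sketch.
source = [{"SUBJ": "001", "WT_LB": "154"}, {"SUBJ": "002", "WT_LB": "198"}]
load(extract(source))

# Two different target shapes derived from the same raw data.
as_kg = transform(
    lambda r: {"subject": r["SUBJ"], "weight_kg": round(int(r["WT_LB"]) * 0.4536, 1)}
)
as_lb = transform(lambda r: {"subject": r["SUBJ"], "weight_lb": int(r["WT_LB"])})
```

In an ETL design the conversion would run before `load`, and the original `WT_LB` strings would no longer be available in the store for a second, different transformation.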
26 Claims
1. A system for loading and mapping source data into a predetermined standard format, the system comprising:
a data hub configured to receive the source data from one or more external databases and to store the received source data;
a graphical user interface comprised of multiple screens for displaying on a display screen, the graphical user interface having:
i) an area having a user selectable list of data formats of the source data;
ii) an area configured to allow a user to input a plurality of key identifiers for the source data, wherein each key identifier identifies a predetermined data item or items of the source data;
iii) an area for setting mapping rules that are operative to map a plurality of data items to a plurality of target fields or variables of the predetermined standard format;
iv) an area for inputting a plurality of expressions, wherein each of the expressions is used to transform a predetermined data item or items of the source data into a target value;
a hardware processing system, the hardware processing system programmed with instructions for executing on the hardware processing system for:
1) extracting data items of the source data from the one or more external databases;
2) loading the extracted data items into the data hub as pairs of key identifier and value for each data item or items; and
3) transforming the loaded source data using the mapping rules into a predetermined standard format.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
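Steps 2) and 3) of claim 1 can be pictured with a small Python sketch. It is only an illustration under stated assumptions: the `MappingRule` class, the helper names, and the sample key identifiers and target fields (`PT_ID`, `SUBJECT_ID`, and so on) are hypothetical and do not come from the patent. Each rule ties a key identifier to a target field and may carry an expression that turns the source value into the target value.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional

@dataclass
class MappingRule:
    """Hypothetical rule: key identifier -> target field, plus optional expression."""
    key_id: str
    target_field: str
    expression: Optional[Callable[[Any], Any]] = None

def load_as_pairs(record: dict) -> list[tuple[str, Any]]:
    """Step 2: keep each extracted item as a (key identifier, value) pair."""
    return list(record.items())

def apply_rules(pairs: list[tuple[str, Any]], rules: list[MappingRule]) -> dict:
    """Step 3: map loaded pairs to target fields, applying any expression."""
    by_key = {rule.key_id: rule for rule in rules}
    target: dict = {}
    for key_id, value in pairs:
        rule = by_key.get(key_id)
        if rule is None:
            continue  # items without a rule stay in the hub, untouched
        target[rule.target_field] = rule.expression(value) if rule.expression else value
    return target

# Illustrative record and rules (all names are assumptions for this sketch).
pairs = load_as_pairs({"PT_ID": "001", "GENDER": "f", "SITE": "12"})
rules = [
    MappingRule("PT_ID", "SUBJECT_ID"),
    MappingRule("GENDER", "SEX", expression=str.upper),
]
standard_record = apply_rules(pairs, rules)
```

Keeping the rules as data rather than code is one plausible reading of why the claim places them in a user interface: they can be edited without touching the loaded pairs.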
11. A system for loading and mapping source data into a predetermined standard format, the system comprising:
a data hub configured to receive the source data from one or more external databases and to store the received source data;
an inbound data manager, as a component of the data hub, that acts as an interface between the data hub and the one or more external databases and is configured to load the source data to the data hub with no transformation to the source data;
a graphical user interface comprised of multiple screens for displaying on a display screen, the graphical user interface having:
i) a first area having a user selectable list of types of source data that can be loaded by the system;
ii) a second area having a user selectable list of data formats of the source data;
iii) a third area configured to allow a user to input a plurality of key identifiers for the source data, wherein each key identifier identifies a predetermined data item or items of the source data;
iv) a fourth area for setting mapping rules by mapping a plurality of data items to a plurality of target fields or variables of the predetermined standard format;
v) a fifth area for inputting a plurality of expressions, wherein each of the expressions is used to transform a predetermined data item or items of the source data into a target value;
a hardware processing system, the hardware processing system programmed with instructions for executing on the hardware processing system for:
1) extracting data items of the source data from the one or more external databases;
2) loading the extracted data items of the source data without transformation into the data hub as pairs of key identifier and value for each data item or items;
3) transforming the loaded source data using the mapping rules into a predetermined standard format;
4) comparing loaded source data with stored ontology data; and
5) mapping data items from the source data to variables of the predetermined standard format.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
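Steps 4) and 5) of claim 11 compare loaded source data with stored ontology data and then map items to standard variables. The claim does not say how the ontology is stored or compared, so the following Python sketch makes its own assumptions: the ontology is a plain dictionary from a standard variable to a set of known synonyms, and comparison is a lookup on a normalised name. All variable names and synonyms are invented for illustration.

```python
from typing import Optional

# Hypothetical ontology: standard variable -> known source-side synonyms.
ONTOLOGY: dict[str, set[str]] = {
    "SEX": {"sex", "gender"},
    "BIRTH_DATE": {"dob", "birthdate", "date of birth"},
}

def normalise(name: str) -> str:
    """Lower-case, treat underscores as spaces, and collapse whitespace."""
    return " ".join(name.lower().replace("_", " ").split())

def map_with_ontology(
    source_keys: list[str], ontology: dict[str, set[str]]
) -> dict[str, Optional[str]]:
    """Return, per source key, the matching standard variable or None."""
    lookup = {
        normalise(synonym): variable
        for variable, synonyms in ontology.items()
        for synonym in synonyms
    }
    return {key: lookup.get(normalise(key)) for key in source_keys}

mapping = map_with_ontology(["Gender", "DATE_OF_BIRTH", "SITE"], ONTOLOGY)
```

Keys that come back as `None` (here `SITE`) are the ones a user would still need to map by hand under this sketch.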
20. A system for loading and mapping source data into a predetermined standard format, the system comprising:
a data hub configured to receive the source data from one or more external databases and to store the received source data in a noSQL database format;
an inbound data manager, as a component of the data hub, that acts as an interface between the data hub and the one or more external databases and is configured to load the source data to the data hub with no transformation to the source data;
a graphical user interface comprised of multiple screens for displaying on a display screen, the graphical user interface having:
i) a first area having a user selectable list of types of source data that can be loaded by the system;
ii) a second area having a user selectable list of data formats of the source data;
iii) a third area configured to allow a user to input a plurality of key identifiers for the source data, wherein each key identifier identifies a predetermined data item or items of the source data;
iv) a fourth area for setting configurable mapping rules by mapping a plurality of data items to a plurality of target fields or variables of the predetermined standard format;
v) a fifth area for inputting a plurality of expressions, wherein each of the expressions is used to transform a predetermined data item or items of the source data into a target value;
a hardware processing system, the hardware processing system programmed with instructions for executing on the hardware processing system for:
1) extracting data items of the source data from the one or more external databases;
2) loading the extracted data items of the source data without transformation into the data hub as pairs of key identifier and value for each data item or items;
3) transforming the loaded source data using the configurable mapping rules and expressions through a built-in rules engine into a predetermined standard format;
4) comparing loaded source data with stored ontology data and mapping data items from the source data to variables of the predetermined standard format;
5) comparing the source data with a model of the predetermined standard format and using pattern matching algorithms to automatically map the source data to the predetermined standard format; and
6) learning previously applied mapping rules and automatically applying said mapping rules to future occurrences of similar data transformations.
Dependent claims: 21, 22, 23, 24, 25, 26
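Steps 5) and 6) of claim 20 refer to pattern matching algorithms and to learning previously applied mapping rules, without naming a specific technique. As a rough sketch only, the Python below swaps in fuzzy string matching from the standard library's `difflib` for the pattern matching, and a dictionary of confirmed mappings for the learning step. The `AutoMapper` class, its methods, the cutoff value, and the field names are all hypothetical.

```python
import difflib
from typing import Optional

class AutoMapper:
    """Hypothetical auto-mapper: fuzzy name matching plus remembered mappings."""

    def __init__(self, standard_fields: list[str], cutoff: float = 0.6) -> None:
        self.standard_fields = list(standard_fields)
        self.cutoff = cutoff  # minimum similarity ratio (assumed value)
        self.learned: dict[str, str] = {}  # normalised source name -> field

    @staticmethod
    def _norm(name: str) -> str:
        return name.lower().replace("_", "")

    def suggest(self, source_name: str) -> Optional[str]:
        """Prefer a previously confirmed mapping, else the closest field name."""
        key = self._norm(source_name)
        if key in self.learned:
            return self.learned[key]
        candidates = {self._norm(f): f for f in self.standard_fields}
        hits = difflib.get_close_matches(key, list(candidates), n=1, cutoff=self.cutoff)
        return candidates[hits[0]] if hits else None

    def confirm(self, source_name: str, standard_field: str) -> None:
        """Record a mapping a user applied so later occurrences reuse it."""
        self.learned[self._norm(source_name)] = standard_field

mapper = AutoMapper(["SUBJECT_ID", "VISIT_DATE", "SEX"])
first_guess = mapper.suggest("visit_dt")   # close enough to VISIT_DATE
before = mapper.suggest("GENDER")          # no field name is similar enough
mapper.confirm("GENDER", "SEX")            # a user maps it once by hand
after = mapper.suggest("gender")           # the remembered mapping is reused
```

Name similarity alone cannot link `GENDER` to `SEX`, which is why the sketch falls back on the remembered mapping; a production system would presumably combine this with the ontology comparison of step 4).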
Specification