Aggregation, partitioning, and management of healthcare data for efficient storage and processing
First Claim
1. A computerized method carried out by at least one server having at least one processor for aggregating, partitioning, and storing raw healthcare data for efficient retrieval, the method comprising:
- receiving, at a staging platform associated with a first data center, healthcare data from a plurality of disparate healthcare data sources of a distributed computing environment;
implementing a multi-level partitioning strategy for efficient access to a plurality of healthcare data batches, the multi-level partitioning strategy eliminating the need to process healthcare data that is not required for a computing solution, wherein the multi-level partitioning strategy comprises;
(A) aggregating, using the at least one processor, one or more subsets of the healthcare data having the same file type into batches; and
(B) partitioning the batches according to healthcare data source in order to handle a variety of payload sizes and data types while reducing an amount of storage needed;
storing the healthcare data batches in a long term storage data store associated with a second data center of the plurality of disparate data centers of the distributed computing environment;
receiving a request from a processing node for at least one of the healthcare data batches;
accessing the at least one requested healthcare data batches from the long-term storage data store; and
delivering the at least one requested healthcare data batches, where the at least one requested healthcare data batches is processed to generate clinically relevant outcome information.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, systems, and computer-readable media are provided for aggregating, partitioning, and storing healthcare data. Healthcare data is collected from various disparate healthcare data sources. The data is aggregated into batches of the same type of data. From here, the data is partitioned according to the data'"'"'s originating healthcare data source. The aggregated and partitioned healthcare data is then stored in a long term storage data store. This system of storing healthcare data allows for efficient retrieval and processing by computing solutions that need access to batches of healthcare data. The system also reduces costs associated with storing data as duplicate storage is eliminated.
5 Citations
18 Claims
-
1. A computerized method carried out by at least one server having at least one processor for aggregating, partitioning, and storing raw healthcare data for efficient retrieval, the method comprising:
-
receiving, at a staging platform associated with a first data center, healthcare data from a plurality of disparate healthcare data sources of a distributed computing environment; implementing a multi-level partitioning strategy for efficient access to a plurality of healthcare data batches, the multi-level partitioning strategy eliminating the need to process healthcare data that is not required for a computing solution, wherein the multi-level partitioning strategy comprises; (A) aggregating, using the at least one processor, one or more subsets of the healthcare data having the same file type into batches; and (B) partitioning the batches according to healthcare data source in order to handle a variety of payload sizes and data types while reducing an amount of storage needed; storing the healthcare data batches in a long term storage data store associated with a second data center of the plurality of disparate data centers of the distributed computing environment; receiving a request from a processing node for at least one of the healthcare data batches; accessing the at least one requested healthcare data batches from the long-term storage data store; and delivering the at least one requested healthcare data batches, where the at least one requested healthcare data batches is processed to generate clinically relevant outcome information. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented system for aggregating, archiving, and managing healthcare data, the computer-implemented system comprising:
-
one or more disparate originating healthcare data sources; one or more processing nodes; a staging component that receives and indexes one or more pieces of healthcare data from a plurality of disparate originating healthcare data sources so that it is optimized for low-latency processing by storing the data in a raw form with additional metadata describing a source and a time received for the data; an extraction component that extracts one or more pieces of healthcare data received and indexed at the staging component from a plurality of disparate originating healthcare data sources; an aggregating component that aggregates a subset of the healthcare data extracted by the extraction component, the aggregating component aggregating the subset of healthcare data having the same type into batches; a partitioning component that sorts the batches by originating healthcare data source; and a long-term storage data store for storing the batches; wherein the staging component further receives a request from the one or more processing nodes for at least one of the batches; wherein the staging component further accesses the at least one of the batches in the long-term storage data store; and wherein the staging component further delivers the at least one of the batches of healthcare data to the processing node where the at least one of the requested batches will be processed to produce a clinically relevant outcome. - View Dependent Claims (9, 10, 11, 12)
-
-
13. One or more non-transitory computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method of aggregating, partitioning, and storing healthcare data, the method comprising:
-
extracting a set of healthcare data from one or more healthcare data sources; filtering and removing duplicate copies of one or more pieces of the set of healthcare data; aggregating a subset of the set of healthcare data having the same type; partitioning the aggregated subset of healthcare data by healthcare data source; and storing the aggregated and partitioned subset of healthcare data in one or more long-term storage data stores; receiving a request from a processing node for a batch of healthcare data of a particular type and healthcare data source; and accessing the batch of healthcare data in the long-term storage data store; and delivering the batch of healthcare data to the processing node where the requested batch of healthcare data will be processed to produce a clinically relevant outcome. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification