Techniques for performing ETL over a WAN
First Claim
1. A computer-readable non-transitory storage medium storing a plurality of instructions controlling one or more processors to facilitate communication of data from a source data store to a target data store, the plurality of instructions comprising:
instructions that cause first data to be extracted by at least one first asynchronous process at the source data store;
instructions that cause a first transportable storage unit to be created by the asynchronous process at the source data store, the first transportable storage unit comprising the first data;
instructions that cause the first transportable storage unit to be broken up into at least a second transportable storage unit and a third transportable storage unit in response to a determination that a size of the first transportable storage unit will cause a transmission of the first transportable storage unit from the source data store to the target data store over a communication network to be performed at less than a specified threshold speed;
instructions that cause the second transportable storage unit to be pushed by the asynchronous process at the source data store from the source data store to the target data store over the communication network;
instructions that cause at least a portion of the first data to be extracted from the second transportable storage unit by at least one asynchronous process at the target data store;
instructions that cause one or more transformations to be performed on said at least a portion of said first data by the asynchronous process at the target data store to generate transformed data; and
instructions that cause the transformed data to be loaded at the target data store by the asynchronous process at the target data store.
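The flow recited in the claim above (extract at the source, package into a transportable unit, split an oversized unit, push it, then extract, transform, and load at the target) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: `TransportUnit`, `split_unit`, `source_process`, `target_process`, and the `max_rows` limit are all hypothetical names, an `asyncio.Queue` stands in for the communication network, and the "below threshold speed" determination is reduced to a simple size check.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class TransportUnit:
    """Hypothetical stand-in for a 'transportable storage unit'."""
    rows: list

    def size(self) -> int:
        # Assumption: size is counted in rows, purely for illustration.
        return len(self.rows)


def split_unit(unit: TransportUnit, max_rows: int) -> list:
    """Break a unit into smaller units when it exceeds max_rows.

    Assumption: a unit larger than max_rows is taken to transmit below
    the threshold speed; a real system would estimate or measure this.
    """
    if unit.size() <= max_rows:
        return [unit]
    return [TransportUnit(unit.rows[i:i + max_rows])
            for i in range(0, unit.size(), max_rows)]


async def source_process(source_rows, network: asyncio.Queue, max_rows: int):
    # Extract the data and create the first transportable unit.
    unit = TransportUnit(list(source_rows))
    # Split if needed, then push each resulting unit toward the target.
    for part in split_unit(unit, max_rows):
        await network.put(part)
    await network.put(None)  # sentinel: no more units will arrive


async def target_process(network: asyncio.Queue, target_store: list):
    while True:
        unit = await network.get()
        if unit is None:
            break
        # Extract rows from the unit and apply an example transformation.
        transformed = [str(row).upper() for row in unit.rows]
        # Load the transformed rows into the target store.
        target_store.extend(transformed)


async def run_pipeline(source_rows, max_rows: int = 2) -> list:
    network = asyncio.Queue()  # stands in for the WAN link
    target_store = []
    await asyncio.gather(
        source_process(source_rows, network, max_rows),
        target_process(network, target_store),
    )
    return target_store


def run(source_rows, max_rows: int = 2) -> list:
    return asyncio.run(run_pipeline(source_rows, max_rows))
```

Calling `run(["a", "b", "c"], max_rows=2)` would split the three rows into two units, push both through the queue, and load the upper-cased rows at the target. The source and target sides run as separate coroutines, so neither waits for the other to finish a whole batch.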
Abstract
Techniques that enable data extraction, transfer, and load (ETL) operations to be performed asynchronously over communication networks including a Wide Area Network (WAN).
19 Claims
1. A computer-readable non-transitory storage medium storing a plurality of instructions controlling one or more processors to facilitate communication of data from a source data store to a target data store, the plurality of instructions comprising:
instructions that cause first data to be extracted by at least one first asynchronous process at the source data store;
instructions that cause a first transportable storage unit to be created by the asynchronous process at the source data store, the first transportable storage unit comprising the first data;
instructions that cause the first transportable storage unit to be broken up into at least a second transportable storage unit and a third transportable storage unit in response to a determination that a size of the first transportable storage unit will cause a transmission of the first transportable storage unit from the source data store to the target data store over a communication network to be performed at less than a specified threshold speed;
instructions that cause the second transportable storage unit to be pushed by the asynchronous process at the source data store from the source data store to the target data store over the communication network;
instructions that cause at least a portion of the first data to be extracted from the second transportable storage unit by at least one asynchronous process at the target data store;
instructions that cause one or more transformations to be performed on said at least a portion of said first data by the asynchronous process at the target data store to generate transformed data; and
instructions that cause the transformed data to be loaded at the target data store by the asynchronous process at the target data store.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system comprising:
a memory; and
a processor coupled to the memory, the processor configured to:
extract first data from the memory by at least one asynchronous process at a source data store;
create a first transportable storage unit comprising the first data by the asynchronous process at the source data store;
break up the first transportable storage unit into at least a second transportable storage unit and a third transportable storage unit in response to a determination that a size of the first transportable storage unit will cause a transmission of the first transportable storage unit from the source data store to the target data store over a communication network to be performed at less than a specified threshold speed; and
push the second transportable storage unit to a target data store over a communication network by at least one asynchronous process at the target data store, at least a portion of the first data being extracted from the second transportable storage unit at the target data store and transformed into transformed data by the asynchronous process at the target data store, the transformed data being loaded at the target data store by the asynchronous process at the target data store.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
Specification