DATA WAREHOUSE COMPATIBILITY
First Claim
1. A system for establishing compatibility between an open-source data warehouse and a proprietary data warehouse, the system comprising:
- a distribution processing module configured to receive a data stream;
a distributed file system configured to store at least a portion of the data stream according to a first data format; and
a first data warehouse configured to an execute extract, transform, and load (ETL) operation on the stored portion of the data stream;
wherein the first data warehouse includes a compatibility processing module configured to execute one or more transformation processes on the stored portion of the data stream; and
wherein the compatibility processing module formats the stored portion of the data stream according to a second data format such that the stored portion of the data stream is compatible with a second data warehouse that uses the second data format.
1 Assignment
0 Petitions
Accused Products
Abstract
A compatibility processing module, for executing one or more processes to format and manipulate data, such that communication between previously-incompatible data warehouses is facilitated. In particular, a first warehouse is disclosed, wherein the first data warehouse is configured with a compatibility processing module, for receiving a large number of data points, and for executing one or more processes on a stored portion of the received data points such that the resulting processed data points are compatible with formatting conventions of a second data warehouse.
-
Citations
20 Claims
-
1. A system for establishing compatibility between an open-source data warehouse and a proprietary data warehouse, the system comprising:
-
a distribution processing module configured to receive a data stream; a distributed file system configured to store at least a portion of the data stream according to a first data format; and a first data warehouse configured to an execute extract, transform, and load (ETL) operation on the stored portion of the data stream; wherein the first data warehouse includes a compatibility processing module configured to execute one or more transformation processes on the stored portion of the data stream; and wherein the compatibility processing module formats the stored portion of the data stream according to a second data format such that the stored portion of the data stream is compatible with a second data warehouse that uses the second data format. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium having computer-executable program instructions stored thereon that when executed by a processor cause the processor to perform steps establishing compatibility between a first data warehouse and a second data warehouse, the steps comprising:
-
receiving, from a distribution processing module, a data stream; storing, in a distributed file system, at least a portion of the data stream according to a first data format; executing, using the first data warehouse, an extract, transform, and load (ETL) operation on the stored portion of the data stream; and executing, using a compatibility processing module, one or more transformation processes on the stored portion of the data stream, according to a second data format such that the stored portion of the data stream is compatible with a second data warehouse that uses the second data format. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method of establishing compatibility between a first data warehouse and a second data warehouse comprising:
-
receiving a data stream, by a distribution processing module, wherein the data stream comprises raw data; storing at least a portion of the data stream in a distributed file system according to a first data format; executing an extract, transform, and load (ETL) operation on the stored portion of the data stream; executing one or more transformation processes on the stored portion of the data stream; and formatting the stored portion of the data stream according to a second data format such that the stored portion of the data stream is compatible with a second data warehouse that uses the second data format. - View Dependent Claims (20)
-
Specification