NON-RELATIONAL FUNCTION-BASED DATA PUBLICATION FOR RELATIONAL DATA
First Claim
1. A computer-implemented method for gathering and aggregating data from a variety of heterogeneous data sources for publication as a unified dataset, the method comprising:
- receiving source data from one or more data sources;
copying the data locally to a combined data store;
determining semantic information related to the received source data;
publishing the received data aggregated from multiple data sources along with determined semantic information to one or more data consumers;
identifying a target relational database instance to which to publish aggregated non-relational data; and
generating one or more replication functions that convert data from a non-relational source format to a target relational database format associated with the identified target relational database instance,wherein the preceding steps are performed by at least one processor.
1 Assignment
0 Petitions
Accused Products
Abstract
A data publication system is described herein that provides a data replication model that combines benefits of data distribution from non-relational paradigms with the benefits of deeply integrating datasets via relational database paradigms. The system allows the creation of programmatic functions for extracting subsets of data stored in any source model, extracting data from a variety of sources, and republishing that data in a target model built upon the aggregated source data. The target model can provide standard relational paradigms across a set of data from multiple sources, whether or not the original sources were relational in nature. The system applies known paradigms for data replication based upon programmatic functions as a means for data replication and integrates this method for data duplication and replication based upon arbitrary functions with the power of relational database systems to process associated entities of data in highly efficient ways.
-
Citations
20 Claims
-
1. A computer-implemented method for gathering and aggregating data from a variety of heterogeneous data sources for publication as a unified dataset, the method comprising:
-
receiving source data from one or more data sources; copying the data locally to a combined data store; determining semantic information related to the received source data; publishing the received data aggregated from multiple data sources along with determined semantic information to one or more data consumers; identifying a target relational database instance to which to publish aggregated non-relational data; and generating one or more replication functions that convert data from a non-relational source format to a target relational database format associated with the identified target relational database instance, wherein the preceding steps are performed by at least one processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer system for Non-Relational Function-Based Data Publication for Relational Data, the system comprising:
-
a processor and memory configured to execute software instructions embodied within the following components; a local aggregation component that retrieves data from one or more data sources and collects the data in a combined data store; a combined data store that stores data gathered from the data sources for publication by an aggregate publication component; a semantic mapping component that determines semantic information about data gathered from the data sources; an aggregate publication component that publishes gathered data in accordance with the determined semantic information to one or more data destinations; a replication function component that generates one or more functions for replicating a portion of the published data originally from the data sources to one or more relational database instances; and a data distribution component that distributes data published by the aggregate publication component to one or more target relational database instances by applying the generated functions for replicating data. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A computer-readable storage medium comprising instructions for controlling a computer system to publish an aggregated, non-relational dataset to one or more relational database instances, wherein the instructions, upon execution, cause a processor to perform actions comprising:
-
waiting for new non-relational source data to arrive for replication to one or more relational database target instances; receiving published non-relational data aggregated from one or more distributed data sources; identifying a relational database target instance located remotely that supports a particular client application and that relies on data provided from the data catalog of the system without known an original source or format of the data; applying one or more relational mapping functions that distribute the aggregated non-relational data to the identified relational database target instance; and replicating the mapped data to the identified relational database target instance.
-
Specification