Rule-based extraction, transformation, and loading of data between disparate data sources
First Claim
1. A method, comprising:
accessing, by a computing system, an operational log to detect an update to a transactional database, wherein the transactional database updates the operational log when transactional data is received, the transactional data generated when Internet marketing is performed and comprising an attribute for a keyword, an ad, a group that includes certain keywords or ads, or a marketing campaign;
performing, by the computing system and in response to detecting the update, operations comprising:
extracting a subset of data from the operational log without accessing the transactional database, wherein the extracting includes identifying attribute-based data corresponding to the attribute, and
transforming the extracted subset of data into transformed data by aggregating the subset of data having a same attribute value for the attribute and occurring within a specific time range;
accessing, by the computing system, a data object having an attribute field and a metrics field corresponding to the specific time range, the data object being indexed by the attribute field; and
updating, by the computing system, the data object by adding the transformed data to the attribute field and adding metrics data from an analytics database to the metrics field, the metrics data corresponding to the specific time range.
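The transforming and updating steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the entry keys (`ts`, `keyword`, `clicks`), the function names, and the dictionary-based data object are hypothetical, chosen only to show aggregation by a shared attribute value within a time range and an update of a data object indexed by that attribute.

```python
from collections import defaultdict
from datetime import datetime


def aggregate(log_entries, attribute, start, end):
    """Group log entries that share an attribute value and fall in [start, end).

    `log_entries` is assumed to be a list of dicts read from the operational
    log; the "ts" key is a hypothetical timestamp field.
    """
    grouped = defaultdict(list)
    for entry in log_entries:
        if attribute in entry and start <= entry["ts"] < end:
            grouped[entry[attribute]].append(entry)
    return dict(grouped)


def update_data_objects(objects, transformed, metrics_by_value):
    """Add transformed data and metrics to data objects indexed by attribute value.

    `objects` maps an attribute value to a data object with an attribute field
    and a metrics field. `metrics_by_value` stands in for metrics data fetched
    from an analytics database for the same time range.
    """
    for value, entries in transformed.items():
        obj = objects.setdefault(value, {"attribute": [], "metrics": {}})
        obj["attribute"].extend(entries)
        obj["metrics"].update(metrics_by_value.get(value, {}))
    return objects
```

Calling `aggregate` with the attribute name `"keyword"` and a one-day range, then passing the result to `update_data_objects`, yields one data object per keyword for that day. The transactional database itself is never queried in this sketch; only the log entries are read.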
Abstract
Methods and apparatus are presented for extracting, transforming, and loading data from one database to another database. For example, an extraction, transformation, and loading (ETL) component may access an operational log of a given database in order to detect an update to the database. Upon detecting the update, the ETL component may extract a subset of data from the operational log, where the extraction of the subset of data is based on one or more rules. Once the subset of data has been extracted, the ETL component may transform the extracted subset of data from the operational log into a format for another, target database, where the data format for the other, target database is different from a data format for the given, source database. The ETL component may then load the subset of data transformed into the data format for the other, target database into the target database.
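The flow described in the abstract (detect an update in the operational log, extract by rule, transform to the target format, load) might be sketched as follows. This is an illustration under stated assumptions, not the patent's implementation: the `EtlComponent` class, the log-entry shape (`op`, `table`, `row`), the predicate-style rules, and the in-memory list standing in for the target database are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical shapes: the source does not define a log format or a rule syntax.
LogEntry = Dict[str, object]
Rule = Callable[[LogEntry], bool]


@dataclass
class EtlComponent:
    rules: List[Rule]
    target: List[dict] = field(default_factory=list)  # stand-in for the target database
    offset: int = 0  # number of log entries already processed

    def detect_update(self, log: List[LogEntry]) -> bool:
        # An update is assumed to show up as new entries appended to the log.
        return len(log) > self.offset

    def extract(self, log: List[LogEntry]) -> List[LogEntry]:
        # Read only the new log entries and keep those matching at least one rule.
        new_entries = log[self.offset:]
        self.offset = len(log)
        return [e for e in new_entries if any(rule(e) for rule in self.rules)]

    @staticmethod
    def transform(entries: List[LogEntry]) -> List[dict]:
        # Source format: nested {"op", "table", "row"}; target format: flat record.
        return [{"entity": e["table"], **e["row"]} for e in entries]

    def load(self, records: List[dict]) -> None:
        self.target.extend(records)

    def run_once(self, log: List[LogEntry]) -> int:
        """Run one detect/extract/transform/load pass; return records loaded."""
        if not self.detect_update(log):
            return 0
        records = self.transform(self.extract(log))
        self.load(records)
        return len(records)
```

Tracking an offset into the log is one simple way to detect updates without querying the source database; a real system could rely on log sequence numbers or change notifications instead.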
20 Claims
1. A method, comprising:
accessing, by a computing system, an operational log to detect an update to a transactional database, wherein the transactional database updates the operational log when transactional data is received, the transactional data generated when Internet marketing is performed and comprising an attribute for a keyword, an ad, a group that includes certain keywords or ads, or a marketing campaign;
performing, by the computing system and in response to detecting the update, operations comprising:
extracting a subset of data from the operational log without accessing the transactional database, wherein the extracting includes identifying attribute-based data corresponding to the attribute, and
transforming the extracted subset of data into transformed data by aggregating the subset of data having a same attribute value for the attribute and occurring within a specific time range;
accessing, by the computing system, a data object having an attribute field and a metrics field corresponding to the specific time range, the data object being indexed by the attribute field; and
updating, by the computing system, the data object by adding the transformed data to the attribute field and adding metrics data from an analytics database to the metrics field, the metrics data corresponding to the specific time range.
12. A non-transitory computer-readable storage medium storing program instructions, wherein the program instructions are computer executable to implement:
aggregating data from at least two different data sources, wherein said aggregating comprises:
accessing an operational log to detect an update to a transactional database, wherein the transactional database updates the operational log when transactional data is received, the transactional data generated when Internet marketing is performed and comprising an attribute for a keyword, an ad, a group that includes certain keywords or ads, or a marketing campaign;
performing, in response to detecting the update, operations comprising:
(i) extracting a subset of data from the operational log without accessing the transactional database, wherein the extracting includes identifying attribute-based data corresponding to the attribute, and
(ii) transforming the extracted subset of data into transformed data by aggregating the subset of data having a same attribute value for the attribute and occurring within a specific time range;
accessing a data object having an attribute field and a metrics field corresponding to the specific time range, the data object being indexed by the attribute field; and
updating the data object by adding the transformed data to the attribute field and adding metrics data from an analytics database to the metrics field, the metrics data corresponding to the specific time range.
16. A system, comprising:
at least one processing device; and
a memory device communicatively coupled to the at least one processing device and comprising program instructions, wherein the at least one processing device is configured for executing the program instructions and thereby aggregating data from at least two different data sources, wherein the aggregating comprises:
accessing an operational log to detect an update to a transactional database, wherein the transactional database updates the operational log when transactional data is received, the transactional data generated when Internet marketing is performed and comprising an attribute for a keyword, an ad, a group that includes certain keywords or ads, or a marketing campaign;
performing, in response to detecting the update, operations comprising:
(i) extracting a subset of data from the operational log without accessing the transactional database, wherein the extracting includes identifying attribute-based data corresponding to the attribute, and
(ii) transforming the extracted subset of data into transformed data by aggregating the subset of data having a same attribute value for the attribute and occurring within a specific time range;
accessing a data object having an attribute field and a metrics field corresponding to the specific time range, the data object being indexed by the attribute field; and
updating the data object by adding the transformed data to the attribute field and adding metrics data from an analytics database to the metrics field, the metrics data corresponding to the specific time range.
Specification