Historical data warehousing system
Abstract
A method comprises obtaining data from a source system. Further, the obtained data is pre-processed by a stepwise operation to generate pre-processed data. The last operated upon data is recorded. In addition, the pre-processed data is transformed into subject-oriented data by utilizing reusable primary keys and Relational Database Management System dates in the source system to link related pre-processed data. Additionally, the subject-oriented data is stored in a data warehouse. The Relational Database Management System dates are utilized for distinctly characterizing the subject-oriented data when a plurality of tables containing data with duplicate primary keys are combined in the data warehouse.
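The role of the dates in the abstract can be illustrated with a minimal sketch, assuming that each row carries its reusable primary key together with a date taken from the source database. The table contents and the field names `pk` and `rdbms_date` are hypothetical and are not taken from the patent.

```python
from datetime import date

# Hypothetical extracts from two source tables that reuse primary key 101.
orders_2001 = [{"pk": 101, "rdbms_date": date(2001, 3, 5), "customer": "Acme"}]
orders_2002 = [{"pk": 101, "rdbms_date": date(2002, 7, 19), "customer": "Globex"}]


def combine(*tables):
    """Merge rows into one store keyed by (primary key, RDBMS date)."""
    warehouse = {}
    for table in tables:
        for row in table:
            # The date keeps rows with the same reused key distinct.
            key = (row["pk"], row["rdbms_date"])
            warehouse[key] = row
    return warehouse


warehouse = combine(orders_2001, orders_2002)
```

Keyed on `pk` alone, the second row would overwrite the first; with the date in the key, both rows survive the merge.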
14 Claims
1. A method, comprising:
a) obtaining data from a source system;
b) pre-processing, utilizing a pre-processor, the obtained data by a stepwise operation to generate pre-processed data, wherein the pre-processing comprises recording only last operated upon obtained data whereby data recording is avoided during data addition for efficiency purposes;
c) transforming the pre-processed data into subject-oriented data by utilizing reusable primary keys and Relational Database Management System dates in an operating system in the source system to link related pre-processed data; and
d) storing the subject-oriented data in a data warehouse, wherein the Relational Database Management System dates are utilized for distinctly characterizing the subject-oriented data when a plurality of tables containing data with duplicate primary keys are combined in the data warehouse;
wherein the dates within the Relational Database Management System of the source system are obtained by log-scraping the Relational Database Management System.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11.

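As an illustration of the log-scraping limitation in claim 1, the following sketch extracts change dates from a database log. The log line format, the function name `scrape_dates`, and the sample entries are all assumptions made for this example; real database logs differ by vendor and are usually not plain text.

```python
import re
from datetime import datetime

# Assumed (hypothetical) log line format: "<ISO timestamp> <OPERATION> <table> pk=<n>"
LOG_LINE = re.compile(
    r"^(?P<ts>\S+) (?P<op>INSERT|UPDATE|DELETE) (?P<table>\w+) pk=(?P<pk>\d+)$"
)


def scrape_dates(log_text):
    """Return {(table, pk): [timestamps]} for every recognised log line."""
    dates = {}
    for line in log_text.splitlines():
        match = LOG_LINE.match(line.strip())
        if match is None:
            continue  # skip lines that are not row-level operations
        key = (match["table"], int(match["pk"]))
        dates.setdefault(key, []).append(datetime.fromisoformat(match["ts"]))
    return dates


sample_log = """\
2003-04-12T09:15:00 INSERT customers pk=101
checkpoint complete
2004-01-30T16:40:00 INSERT customers pk=101
"""
scraped = scrape_dates(sample_log)
```

Here the reused key 101 yields two timestamps, which is the information needed to tell the two rows apart once they reach the warehouse.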
12. A method, comprising:
a) obtaining data from a source system;
b) pre-processing, utilizing a pre-processor, the obtained data by a stepwise operation to generate pre-processed data, wherein the pre-processing comprises recording only last operated upon obtained data whereby data recording is avoided during data addition for efficiency purposes;
c) transforming the pre-processed data into subject-oriented data by utilizing reusable primary keys and Relational Database Management System dates in an operating system in the source system to link related pre-processed data; and
d) storing the subject-oriented data in a data warehouse, wherein the Relational Database Management System dates are utilized for distinctly characterizing the subject-oriented data when a plurality of tables containing data with duplicate primary keys are combined in the data warehouse;
wherein the pre-processing includes at least one of an ignore function, an insert function, an update function, and a replicate function;
wherein the pre-processing associated with the insert function returns a warning when associated subject-oriented data already exists in the data warehouse.

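The four pre-processing functions named in claim 12 could be dispatched roughly as follows. This is a sketch under stated assumptions, not the patented implementation: the claim only specifies that the insert function returns a warning when the data already exists, so the behaviour of update (merge fields), replicate (copy unconditionally), and insert on a conflict (leave the existing record untouched) is assumed here, as are the function and parameter names.

```python
import warnings


def pre_process(warehouse, function, key, record):
    """Apply one pre-processing function to a dict-based stand-in for the warehouse."""
    if function == "ignore":
        pass  # the record is deliberately skipped
    elif function == "insert":
        if key in warehouse:
            # Claim 12: the insert function warns when the data already exists.
            warnings.warn(f"insert: record {key!r} already exists in the warehouse")
        else:
            warehouse[key] = dict(record)
    elif function == "update":
        # Assumption: update merges new fields into the stored record.
        warehouse.setdefault(key, {}).update(record)
    elif function == "replicate":
        # Assumption: replicate copies the record whether or not it exists.
        warehouse[key] = dict(record)
    else:
        raise ValueError(f"unknown pre-processing function: {function}")
    return warehouse


store = {}
pre_process(store, "insert", 101, {"name": "Acme"})
pre_process(store, "update", 101, {"city": "Boston"})
```

A second `insert` for key 101 would emit a `UserWarning` and leave the stored record unchanged.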
13. A method, comprising:
a) obtaining data records from a source system;
b) pre-processing, utilizing a pre-processor, the obtained data records to generate pre-processed data records, wherein the pre-processing comprises operating on each obtained data record in a serial manner, adding new data to a prior operated-on record with an entry being recorded for a last serially operated-on record, whereby data recording is avoided during data addition for efficiency purposes;
c) transforming the pre-processed data records into related subject-oriented data records, wherein the transforming comprises linking related pre-processed data records together by way of reusable primary keys on the source system and dates obtained by trigger within a Relational Database Management System in the source system; and
d) storing the related subject-oriented data records in a historical data warehouse, wherein the Relational Database Management System dates are utilized for distinctly characterizing the subject-oriented data when a plurality of tables containing data with duplicate primary keys are combined in the historical data warehouse.

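Claim 13 obtains the dates by trigger rather than by log-scraping. The sketch below shows one way a trigger can capture such dates, using SQLite through Python's standard `sqlite3` module purely as a stand-in for the source system's database. The table names `customers` and `customers_dates` and the trigger names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (pk INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE customers_dates (pk INTEGER, operation TEXT, changed_at TEXT);

-- Each trigger stamps the affected key with the database's own clock.
CREATE TRIGGER customers_ins AFTER INSERT ON customers
BEGIN
    INSERT INTO customers_dates VALUES (NEW.pk, 'INSERT', datetime('now'));
END;

CREATE TRIGGER customers_del AFTER DELETE ON customers
BEGIN
    INSERT INTO customers_dates VALUES (OLD.pk, 'DELETE', datetime('now'));
END;
""")

conn.execute("INSERT INTO customers VALUES (101, 'Acme')")
conn.execute("DELETE FROM customers WHERE pk = 101")
conn.execute("INSERT INTO customers VALUES (101, 'Globex')")  # primary key reused

rows = conn.execute(
    "SELECT pk, operation, changed_at FROM customers_dates ORDER BY rowid"
).fetchall()
```

The `customers_dates` table then holds one dated entry per operation on key 101, so the two customers that shared the key can be linked to their own date ranges.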
14. A method, comprising:
a) obtaining data records from a legacy source system;
b) pre-processing, utilizing a pre-processor, the obtained data records to generate pre-processed data records, wherein the pre-processing comprises operating on each obtained data record in a stepwise manner, adding new data to a prior operated-on record with an entry being recorded for the obtained data record having a last stepwise operation;
c) transforming the pre-processed data records into related subject-oriented data records, wherein the transforming comprises linking related pre-processed data records together by way of reusable primary keys on the source system and dates obtained by log-scraping a Relational Database Management System of the legacy source system; and
d) storing the related subject-oriented data records in a data warehouse;
wherein the dates are utilized for distinctly characterizing the subject-oriented data when a plurality of tables containing data with duplicate primary keys are combined in the data warehouse.

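One possible reading of the stepwise pre-processing in claims 13 and 14 is sketched below: each obtained record adds its data to the previously operated-on record, and an entry is written only once, for the last step. This interpretation, the function name, and the sample records are assumptions for illustration; the claims do not prescribe a data structure.

```python
def pre_process_stepwise(records):
    """Apply records one step at a time; record an entry only for the last step."""
    merged = {}
    recorded_entries = []
    last_index = len(records) - 1
    for index, record in enumerate(records):
        merged.update(record)  # add new data to the prior operated-on record
        if index == last_index:
            # No entry is written during the earlier additions.
            recorded_entries.append(dict(merged))
    return merged, recorded_entries


steps = [{"pk": 101, "name": "Acme"}, {"city": "Boston"}, {"status": "active"}]
merged, entries = pre_process_stepwise(steps)
```

Three steps produce a single recorded entry, which is the saving the claims describe as avoiding data recording during data addition.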
Specification