FAILURE RECOVERY AND ERROR CORRECTION TECHNIQUES FOR DATA LOADING IN INFORMATION WAREHOUSES
First Claim
1. A method comprising:
- performing checkpointing concurrently with data loading into an information warehouse, the checkpointing ensuring consistency among multiple tables; and
recovering from a failure in the data loading using the checkpointing.
0 Assignments
0 Petitions
Accused Products
Abstract
A method of data loading for large information warehouses includes performing checkpointing concurrently with data loading into an information warehouse, the checkpointing ensuring consistency among multiple tables; and recovering from a failure in the data loading using the checkpointing. A method is also disclosed for performing versioning concurrently with data loading into an information warehouse. The versioning method enables processing undo and redo operations of the data loading between a later version and a previous version. Data load failure recovery is performed without starting a data load from the beginning but rather from a latest checkpoint for data loading at an information warehouse level using a checkpoint process characterized by a state transition diagram having a multiplicity of states; and tracking state transitions among the states using a system state table.
-
Citations
37 Claims
-
1. A method comprising:
-
performing checkpointing concurrently with data loading into an information warehouse, the checkpointing ensuring consistency among multiple tables; and recovering from a failure in the data loading using the checkpointing. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
performing versioning concurrently with data loading into an information warehouse; and processing an undo operation of the loaded data from a current version to a previous version using the versioning. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
performing failure recovery without starting a data load from the beginning of a source file but rather from a latest checkpoint for data loading using a checkpoint process performed at an information warehouse level, and characterized by a state transition diagram having a plurality of states; and tracking state transitions among the states using a system state table. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. An information warehouse system comprising:
-
a source file version table; a plurality of individual data base (DB) tables each of which contains versioning information in addition to data base information; and a versioning mechanism that accesses the source file version table and any combination of the DB tables to perform update, undo, and redo operations of data loading for an information warehouse. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A computer program product for use with an information warehouse, the computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
process a data loading task according to a state transition diagram that has checkpoint states; track state transitions among states of the state transition diagram using a system state table; and wherein in response to a data loading failure; completing the data loading task does not require entirely repeating the data loading task; and completing the data loading task comprises reloading data from a most recent checkpoint state. - View Dependent Claims (26, 27, 28, 29, 30)
-
-
31. A computer program product for use with an information warehouse, the computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
maintain a source file version table including; source data file information for data loading of a plurality of source data files; current version information for each of the source data files; maximum version information for each of the source data files; maintain a plurality of individual data base (DB) tables each of which includes creation version information for each record of each source data file; and perform update, undo, and redo operations of data loading using the source file version table and the DB tables. - View Dependent Claims (32, 33, 34, 35, 36, 37)
-
Specification