Systems and methods for automated maintenance and repair of database and file systems
First Claim
1. An automated data reliability system (DRS) implemented at least in part by a computing device for a database file system (DBFS), said DRS comprising:
a subsystem for performing database administration (DBA) tasks based on a set of policies;
a subsystem for responding to a set of data corruptions at a page level;
a subsystem for a first level of recovery for attempting to repair a corrupted page;
a subsystem for a second level of recovery for attempting to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
a subsystem for evaluating a database associated with said DBFS to determine if the database is transactionally consistent; and
a subsystem for delaying repairs if said database is transactionally consistent until a crash recovery is complete, otherwise proceeding with repairs without the delaying.
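The two recovery levels and the delay decision recited in this claim can be illustrated with a minimal sketch. It is not the patented implementation: the `Page` fields `repairable` and `restorable`, the `Outcome` enum, and the `wait_for_crash_recovery` callback are hypothetical stand-ins for whatever checks and mechanisms a real DRS would use.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, List, Tuple


class Outcome(Enum):
    REPAIRED = "repaired"                        # first level succeeded
    REBUILT_OR_RESTORED = "rebuilt_or_restored"  # second level succeeded
    FAILED = "failed"                            # neither level succeeded


@dataclass
class Page:
    page_id: int
    repairable: bool   # hypothetical: an in-place repair would succeed
    restorable: bool   # hypothetical: a rebuild or restore would succeed


def recover_page(page: Page) -> Outcome:
    """Try the first level of recovery, then fall back to the second."""
    if page.repairable:
        return Outcome.REPAIRED
    if page.restorable:
        return Outcome.REBUILT_OR_RESTORED
    return Outcome.FAILED


def run_repairs(
    corrupted: List[Page],
    transactionally_consistent: bool,
    wait_for_crash_recovery: Callable[[], None],
) -> Tuple[bool, Dict[int, Outcome]]:
    """Return (delayed, outcome per page id).

    As recited in the claim: if the database is transactionally
    consistent, repairs are delayed until crash recovery completes;
    otherwise repairs proceed without the delay.
    """
    delayed = False
    if transactionally_consistent:
        wait_for_crash_recovery()  # assumed to block until recovery is done
        delayed = True
    return delayed, {p.page_id: recover_page(p) for p in corrupted}
```

Passing the crash-recovery wait in as a callback keeps the sketch independent of any particular database engine; a real system would substitute its own synchronization mechanism.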
Abstract
The present invention relates generally to database and file system management and, more particularly, to automatic database and file system maintenance and repair to ensure data reliability. Various aspects of the present invention relate to responding to and correcting data corruptions at a data page level for all data page types, as well as to recovery (including rebuild or restore operations) for various scenarios including, without limitation, index page corruptions (clustered and non-clustered), data page corruptions, and page corruptions in the log file.
27 Claims
1. An automated data reliability system (DRS) implemented at least in part by a computing device for a database file system (DBFS), said DRS comprising:
a subsystem for performing database administration (DBA) tasks based on a set of policies;
a subsystem for responding to a set of data corruptions at a page level;
a subsystem for a first level of recovery for attempting to repair a corrupted page;
a subsystem for a second level of recovery for attempting to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
a subsystem for evaluating a database associated with said DBFS to determine if the database is transactionally consistent; and
a subsystem for delaying repairs if said database is transactionally consistent until a crash recovery is complete, otherwise proceeding with repairs without the delaying.
Dependent claims: 2, 3, 4, 5, 6
7. A system for attaching and bringing online a data store for a database file system (DBFS), said system comprising:
a subsystem for attaching said data store;
a subsystem for running a crash recovery and attempting to bring the data store online, wherein a set of data corruptions occurs at a page level, wherein a first level of recovery attempts to repair a corrupted page and wherein a second level of recovery attempts to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
a subsystem for determining if the attempt to bring the data store online is not successful;
a subsystem for evaluating a database associated with said data store to determine if the database is transactionally consistent;
a subsystem for delaying repairs if said database is transactionally consistent until said crash recovery is complete, otherwise proceeding with repairs without the delaying.
Dependent claims: 8, 9
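The attach-and-bring-online sequence recited in claim 7 can be read as a short control flow. The sketch below is one interpretation, not the patented implementation: every callable parameter (`attach`, `try_bring_online`, `is_transactionally_consistent`, `finish_crash_recovery`, `repair`) is a hypothetical hook, and the returned list of step names exists only to make the ordering visible.

```python
from typing import Callable, List


def attach_and_bring_online(
    attach: Callable[[], None],
    try_bring_online: Callable[[], bool],
    is_transactionally_consistent: Callable[[], bool],
    finish_crash_recovery: Callable[[], None],
    repair: Callable[[], None],
) -> List[str]:
    """Return the ordered list of steps that were taken."""
    steps: List[str] = []

    attach()
    steps.append("attach")

    # Run crash recovery and attempt to bring the store online.
    if try_bring_online():
        steps.append("online")
        return steps
    steps.append("online_failed")

    # Delay repairs until crash recovery completes only when the
    # database is transactionally consistent; otherwise repair now.
    if is_transactionally_consistent():
        finish_crash_recovery()
        steps.append("crash_recovery_complete")

    repair()
    steps.append("repair")
    return steps
```

With no-op hooks, a failed online attempt on a consistent database yields `attach`, `online_failed`, `crash_recovery_complete`, `repair`, while an inconsistent database skips the wait and goes straight to `repair`.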
10. A method implemented at least in part by a computing device for automated data reliability system (DRS) for a database file system (DBFS), said method comprising:
establishing a set of policies;
performing database administration (DBA) tasks based on the set of policies;
responding to a set of data corruptions at a page level for all page types;
performing a first level of recovery by attempting to repair a corrupted page;
performing a second level of recovery by attempting to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
evaluating a database associated with said DBFS to determine if the database is transactionally consistent; and
delaying repairs if the database is transactionally consistent until a crash recovery is complete, otherwise proceeding with repairs without the delaying.
Dependent claims: 11, 12, 13, 14, 15
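Claim 10 begins by establishing a set of policies and performing DBA tasks based on them. The claims do not say what a policy contains, so the following is only a sketch under an assumed model: each hypothetical `Policy` names a task and a minimum interval between runs, and `due_tasks` selects the tasks whose interval has elapsed. The task names in the usage note are illustrative and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Policy:
    task: str            # hypothetical task name
    interval_hours: int  # hypothetical: minimum hours between runs


def due_tasks(
    policies: List[Policy],
    hours_since_last_run: Dict[str, int],
) -> List[str]:
    """Return the tasks that the policies say should run now.

    A task that has never run (no entry in hours_since_last_run)
    is treated as due.
    """
    due: List[str] = []
    for policy in policies:
        elapsed = hours_since_last_run.get(policy.task)
        if elapsed is None or elapsed >= policy.interval_hours:
            due.append(policy.task)
    return due
```

For example, with policies `Policy("backup", 24)` and `Policy("index_check", 168)`, a call with `{"backup": 30, "index_check": 10}` selects only `backup`, and a call with an empty history selects both.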
16. A method for attaching and bringing online a data store for a database file system (DBFS), said method comprising:
attaching said data store;
running a crash recovery and attempting to bring the data store online, wherein a set of data corruptions occurs at a page level, wherein a first level of recovery attempts to repair a corrupted page and wherein a second level of recovery attempts to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
determining if the attempt to bring the data store online is not successful;
evaluating a database associated with said data store to determine if the database is transactionally consistent; and
delaying repairs if the database is transactionally consistent until said crash recovery is complete.
Dependent claims: 17, 18
19. A computer-readable storage medium having at least one tangible media, the tangible media comprising computer-readable instructions for automated data reliability system (DRS) for a database file system (DBFS), said computer-readable instructions comprising instructions for:
performing database administration (DBA) tasks based on a set of policies;
responding to a set of data corruptions at a page level for all page types;
performing a first level of recovery by attempting to repair a corrupted page;
performing a second level of recovery by attempting to rebuild or restore a corrupted page if said corrupted page cannot be repaired; and
evaluating a database associated with said DBFS to determine if the database is transactionally consistent and, if transactionally consistent, delaying repairs until a crash recovery is complete, otherwise proceeding with repairs without the delaying.
Dependent claims: 20, 21, 22, 23, 24
25. A computer-readable storage medium having at least one tangible media, the tangible media comprising computer-readable instructions for attaching and bringing online a data store for a database file system (DBFS), said computer-readable instructions comprising instructions for:
attaching said data store;
running a crash recovery and attempting to bring the data store online, wherein a set of data corruptions occurs at a page level, wherein a first level of recovery attempts to repair a corrupted page and wherein a second level of recovery attempts to rebuild or restore a corrupted page if said corrupted page cannot be repaired;
determining if the attempt to bring the data store online is not successful;
evaluating a database associated with said data store to determine if the database is transactionally consistent; and
delaying repairs if the database is transactionally consistent until said crash recovery is complete.
Dependent claims: 26, 27
Specification