Failover and recovery for replicated data instances
First Claim
1. A computer-implemented method of managing recovery of a replicated instance for a relational database instance from a control environment, comprising:
- under control of one or more computer systems configured with executable instructions,communicating with a primary instance replica and a secondary instance replica in a data environment using a monitoring component of a separate control environment, one or more responses received by the monitoring component including status information and data generation information for a respective one of the primary instance replica or the secondary instance replica; and
in response to the monitoring component being unable to communicate with the primary instance replica or the secondary instance replica,determining failure information, the failure information indicating whether the primary instance replica and the secondary instance replica are able to communicate with each other and whether the primary instance replica and the secondary instance replica have common data generation information; and
determining whether to perform a failover operation or a recover process based at least in part on the failure information.
0 Assignments
0 Petitions
Accused Products
Abstract
Replicated instances in a database environment provide for automatic failover and recovery. A monitoring component can periodically communicate with a primary and a secondary replica for an instance, with each capable of residing in a separate data zone or geographic location to provide a level of reliability and availability. A database running on the primary instance can have information synchronously replicated to the secondary replica at a block level, such that the primary and secondary replicas are in sync. In the event that the monitoring component is not able to communicate with one of the replicas, the monitoring component can attempt to determine whether those replicas can communicate with each other, as well as whether the replicas have the same data generation version. Depending on the state information, the monitoring component can automatically perform a recovery operation, such as to failover to the secondary replica or perform secondary replica recovery.
94 Citations
25 Claims
-
1. A computer-implemented method of managing recovery of a replicated instance for a relational database instance from a control environment, comprising:
under control of one or more computer systems configured with executable instructions, communicating with a primary instance replica and a secondary instance replica in a data environment using a monitoring component of a separate control environment, one or more responses received by the monitoring component including status information and data generation information for a respective one of the primary instance replica or the secondary instance replica; and in response to the monitoring component being unable to communicate with the primary instance replica or the secondary instance replica, determining failure information, the failure information indicating whether the primary instance replica and the secondary instance replica are able to communicate with each other and whether the primary instance replica and the secondary instance replica have common data generation information; and determining whether to perform a failover operation or a recover process based at least in part on the failure information. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A computer-implemented method of managing a replicated database instance in a data environment using a separate control environment, comprising:
under control of one or more computer systems configured with executable instructions, monitoring state information for each of a primary instance replica and a secondary instance replica in the data environment using a monitoring component of the separate control environment; and in response to the monitoring component being unable to communicate with at least the primary instance replica or the secondary instance replica; determining failure information including whether the primary instance replica and the secondary instance replica are able to communicate with each other and whether the primary instance replica and the secondary instance replica have a common data generation identifier; and determining whether to perform a failover operation or a recover process based at least in part on the failure information. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14)
-
15. A system for managing a replicated database instance in a data environment using a separate control environment, comprising:
-
a processor; and a memory device including instructions that, when executed by the processor, cause the processor to; monitor state information for each of a primary instance replica and a secondary instance replica in the data environment using at least one monitoring component of the separate control environment; and in response to the at least one monitoring component being unable to communicate with one of the primary instance replica or the secondary instance replica; determine failure information including whether the primary instance replica and the secondary instance replica arc able to communicate with each other and whether the primary instance replica and the secondary instance replica have a common data generation identifier; and determine whether to perform a failover operation or a recover process based at least in part on the failure information. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A non-transitory computer-readable storage medium storing instructions for managing a replicated database instance in a data environment using a separate control environment, the instructions when executed by a processor causing the processor to:
-
monitor state information for each of a primary instance replica and a secondary instance replica in the data environment using at least one monitoring component of the separate control environment; and in response to the at least one monitoring component being unable to communicate with one of the primary instance replica or the secondary instance replica; determine failure information including whether the primary instance replica and the secondary instance replica are able to communicate with each other and whether the primary instance replica and the secondary instance replica have a common data generation identifier. - View Dependent Claims (22, 23, 24, 25)
-
Specification