Method, apparatus and system for improving failover within a high availability disaster recovery environment
First Claim
1. A method comprising:
- obtaining a first parameter of a first cluster, wherein the first cluster is operable to provide computing services to one or more client systems, data is replicated from the first cluster to a second cluster in accordance with a disaster recovery (“DR”) protocol, the second cluster is operable to provide the computing services to the client system(s) responsive to a failover from the first cluster, and the first parameter of the first cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- monitoring a state of a second parameter of the second cluster in response to the replication of the data from the first cluster to the second cluster, wherein the second parameter of the second cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- detecting, as a function of the first parameter and the state of the second parameter, at least one anomaly, wherein the at least one anomaly indicates a mismatch between the first parameter and the state of the second parameter; and
- generating an alert in response to detecting the at least one anomaly.
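The four steps of the claim (obtain, monitor, detect, alert) can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Anomaly`, `detect_anomaly` and `check_once`, the parameter name `replicated_volume_size_gb`, and the sample values are all hypothetical, and a plain inequality test stands in for whatever comparison a real DR protocol would require.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass(frozen=True)
class Anomaly:
    """A mismatch between the first parameter and the second parameter's state."""
    name: str       # hypothetical parameter name
    expected: Any   # first parameter, obtained from the first cluster
    observed: Any   # monitored state of the second parameter


def detect_anomaly(name: str, first_parameter: Any, second_state: Any) -> Optional[Anomaly]:
    """Return an Anomaly when the two values differ, otherwise None."""
    if first_parameter != second_state:
        return Anomaly(name, first_parameter, second_state)
    return None


def check_once(
    name: str,
    obtain_first: Callable[[], Any],
    monitor_second: Callable[[], Any],
    alert: Callable[[Anomaly], None],
) -> Optional[Anomaly]:
    """Run one obtain / monitor / detect / alert pass."""
    first = obtain_first()        # step 1: obtain the first parameter
    state = monitor_second()      # step 2: monitor the second parameter's state
    anomaly = detect_anomaly(name, first, state)  # step 3: detect a mismatch
    if anomaly is not None:
        alert(anomaly)            # step 4: generate an alert
    return anomaly


# Assumed sample values: the second cluster's replica volume is smaller.
alerts = []
result = check_once(
    "replicated_volume_size_gb",
    obtain_first=lambda: 500,
    monitor_second=lambda: 450,
    alert=alerts.append,
)
```

Passing the obtain, monitor and alert steps as callables keeps the sketch independent of any particular cluster API. In practice they would wrap whatever interface the DR protocol exposes.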
Abstract
A method, apparatus and system for improving failover within a high-availability computer system are provided. The method includes obtaining one or more parameters associated with a disaster recovery protocol of at least one resource of any of the first cluster, second cluster and high-availability computer system. The method also includes monitoring one or more states of the parameters. The method further includes detecting, as a function of the parameters and states, one or more anomalies of any of the first cluster, second cluster and high-availability computer system, wherein the anomalies are types that impact the failover. These anomalies may include anomalies associated with the disaster-recovery protocols within the first and/or second clusters (“intra-cluster anomalies”) and/or anomalies among the first and second clusters (“inter-cluster anomalies”). The method further includes generating an alert in response to detecting one or more of the anomalies.
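The distinction between intra-cluster and inter-cluster anomalies might be sketched as follows. This is one possible reading, not the specification's definition. It assumes an intra-cluster anomaly is a cluster's observed state deviating from its own expected DR parameters, and an inter-cluster anomaly is a parameter present on both clusters whose values disagree. The names `Scope`, `Finding` and `find_anomalies` and all parameter names are hypothetical.

```python
from enum import Enum
from typing import Any, Dict, List, NamedTuple


class Scope(Enum):
    INTRA_CLUSTER = "intra-cluster"
    INTER_CLUSTER = "inter-cluster"


class Finding(NamedTuple):
    scope: Scope
    cluster: str      # "first", "second", or "first/second"
    parameter: str
    detail: str


def find_anomalies(
    expected: Dict[str, Dict[str, Any]],
    first_state: Dict[str, Any],
    second_state: Dict[str, Any],
) -> List[Finding]:
    findings: List[Finding] = []

    # Intra-cluster (assumed meaning): compare each cluster's observed
    # state with the DR parameters expected for that same cluster.
    for cluster, state in (("first", first_state), ("second", second_state)):
        for param, want in expected.get(cluster, {}).items():
            got = state.get(param)
            if got != want:
                findings.append(Finding(
                    Scope.INTRA_CLUSTER, cluster, param,
                    f"expected {want!r}, observed {got!r}"))

    # Inter-cluster (assumed meaning): parameters reported by both
    # clusters should agree with each other.
    for param in sorted(first_state.keys() & second_state.keys()):
        if first_state[param] != second_state[param]:
            findings.append(Finding(
                Scope.INTER_CLUSTER, "first/second", param,
                f"{first_state[param]!r} != {second_state[param]!r}"))

    return findings


# Made-up sample data for illustration only.
expected = {
    "first": {"replication_enabled": True},
    "second": {"replication_enabled": True},
}
first_state = {"replication_enabled": True, "app_version": "2.1"}
second_state = {"replication_enabled": False, "app_version": "2.0"}

findings = find_anomalies(expected, first_state, second_state)
alerts = [f"[{f.scope.value}] {f.cluster}: {f.parameter}: {f.detail}" for f in findings]
```

With this sample data the second cluster's disabled replication is reported once as an intra-cluster finding and again, alongside the version difference, as an inter-cluster finding. A real system would likely de-duplicate or rank such overlapping alerts.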
20 Claims
1. A method comprising:
- obtaining a first parameter of a first cluster, wherein the first cluster is operable to provide computing services to one or more client systems, data is replicated from the first cluster to a second cluster in accordance with a disaster recovery (“DR”) protocol, the second cluster is operable to provide the computing services to the client system(s) responsive to a failover from the first cluster, and the first parameter of the first cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- monitoring a state of a second parameter of the second cluster in response to the replication of the data from the first cluster to the second cluster, wherein the second parameter of the second cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- detecting, as a function of the first parameter and the state of the second parameter, at least one anomaly, wherein the at least one anomaly indicates a mismatch between the first parameter and the state of the second parameter; and
- generating an alert in response to detecting the at least one anomaly.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An apparatus comprising:
- a parameter definition module comprising a first parameter of a first cluster, wherein the first cluster is operable to provide computing services to one or more client systems, data is replicated from the first cluster to a second cluster in accordance with a disaster recovery (“DR”) protocol, the second cluster is operable to provide the computing services to the client system(s) responsive to a failover from the first cluster, and the first parameter of the first cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- a state-monitoring module adapted to monitor a state of a second parameter of the second cluster in response to the replication of the data from the first cluster to the second cluster, wherein the second parameter of the second cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster; and
- an anomaly-detection module adapted to detect, as a function of the first parameter and the state of the second parameter, at least one anomaly, wherein the at least one anomaly indicates a mismatch between the first parameter and the state of the second parameter.
Dependent claims: 12, 13, 14, 15, 16
17. A system for improving failover for an application within a high-availability computing system, the system comprising:
- a parameter definition module comprising a first parameter of a first cluster, wherein the first cluster is operable to provide computing services to one or more client systems, data is replicated from the first cluster to a second cluster in accordance with a disaster recovery (“DR”) protocol, the second cluster is operable to provide the computing services to the client system(s) responsive to a failover from the first cluster, and the first parameter of the first cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster;
- a state-monitoring module adapted to monitoring a state of a second parameter of the second cluster in response to the replication of the data from the first cluster to the second cluster, wherein the second parameter of the second cluster is accessed in accordance with the DR protocol when replicating the data from the first cluster to the second cluster; and
- an anomaly-detection module adapted to detect, as a function of the first parameter and the state of the second parameter, at least one anomaly, wherein the at least one anomaly indicates a mismatch between the first parameter and the state of the second parameter.
Dependent claims: 18, 19, 20
Specification