Framework for managing clustering and replication
Abstract
The present invention provides a framework for managing both clustering and data replication in a software system distributed across multiple nodes. The framework includes at least one agent running at nodes comprising the distributed system. The framework also includes a master to coordinate clustering and replication operations. The framework further includes a library of software programs, called primitives, that are used by agents to communicate with the master. The agent(s) obtain cluster status information and replication status information, which are used by the master to manage clustering and replication operations. The framework is designed to work with existing cluster management applications and data replication facilities. The framework provides status information needed for coordinating clustering and replication operations to ensure that applications and data remain in a consistent state for disaster recovery purposes.
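The reporting path described in the abstract (agents gather status, a library primitive carries it to the master) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `StatusReport` record, the `report_status` primitive, the `Agent` and `Master` classes, and the sample payload fields. In-process function calls stand in for whatever transport a real deployment would use.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Tuple


@dataclass
class StatusReport:
    node: str
    kind: str      # "cluster" or "replication" (hypothetical labels)
    payload: dict


@dataclass
class Master:
    """Keeps the latest report of each kind received from each node."""
    reports: Dict[Tuple[str, str], StatusReport] = field(default_factory=dict)

    def receive(self, report: StatusReport) -> None:
        self.reports[(report.node, report.kind)] = report

    def view(self, node: str) -> dict:
        """Return cluster and replication status for one node, side by side."""
        return {
            kind: report.payload
            for (name, kind), report in self.reports.items()
            if name == node
        }


def report_status(master: Master, report: StatusReport) -> None:
    """Hypothetical primitive: the library call an agent uses to reach the master."""
    master.receive(report)


class Agent:
    """Runs on a node and wraps a probe of an existing cluster or replication tool."""

    def __init__(self, node: str, kind: str, probe: Callable[[], dict]) -> None:
        self.node, self.kind, self.probe = node, kind, probe

    def poll(self, master: Master) -> None:
        report_status(master, StatusReport(self.node, self.kind, self.probe()))


master = Master()
cluster_agent = Agent("node-a", "cluster", lambda: {"cluster": "c1", "online": True})
replication_agent = Agent(
    "node-a", "replication", lambda: {"state": "active", "latency_s": 2.5}
)
cluster_agent.poll(master)
replication_agent.poll(master)
combined = master.view("node-a")
```

The probes are plain callables so that each agent can wrap an existing cluster management application or replication facility without the master needing to know which one is in use.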
36 Claims
1. A method comprising:
obtaining cluster status information for a plurality of clusters comprising a plurality of nodes;
obtaining replication status information for at least one of the plurality of nodes, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication for application data replication between the nodes, wherein application data is replicated in response to a write operation by an application to the application data; and
managing an operation from a node different from the at least one of the plurality of nodes using the cluster status information and the replication status information in combination, wherein the operation corresponds to one of a clustering operation and a replication operation; and
the managing is performed at least in part automatically.
Dependent claims: 2-14.

15. A method comprising:
obtaining cluster status information for a plurality of clusters comprising a plurality of nodes;
obtaining replication status information for at least one of the plurality of nodes, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication for application data replication between the nodes, wherein the application data replication comprises replicating application data in response to a write operation by an application to the application data;
automatically providing the cluster status information and the replication status information to a master operating on a node different from the at least one of the plurality of nodes; and
managing an operation using the cluster status information and the replication status information in combination, wherein the operation corresponds to one of a clustering operation and a replication operation; and
the managing is performed by the master.

16. A system comprising:
a node comprising a processor;
a first agent configured to obtain cluster status information from a cluster management application;
a second agent configured to obtain replication status information from a replication facility, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication; and
a master, wherein
the master is configured to manage an operation using the cluster status information and the replication status information in combination,
at least one of the first agent and the second agent operates on the node, wherein the node is different from a node on which the master operates,
the cluster management application is configured to perform a clustering operation for a plurality of nodes organized into a plurality of clusters,
the replication facility is configured to perform a replication operation for replicating application data between the nodes, wherein the replication facility replicates application data in response to a write operation by an application to the application data, and
the operation corresponds to one of the clustering operation and the replication operation.
Dependent claims: 17-26.

27. An interface comprising:
a computer readable storage device for storing a library comprising a primitive;
at least one agent configured to implement at least one entry point to:
obtain cluster status information from a cluster management application,
obtain replication status information from a replication facility, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication, and
call the primitive to communicate with a master, wherein the at least one agent operates on a node different from a node on which the master operates;
wherein
the cluster management application is configured to perform a clustering operation for a plurality of nodes organized into a plurality of clusters,
the replication facility is configured to perform a replication operation to replicate application data between the nodes,
the replication facility is configured to replicate the application data in response to a write operation by an application to the application data,
the master is configured to manage an operation using the cluster status information and the replication status information in combination, and
the operation corresponds to one of the clustering operation and the replication operation.
Dependent claims: 28-30.

31. A system comprising:
a computer readable storage means for storing cluster status information;
first obtaining means for obtaining the cluster status information for a plurality of clusters including a plurality of nodes;
second obtaining means for obtaining replication status information for at least one of the plurality of nodes, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication for application data replication between the nodes, wherein the application data is replicated in response to a write operation by an application to the application data; and
managing means for managing an operation from a node different from the at least one of the plurality of nodes using the cluster status information and the replication status information in combination, wherein the operation corresponds to one of a clustering operation and a replication operation.
Dependent claims: 32-33.

34. A computer program product comprising:
first obtaining instructions to obtain cluster status information for a plurality of clusters including a plurality of nodes;
second obtaining instructions to obtain replication status information for at least one of the plurality of nodes, wherein the replication status information comprises information identifying a current state and a current latency of asynchronous data replication for application data replication between the nodes, wherein the application data is replicated in response to a write operation by an application to the application data;
managing instructions to manage an operation from a node different from the at least one of the plurality of nodes using the cluster status information and the replication status information in combination, wherein the operation corresponds to one of a clustering operation and a replication operation; and
a computer readable storage medium to store the first and second obtaining instructions and the managing instructions.
Dependent claims: 35-36.

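The independent claims share one central step: an operation is managed using cluster status and replication status in combination, where the replication status carries a current state and a current latency. The sketch below shows one way such a combined decision could look. It is an illustration under stated assumptions and not the claimed method: the state names, the `max_latency_s` threshold, the returned operation labels, and the `choose_operation` function are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterStatus:
    cluster: str
    healthy_nodes: int


@dataclass(frozen=True)
class ReplicationStatus:
    state: str        # hypothetical values: "active", "paused", "failed"
    latency_s: float  # last reported lag of the asynchronous replica, in seconds


def choose_operation(
    primary: ClusterStatus,
    secondary: ClusterStatus,
    replication: ReplicationStatus,
    max_latency_s: float = 5.0,  # assumed tolerance for unreplicated writes
) -> str:
    """Pick an operation from cluster and replication status taken together."""
    if primary.healthy_nodes > 0:
        return "none"              # primary cluster can still host the application
    if secondary.healthy_nodes == 0:
        return "alert-operator"    # nowhere to fail over to
    if replication.state == "active" and replication.latency_s <= max_latency_s:
        return "failover"          # replica was recent enough when last reported
    return "hold-for-operator"     # replica may be missing recent writes


decision = choose_operation(
    ClusterStatus("c1", healthy_nodes=0),
    ClusterStatus("c2", healthy_nodes=3),
    ReplicationStatus("active", latency_s=2.5),
)
```

Neither input is sufficient alone in this sketch: cluster status says whether a failover target exists, and replication state and latency say whether the data at that target is current enough to use.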
Specification