Distributed failover aware storage area network backup of application data in an active-N high availability cluster
First Claim
1. A method for backing up a cluster having one or more nodes, comprising:
- providing a primary coordinator executing on a first node of the cluster;
providing a backup service for each of the nodes, each backup service operative with the primary coordinator;
detecting the failure of a node hosting the primary coordinator;
moving the primary coordinator to a second node of the cluster;
initializing the primary coordinator on the second node;
obtaining failover information by the primary coordinator from a cluster service of the second node.
14 Assignments
0 Petitions
Accused Products
Abstract
A SAN-based cluster backup system and method are provided. The system and method are automated, do not use a LAN for backup data, and are made aware of application failover events. The system and method are composed of two main components: a backup service, and a primary coordinator. The backup service performs the backup of the applications that are hosted on a particular node. The backup service periodically checkpoints the state of the backup job and communicates the status to the primary coordinator. The primary coordinator controls all backup operations in the cluster. The user submits backup jobs for the applications through the primary coordinator. If a node fails during a backup operation, the primary coordinator can ensure that the failed backup job can be resumed from the last checkpoint on the failed-over node. In this way, repetitive backups can be avoided, thereby increasing efficiency.
65 Citations
25 Claims
-
1. A method for backing up a cluster having one or more nodes, comprising:
-
providing a primary coordinator executing on a first node of the cluster; providing a backup service for each of the nodes, each backup service operative with the primary coordinator; detecting the failure of a node hosting the primary coordinator; moving the primary coordinator to a second node of the cluster; initializing the primary coordinator on the second node; obtaining failover information by the primary coordinator from a cluster service of the second node. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a cluster having two or more nodes, each of the nodes constructed and arranged to execute at least one application, each of said nodes further executing a backup service; a primary coordinator executing on one of the two or more nodes, the primary coordinator constructed and arranged to coordinate the backup of data on the nodes by communicating backup information to the backup services. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A cluster having one or more nodes, each of the nodes being a computer having one or more microprocessors, and having memory operative with the one or more microprocessors, the cluster further having at least one SAN that is operative with each of the nodes, the cluster comprising:
-
a backup service on each of the nodes; and a primary coordinator executing on one of the nodes, the primary coordinator constructed and arranged to direct the backup service on each of said nodes to backup data on the node through the SAN.
-
Specification