Failover method and system for a computer system having clustering configuration
Abstract
A failover method for a computer system having a clustering configuration. Among a plurality of computers having the clustering configuration, any one of the computers, upon detecting a malfunction of a system including a certain computer, transmits a notification of the detected system malfunction to the computers configuring the other systems. When that computer has both detected the malfunction of the system including the certain computer and received malfunction notifications for the same system from the computers configuring the other systems, it issues a reset request to the certain computer.
16 Claims
1. A failover control method for a computer system including a plurality of computers having a clustering configuration comprising:
monitoring a second computer via a first line by a first computer among the plurality of computers;
detecting a malfunction of the second computer by the first computer;
receiving from the other computers a notification including a monitoring result for the second computer in the other computers among the plurality of computers having the clustering configuration by the first computer;
allowing information relating to a malfunction of the detected second computer and the monitoring result to correspond to each other;
judging whether or not the correspondence satisfies a predetermined condition; and
when the predetermined condition is satisfied, giving the second computer a reset instruction via a second line.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
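The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the boolean encoding of monitoring results, and the example condition `all_peers_agree` are assumptions made for illustration, since the claim leaves the "predetermined condition" open. The `send_reset` callback stands in for the second line.

```python
from typing import Callable, Dict

# Hypothetical type: a condition takes the local detection result and the
# peers' monitoring results (peer id -> "peer also sees a malfunction").
Condition = Callable[[bool, Dict[str, bool]], bool]


def all_peers_agree(local_failure: bool, peer_results: Dict[str, bool]) -> bool:
    """Assumed example condition: at least one peer reported, and every
    reporting peer agrees with the local detection."""
    return local_failure and bool(peer_results) and all(peer_results.values())


def failover_decision(
    local_failure: bool,
    peer_results: Dict[str, bool],
    condition: Condition,
    send_reset: Callable[[], None],
) -> bool:
    """Correlate the local detection with peer results and reset if warranted.

    Returns True when a reset instruction was sent.
    """
    if not local_failure:
        # The first computer has not detected a malfunction; nothing to judge.
        return False
    if condition(local_failure, peer_results):
        # Reset goes out over the second line, modelled here as a callback.
        send_reset()
        return True
    return False
```

With this sketch, a first computer that sees a failure confirmed by peers `n3` and `n4` would send the reset, while a single dissenting peer would hold it back. A different condition (for example a majority rule) could be passed in without changing the control flow.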
11. A computer system comprising three or more computers having a clustering configuration, wherein:
any one of the computers is a currently-active system computer that executes a predetermined application, and the other computers are standby system computers;
each of the computers has a processor, a memory connected with the processor, a first network interface connected with the processor, and a second network interface;
among the computers, when the processor of a first computer detects a communication malfunction with a second computer among the computers via the first network interface, the processor judges whether or not malfunction information on a communication with the second computer is received via the first network interface from computers other than the second computer;
when the malfunction information is received, stores the malfunction information in the memory, refers to the memory, and calculates how many computers the malfunction information is received from; and
as a result of the calculation, when the malfunction information is received from a predetermined number of computers, issues a reset request to the second computer via the second network interface.
Dependent claims: 12, 13, 14
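The store-and-count step of claim 11 can be sketched as follows. This is an illustrative sketch only: the class name `MalfunctionMemory`, its methods, and the choice to count distinct reporters with a set are assumptions, not details taken from the claim.

```python
from typing import Dict, Set


class MalfunctionMemory:
    """Hypothetical in-memory store of malfunction information per suspect."""

    def __init__(self, required_reporters: int) -> None:
        # The "predetermined number" of computers; its value is an assumption.
        self.required_reporters = required_reporters
        self._reports: Dict[str, Set[str]] = {}  # suspect -> reporter ids

    def record(self, reporter: str, suspect: str) -> None:
        """Store malfunction information received over the first interface."""
        if reporter == suspect:
            # The claim only counts computers other than the second computer.
            return
        self._reports.setdefault(suspect, set()).add(reporter)

    def reporter_count(self, suspect: str) -> int:
        """Number of distinct computers that reported the suspect."""
        return len(self._reports.get(suspect, set()))

    def should_reset(self, suspect: str, locally_detected: bool) -> bool:
        """True when the local detection is backed by enough reporters."""
        return locally_detected and self.reporter_count(suspect) >= self.required_reporters
```

Counting distinct reporters, instead of raw messages, keeps a single computer that repeats its report from reaching the threshold on its own.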
15. A reset control device connected with a plurality of computers having a clustering configuration via a network, comprising:
a network interface connected with the network;
a processor connected with the network; and
a memory connected with the processor, wherein:
the processor receives malfunction information relating to a communication malfunction with any one of the computers via the network interface;
stores the malfunction information in the memory;
based on the malfunction information stored in the memory, judges whether or not the information has been received from a predetermined number of computers; and
as a result of the judgment, when the information has been received from the predetermined number of computers, issues a reset via the network interface to a computer in which a malfunction occurs.
Dependent claims: 16
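A central reset control device of the kind described in claim 15 could be sketched as below. The class `ResetControlDevice`, the `issue_reset` callback standing in for the network interface, and the decision to clear stored reports after a reset are all assumptions for illustration; the claim does not specify them.

```python
from typing import Callable, Dict, Set


class ResetControlDevice:
    """Hypothetical arbiter that resets a computer once enough peers report it."""

    def __init__(self, threshold: int, issue_reset: Callable[[str], None]) -> None:
        self.threshold = threshold      # assumed "predetermined number"
        self.issue_reset = issue_reset  # stands in for the network interface
        self.memory: Dict[str, Set[str]] = {}  # target -> reporting computers

    def receive(self, reporter: str, target: str) -> bool:
        """Store one malfunction report; return True if a reset was issued."""
        reporters = self.memory.setdefault(target, set())
        reporters.add(reporter)
        if len(reporters) >= self.threshold:
            self.issue_reset(target)
            # Design choice in this sketch: forget the reports so that stale
            # entries cannot trigger a second reset after the target recovers.
            del self.memory[target]
            return True
        return False
```

Because the device keeps a separate set of reporters per target, reports about different computers do not add up toward the same threshold.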
Specification