FAILURE RECOVERY SYSTEM AND SERVER
First Claim
1. A failure recovery system comprising:
- one or more network devices constituting a network; and
a server connected to said network devices, and including a scenario table in which object to-be-monitored information that indicates said network device or devices being objects for failure recovery, failure information for identifying contents of failures, and countermeasure information against the failures, and frequence information that indicates the number of times of recoveries from the failures based on the countermeasure information are correspondingly stored;
wherein;
said network device detects the failure of said network device itself, and transmits to said server, a failure event which contains the object to-be-monitored information indicating said device itself, and the failure information for identifying content of the failure;
said server receives the failure event, searches for one or more countermeasure information items corresponding to the object to-be-monitored information and the failure information which are contained in the failure event, by reference to the scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said server transmits the selected countermeasure information item to said network device;
said network device receives the countermeasure information item, and reflects the countermeasure information item or alters its setting on the basis of the countermeasure information item; and
when said server judges that the failure event is not received again within a predetermined time period since the transmission of the selected countermeasure information item, said server increases the frequence information item corresponding to the selected countermeasure information item, by reference to the scenario table.
1 Assignment
0 Petitions
Accused Products
Abstract
A server 200 includes a scenario table in which object to-be-monitored information that indicates one or more network devices A, B and C being objects for failure recovery, failure information for identifying contents of failures, countermeasure information against failures, and frequence information that indicates the number of times of the recoveries from the failures based on the countermeasure information are correspondingly stored. The network device A 300 detects the failure of the network device itself and transmits a failure event to the server 200. The server 200 selects the countermeasure information item in descending order of the frequence information items, by reference to the scenario table, and transmits the selected countermeasure information item to the network device A 300. The server 200 repeats the selections and transmissions of the pertinent information item until the reception of the failure event from the network device A 300 stops.
-
Citations
19 Claims
-
1. A failure recovery system comprising:
-
one or more network devices constituting a network; and
a server connected to said network devices, and including a scenario table in which object to-be-monitored information that indicates said network device or devices being objects for failure recovery, failure information for identifying contents of failures, and countermeasure information against the failures, and frequence information that indicates the number of times of recoveries from the failures based on the countermeasure information are correspondingly stored;
wherein;
said network device detects the failure of said network device itself, and transmits to said server, a failure event which contains the object to-be-monitored information indicating said device itself, and the failure information for identifying content of the failure;
said server receives the failure event, searches for one or more countermeasure information items corresponding to the object to-be-monitored information and the failure information which are contained in the failure event, by reference to the scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said server transmits the selected countermeasure information item to said network device;
said network device receives the countermeasure information item, and reflects the countermeasure information item or alters its setting on the basis of the countermeasure information item; and
when said server judges that the failure event is not received again within a predetermined time period since the transmission of the selected countermeasure information item, said server increases the frequence information item corresponding to the selected countermeasure information item, by reference to the scenario table. - View Dependent Claims (3, 4, 13, 16, 19)
-
-
2. A server comprising:
-
an interface for communicating with one or more network devices which constitute a network;
a scenario table in which object to-be-monitored information that indicates the network device or devices being objects for failure recoveries, failure information for identifying contents of failures, countermeasure information against the failures, and frequence information that indicates the number of times of recoveries from the failures based on the countermeasure information are correspondingly stored; and
a processing unit;
wherein;
said processing unit receives through said interface, a failure event which is transmitted when the network device detects the failure, and which contains the object to-be-monitored information indicating the network device, and the failure information for identifying the content of the failure;
said processing unit searches for one or more countermeasure information items corresponding to the object to-be-monitored information and the failure information that are contained in the failure event, by reference to said scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said processing unit transmits the selected countermeasure information item to the network device through said interface; and
in a case where the failure event is not received again within a predetermined time period since the transmission of the selected countermeasure information item, said processing unit increases the frequence information item corresponding to the selected countermeasure information item, by reference to said scenario table.
-
-
5. A failure recovery system comprising:
-
a first network device constituting a network;
a second network device connected to said first network device, and constituting the network; and
a server connected to said first and second network devices, and including a scenario table in which object to-be-monitored information that indicates said first and second network devices being objects for recovery from a failure, failure information that is determined by a combination of a status of said first network device and a status of said second network device, countermeasure information against the failure, and frequence information that indicates the number of times of the recoveries from the failure based on the countermeasure information are correspondingly stored;
wherein;
said first network device transmits to said server, a first event which contains first object to-be-monitored information indicating said device itself, and first status information indicating the status of said device itself;
said second network device transmits to said server, a second event which contains second object to-be-monitored information indicating said device itself, and second status information indicating the status of said device itself;
said server receives the first and second events, judges existence or nonexistence of the failure on the basis of the first status information and the second status information, and finds failure information;
said server searches for one or more countermeasure information items which correspond to the first and second object to-be-monitored information items respectively contained in the first and second events, and the found failure information, by reference to the scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said server transmits the selected countermeasure information item to said first and second network devices, respectively;
said first and second network devices receive the countermeasure information item, and reflect said countermeasure information item or alter their settings on the basis of the countermeasure information item, respectively; and
when the failure is avoided, said server increases the frequence information item corresponding to the selected countermeasure information item, by reference to the scenario table. - View Dependent Claims (7, 8, 14, 17)
-
-
6. A server comprising:
-
an interface for communicating with first and second network devices which constitute a network;
a scenario table in which object to-be-monitored information that indicate the first and second network devices being objects for recovery from a failure, failure information that is determined by a combination of a status of the first network device and a status of the second network device, countermeasure information against the failure, and frequence information that indicates the number of times of the recoveries from the failure based on the countermeasure information are correspondingly stored; and
a processing unit;
wherein;
said processing unit receives a first event which contains first object to-be-monitored information indicating the first network device, and first status information indicating the status of the first network device, from the first network device and through said interface;
said processing unit receives a second event which contains second object to-be-monitored information indicating the second network device, and second status information indicating the status of the second network device, from the second network device and through said interface;
said processing unit judges existence or nonexistence of the failure on the basis of the first status information and the second status information, and finds failure information;
said processing unit searches for one or more countermeasure information items which correspond to the first and second object to-be-monitored information items respectively contained in the first and second events, and the found failure information, by reference to said scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest or, is equal to or larger than a predetermined value;
said processing unit transmits the selected countermeasure information item to the first and second network devices through said interface, respectively; and
when the failure is avoided, said processing unit increases the frequence information corresponding to the selected countermeasure information item, by reference to said scenario table.
-
-
9. A failure recovery system comprising:
-
a first network device constituting a network;
a second network device constituting the network;
a third network device connected to the network through said first network device, and connected to the network through said second network device; and
a server connected to said first and second network devices, and including a scenario table in which object to-be-monitored information that indicates the network device or devices being objects for failure recovery, failure information for identifying contents of failures, countermeasure information against the failures, and frequence information that indicates the number of times of recoveries from the failures based on the countermeasure information are correspondingly stored;
wherein;
when said third network device detects that a failure has occurred in a transfer to the network, due to the failure of said first or second network device, said third network device transmits to said server, a failure event which contains the object to-be-monitored information indicating said device itself, and failure information for identifying the failure of a transfer function;
said server receives the failure event, searches for one or more countermeasure information items corresponding to the object to-be-monitored information and the failure information which are contained in the failure event, by reference to the scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said server transmits the countermeasure information item to said first and second network devices in conformity with the selected countermeasure information item;
said first and second network devices receive the countermeasure information item, and reflect the countermeasure information item or alter their settings on the basis of the countermeasure information item; and
when said server does not receive the failure event again within a predetermined time period since the transmission of the selected countermeasure information item, said server increases the frequence information corresponding to the selected countermeasure information item, by reference to the scenario table. - View Dependent Claims (11, 12, 15, 18)
-
-
10. A server comprising:
-
an interface for communicating with first, second and third network devices which constitute a network;
a scenario table in which object to-be-monitored information that indicates the network device or devices being objects for failure recovery, failure information for identifying contents of failures, countermeasure information against the failures, and frequence information that indicates the numbers of times of recoveries from the failures based on the countermeasure information are correspondingly stored; and
a processing unit;
wherein;
said processing unit receives through said interface, a failure event which is transmitted when the third network device detects that a failure has occurred in a transfer to the network, due to the failure of the first or second network device, and which contains the object to-be-monitored information indicating the third network device, and the failure information for identifying the failure of a transfer function;
said processing unit searches for one or more countermeasure information items which correspond to the object to-be-monitored information and the failure information that are contained in the failure event, by reference to said scenario table, and selects from among the pertinent countermeasure information items, one countermeasure information item as to which the corresponding frequence information is the largest, or, is equal to or larger than a predetermined value;
said processing unit transmits the countermeasure information item to the first and second network devices through said interface, in conformity with the selected countermeasure information item; and
when the failure event is not received again within a predetermined time period since the transmission of the selected countermeasure information item, said processing unit increases the frequence information corresponding to the selected countermeasure information item, by reference to said scenario table.
-
Specification