Method and apparatus for monitoring and analyzing network appliance status information
First Claim
Patent Images
1. Apparatus for monitoring the status of multiple devices comprising:
- a first status message generator located in a first device;
a second status message generator located in a second device;
a first status monitor located in said second device;
a second status monitor located in said first device;
a communication network coupling said first device to said second device using a plurality of communications channels;
where said first status message generator periodically transmits status messages to said first status monitor using said plurality of communications channels and said second status message generator transmits status messages to said second status monitor using said plurality of communications channels; and
wherein said first and second status message generators comprise means for producing a status message, means for periodically transmitting the status message on said plurality of communications channels, and a disconnect counter for identifying a socket error and means for opening a new socket in response to an error.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for identifying a faulty device through generation of status messages within each device and monitoring the status messages of each other device. If status messages are not properly received, the apparatus invokes a fault analysis routine to identify the problem. The status messages are transmitted between the devices using a plurality of communications channels.
155 Citations
15 Claims
-
1. Apparatus for monitoring the status of multiple devices comprising:
-
a first status message generator located in a first device;
a second status message generator located in a second device;
a first status monitor located in said second device;
a second status monitor located in said first device;
a communication network coupling said first device to said second device using a plurality of communications channels;
where said first status message generator periodically transmits status messages to said first status monitor using said plurality of communications channels and said second status message generator transmits status messages to said second status monitor using said plurality of communications channels; and
wherein said first and second status message generators comprise means for producing a status message, means for periodically transmitting the status message on said plurality of communications channels, and a disconnect counter for identifying a socket error and means for opening a new socket in response to an error. - View Dependent Claims (2, 3, 4, 5)
a fault analyzer to determine which, if any, device has failed.
-
-
4. The apparatus of claim 3 wherein said fault analyzer comprises:
means for performing a distributed fault analysis by collecting status information from each of the devices.
-
5. The apparatus of claim 1 wherein said devices are storage controller modules coupled to a storage pool.
-
6. Apparatus for monitoring the status of multiple devices comprising:
-
a first status message generator located in a first device;
a second status message generator located in a second device;
a first status monitor located in said second device;
a second status monitor located in said first device;
a communications network coupling said first device to said second device using a plurality of communications channels;
where said first status message generator periodically transmits status messages to said first status monitor using said plurality of communications channels and said second status message generator transmits status messages to said second status monitor using said plurality of communications channels;
wherein said first and second status message monitors comprise means for opening a plurality of communications sockets, means for counting a number of received status messages, and means for invoking a fault analysis if the number of received status messages is less than a predefined number on at least one of the communications channels in said plurality of communications channels; and
wherein said means for counting comprises a counter that is initialized to a predefined number, means for incrementing the counter when a status message is received, and means for decrementing a counter on a periodic basis.
-
-
7. Apparatus for monitoring the status of multiple devices comprising:
-
a first status message generator located in a first device;
a second status message generator located in a second device;
a first status monitor located in said second device;
a second status monitor located in said first device;
a communications network coupling said first device to said second device using a plurality of communications channels;
where said first status message generator periodically transmits status messages to said first status monitor using said plurality of communications channels and said second status message generator transmits status messages to said second status monitor using said plurality of communications channels;
means for determining a quality of service metric for said first and second devices; and
means for comparing at least one quality of service metric of said first device to the quality of service metric of said second device to determine if either said first or second devices should failover. - View Dependent Claims (8)
-
-
9. A network of storage system comprising:
-
a first storage control module comprising a first status message generator and a second status message monitor;
a second storage control module comprising a second status message generator and a first status message monitor;
at least one storage array coupled to both said first and second storage control modules;
a communications network coupling said first storage control module to said second storage control module using a plurality of communications channels;
where said first status message generator periodically transmits status messages to said second status monitor using a plurality of communications channels and said second status message generator transmits status messages to said first status monitor using a plurality of communications channels; and
wherein said first and second status message generators comprise means for producing a status message, means for periodically transmitting the status message on said plurality of communications channels, and a disconnect counter for identifying a socket error and means for opening a new socket in response to an error.
-
-
10. A method of monitoring the status of multiple devices comprising:
-
periodically generating and transmitting status messages from a first status monitor to a second status monitor using a plurality of communications channels;
periodically generating and transmitting status messages from a second status monitor to a first status monitor using said plurality of communications channels, where said first or second status monitors invoke a failure analysis if a predefined number of status messages are not received on a communications channel;
identifying a socket error and opening a new socket in response to an error, wherein said identifying step further comprises setting a counter to a predefined value and decrementing the counter after each message is transmitted. - View Dependent Claims (11, 12, 13, 14)
invoking a fault analyzer to determine which, if any, device has failed.
-
-
13. The method of claim 12 wherein said fault analyzer comprises:
performing a distributed fault analysis by collecting status information from each of the devices.
-
14. The method of claim 10 wherein said devices are storage controller modules coupled to a storage pool.
-
15. A method of monitoring the status of multiple devices comprising:
-
periodically generating and transmitting status messages from a first status monitor to a second status monitor using a plurality of communications channels;
periodically generating and transmitting status messages from a second status monitor to a first status monitor using a plurality of communications channels, where said first or second status monitors invoke a failure analysis if a predefined number of status messages are not received on a communications channel;
opening a plurality of communications sockets;
counting a number of received status messages, wherein said counting step comprises initializing a counter to a predefined number, incrementing the counter when a status message is received, and decrementing a counter on a periodic basis; and
invoking a fault analysis if the number of received status messages is less than a predefined number on at least one of the communications channels in said plurality of communications channels.
-
Specification