Serviceability and test infrastructure for distributed systems
Abstract
A method and system for capturing a state of a distributed computer system are provided. The state is captured in response to an error or event message received by one of the client and/or server nodes of the system. In response to receipt of the error or event message, the recipient initiates transmission of a special protocol message to the affected members of the system. Upon receipt of the message, each recipient freezes its operating system image. Depending upon the characteristics of the error or event, the message may be transmitted to a selection of members of the system or to the entire system.
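The flow described in the abstract can be sketched as a small in-memory simulation. This is an illustrative sketch only, not the patented implementation: the names `Node`, `NodeState`, `affected_members`, and `handle_event` are hypothetical, message delivery is modeled as a direct method call, and the "operating system image" is stood in for by a plain dictionary.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional


class NodeState(Enum):
    RUNNING = "running"
    FROZEN = "frozen"


@dataclass
class Node:
    """Hypothetical cluster member (client or server node)."""
    name: str
    state: NodeState = NodeState.RUNNING
    memory: Dict[str, str] = field(default_factory=dict)  # stand-in for the OS image
    snapshot: Optional[Dict[str, str]] = None

    def on_freeze_message(self) -> None:
        """Capture the current state and stop further state transitions."""
        if self.state is NodeState.FROZEN:
            return  # already frozen; keep the first point-in-time image
        self.snapshot = dict(self.memory)
        self.state = NodeState.FROZEN

    def send(self, payload: str) -> bool:
        """Return True if the message would go out; a frozen node sends nothing."""
        return self.state is NodeState.RUNNING


def affected_members(cluster: Dict[str, Node],
                     scope: Optional[List[str]]) -> List[Node]:
    """Assumed policy: scope=None means the event affects the entire system."""
    if scope is None:
        return list(cluster.values())
    return [cluster[name] for name in scope if name in cluster]


def handle_event(cluster: Dict[str, Node],
                 scope: Optional[List[str]] = None) -> List[str]:
    """Deliver the freeze message to every affected member; return their names."""
    targets = affected_members(cluster, scope)
    for node in targets:
        node.on_freeze_message()
    return [node.name for node in targets]
```

For example, calling `handle_event(cluster, ["a", "b"])` on a three-node cluster freezes only `a` and `b`, while omitting the scope freezes every member. How a real system decides which members are affected depends on the characteristics of the error or event and is not modeled here.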
19 Claims
1. A method for servicing a computer system comprising:
- delivering a dedicated message to all nodes in said system affected by an event, wherein said nodes are members of a computer cluster with one or more nodes to coordinate access to a storage area network, and a cluster leader in said cluster to own one or more tasks for which members of said cluster require communication with said cluster leader to support a service; and
- capturing a state of at least two of said nodes responsive to receipt of said message, wherein the step of capturing a state of at least two of said nodes supports creation of a logical image of each of said nodes at a point in time, and prevents transition of said nodes to another state.
(Dependent claims: 2, 3, 4, 5, 6)
7. A computer system comprising:
- a processor;
- an application program executed by the processor, wherein the application program comprising;
- a coordinator to deliver a dedicated message to all nodes in said system affected by occurrence of an event, wherein said nodes are members of a computer cluster with one or more nodes to coordinate access to a storage area network and a cluster leader in said cluster for at least one function in said system; and
- a capture of a state on at least two of said nodes upon receipt of said message, wherein said capture of a state creates a logical image of said at least two nodes at a point in time and prevents transmission of a message from said at least two nodes.
(Dependent claims: 8, 9, 10, 11, 12)
13. An article comprising:
- a computer-readable storage medium;
- means in the medium for delivering a dedicated message to all nodes affected by an event, wherein said nodes are members of a computer cluster with one or more nodes to coordinate access to a storage area network and a cluster leader in said cluster for at least one function in said system; and
- means in the medium for initiating a capture of a state of at least two of said nodes upon receipt of said message, wherein said capture of a state is a logical image of said node, and wherein said capture of a state prevents transmission of a message from said node.
(Dependent claims: 14, 15, 16, 17, 18)
19. A method for servicing a computer system comprising:
- delivering an out-of-band disk based message to all nodes in said system, and in communication with a storage area network, affected by an event, wherein said nodes are members of a computer cluster with one or more nodes to coordinate access to a storage area network;
- using said message to freeze a state of at least two of said nodes upon receipt of said message; and
- preventing transmission of a message from said node having a frozen state.
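One way to picture the out-of-band, disk based message of claim 19 is a small "mailbox" on storage that every node can reach, which nodes check independently of the normal network path. The sketch below is an assumption-laden illustration, not the claimed mechanism: a local directory stands in for the shared disk, and `FREEZE_MAILBOX`, `post_freeze`, `poll_freeze`, and `DiskNode` are hypothetical names.

```python
import json
from pathlib import Path
from typing import List

FREEZE_MAILBOX = "freeze.json"  # hypothetical mailbox name on the shared disk


def post_freeze(shared: Path, event_id: str, targets: List[str]) -> None:
    """Coordinator side: write the freeze message to the shared disk."""
    tmp = shared / (FREEZE_MAILBOX + ".tmp")
    tmp.write_text(json.dumps({"event": event_id, "targets": targets}))
    # Rename into place so readers do not observe a half-written message.
    tmp.replace(shared / FREEZE_MAILBOX)


def poll_freeze(shared: Path, node_name: str) -> bool:
    """Node side: return True if a freeze message names this node."""
    mailbox = shared / FREEZE_MAILBOX
    if not mailbox.exists():
        return False
    message = json.loads(mailbox.read_text())
    return node_name in message["targets"]


class DiskNode:
    """Hypothetical node that polls the shared disk for freeze messages."""

    def __init__(self, name: str, shared: Path) -> None:
        self.name = name
        self.shared = shared
        self.frozen = False

    def tick(self) -> None:
        """One polling step: freeze if the mailbox targets this node."""
        if not self.frozen and poll_freeze(self.shared, self.name):
            self.frozen = True

    def send(self, payload: str) -> bool:
        """A node with a frozen state transmits nothing."""
        return not self.frozen
```

In this sketch a node only notices the message on its next `tick`, so the polling interval bounds how quickly the freeze takes effect. A real storage area network deployment would need its own conventions for where the message lives and how it is cleared, which the claim text does not specify.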
Specification