Synchronizing device error information among nodes
First Claim
Patent Images
1. A method, comprising:
- performing, by a first node, an action with respect to a first node error counter for a device in communication with the first node and a second node, wherein the first node operates as an owner of the device performing error handling for the device;
transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device;
performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device;
taking over, by the second node, ownership of the device; and
incrementing, by the second node, the second node error counter, indicating errors at the device detected by the first node, in response to the second node detecting an error at the device.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are a method, system, and article of manufacture for synchronizing device error information among nodes. A first node performs an action with respect to a first node error counter for a device in communication with the first node and a second node. The first node transmits a message to the second node indicating the device and the action performed with respect to the first node error counter for the device. The second node performs the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device.
19 Citations
28 Claims
-
1. A method, comprising:
-
performing, by a first node, an action with respect to a first node error counter for a device in communication with the first node and a second node, wherein the first node operates as an owner of the device performing error handling for the device; transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; taking over, by the second node, ownership of the device; and incrementing, by the second node, the second node error counter, indicating errors at the device detected by the first node, in response to the second node detecting an error at the device. - View Dependent Claims (2, 3, 4)
-
-
5. A method, comprising:
-
performing, by a first node, an open action with respect to a first node error counter for a device in communication with the first node and a second node; transmitting, by the first node, a message to the second node indicating the device and the open action the first node performed with respect to the first node error counter for the device; performing, by the second node, the open action indicated in the message by creating a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
6. A method, comprising:
-
performing, by a first node, an expire action with respect to a first node error counter for a device in communication with the first node and a second node; transmitting, by the first node, a message to the second node indicating the device and the expire action performed with respect to the first node error counter for the device; and performing, by the second node, the expire action by expiring a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
7. A method, comprising:
-
performing, by a first node, an action with respect to a first node first device error counter for a first device in communication with the first node and a second node; transmitting, by the first node, a message to the second node indicating the first device and the action performed with respect to the first node first device error counter for the first device; performing, by the second node, the action indicated in the message with respect to a second node first device error counter for the first device indicated in the message, wherein the second node first device error counter corresponds to the first node first device error counter for the first device; maintaining, by the second node, a second node second device error counter indicating a number of errors at a second device in communication with the first node and the second node; and maintaining, by the first node, a first node second device error counter indicating a number of errors at the second device including errors detected by the second node. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A method, comprising:
-
performing, by a first node, an action with respect to a first node error counter for a device in communication with the first node and a second node; transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; initiating, by the second node, a reinitialization operation; sending, by the second node, a join message to the first node as part of the reinitialization operation; sending, by the first node, a message to the second node indicating a value of the first node error counter for the device in response to the join message; and updating, by the second node, the second node error counter to the value indicated in the message sent in response to the join message.
-
-
15. A system in communication with a device, comprising:
-
a first node; and a second node, wherein the first node executes code to perform operations, the operations comprising; performing an action with respect to a first node error counter for the device in communication with the first node and the second node, wherein the first node operates as an owner of the device performing error handling for the device; transmitting a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; and wherein the second node executes code to perform operations, the operations comprising; performing the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; taking over ownership of the device; and incrementing the second node error counter, indicating errors at the device detected by the first node, in response to the second node detecting an error at the device. - View Dependent Claims (16)
-
-
17. A system in communication with a first device and a second device, comprising:
-
a first node; and a second node, wherein the first node executes code to perform operations, the operations comprising; performing an action with respect to a first node first device error counter for the first device in communication with the first node and the second node; transmitting a message to the second node indicating the first device and the action performed with respect to the first node first device error counter for the first device; and maintains maintaining a first node second device error counter indicating a number of errors at the second device including errors detected by the second node; and wherein the second node executes code to perform operations, the operations comprising; performing the action indicated in the message with respect to a second node first device error counter for the first device indicated in the message, wherein the second node first device error counter corresponds to the first node first device error counter for the first device; and maintaining a second node second device error counter indicating a number of errors at the second device in communication with the first node and the second node. - View Dependent Claims (18)
-
-
19. An article of manufacture comprising a computer readable storage medium including code executed by a first node and a second node, wherein a device is in communication with the first node and the second node, wherein the code is executed to cause operations, the operations comprising:
-
performing, by the first node, an action with respect to a first node error counter for the device, wherein the first node operates as an owner of the device performing error handling for the device; transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; taking over, by the second node, ownership of the device; and incrementing, by the second node, the second node error counter, indicating errors at the device detected by the first node, in response to the second node detecting an error at the device. - View Dependent Claims (20)
-
-
21. An article of manufacture comprising a computer readable storage medium including code executed by a first node and a second node, wherein a first device and a second device are in communication with the first node and the second node, wherein the code is executed to cause operations, the operations comprising:
-
performing, by the first node, an action with respect to a first node first device error counter for the first device; transmitting, by the first node, a message to the second node indicating the first device and the action performed with respect to the first node first device error counter for the first device; performing, by the second node, the action indicated in the message with respect to a second node first error counter for the first device indicated in the message, wherein the second node first device error counter corresponds to the first node first device error counter for the first device; maintaining, by the second node, a second node second device error counter indicating a number of errors at the second device in communication with the first node and the second node; and maintaining, by the first node, a first node second device error counter indicating a number of errors at the second device including errors detected by the second node. - View Dependent Claims (22)
-
-
23. A system in communication with a device, comprising:
-
a first node; and a second node, wherein the first node executes code to perform operations, the operations comprising; performing an open action with respect to a first node error counter for the device in communication with the first node and the second node; transmitting a message to the second node indicating the device and the open action performed with respect to the first node error counter for the device; and wherein the second node executes code to perform the open action indicated in the message by creating a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
24. A system in communication with a device, comprising:
-
a first node; and a second node, wherein the first node executes code to perform operations, the operations comprising; performing an expire action with respect to a first node error counter for the device in communication with the first node and the second node; transmitting a message to the second node indicating the device and the expire action performed with respect to the first node error counter for the device; and wherein the second node executes code to perform the expire action by expiring a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
25. A system in communication with a device, comprising:
-
a first node; and a second node, wherein the first node executes code and the second node executes code to perform operations, the operations comprising; performing, by the first node, an action with respect to a first node error counter for the device in communication with the first node and the second node; transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; initiating, by the second node, a reinitialization operation; sending, by the second node, a join message to the first node as part of the reinitialization operation; sending, by the first node, a message to the second node indicating a value of the first node error counter for the device in response to the join message; and updating, by the second node, the second node error counter to the value indicated in the message sent in response to the join message.
-
-
26. An article of manufacture comprising a computer readable storage medium including code executed by a first node and a second node, wherein a device is in communication with the first node and the second node, wherein the code is executed to cause operations, the operations comprising:
-
performing, by the first node, an open action with respect to a first node error counter for the device; transmitting, by the first node, a message to the second node indicating the device and the open action performed with respect to the first node error counter for the device; performing, by the second node, the open action indicated in the message by creating a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
27. An article of manufacture comprising a computer readable storage medium including code executed by a first node and a second node, wherein a device is in communication with the first node and the second node, wherein the code is executed to cause operations, the operations comprising:
-
performing an expire action with respect to a first node error counter for the device in communication with the first node and the second node; transmitting a message to the second node indicating the device and the expire action performed with respect to the first node error counter for the device; and performing, by the second node, the expire action by expiring a second node error counter for the device in response to the message, wherein the second node error counter corresponds to the first node error counter for the device.
-
-
28. An article of manufacture comprising a computer readable storage medium including code executed by a first node and a second node, wherein a device is in communication with the first node and the second node, wherein the code is executed to cause operations, the operations comprising:
-
performing, by the first node, an action with respect to a first node error counter for the device in communication with the first node and the second node; transmitting, by the first node, a message to the second node indicating the device and the action performed with respect to the first node error counter for the device; performing, by the second node, the action indicated in the message with respect to a second node error counter for the device indicated in the message, wherein the second node error counter corresponds to the first node error counter for the device; initiating, by the second node, a reinitialization operation; sending, by the second node, a join message to the first node as part of the reinitialization operation; sending, by the first node, a message to the second node indicating a value of the first node error counter for the device in response to the join message; and updating, by the second node, the second node error counter to the value indicated in the message sent in response to the join message.
-
Specification