Method and system to recover a failed flash of a blade service processor in a server chassis
Abstract
A method and system for recovering a server blade, in a multiple server blade computer, that is lost during a flash update operation on a service processor in the server blade. Because of the flash update failure, the lost server blade is unaware of its location on a management pathway, which in an exemplary form may be a midplane or a bus, which connects the server blade and a management module in a server blade chassis. The lost server blade puts a signal on the management pathway indicating that the flash failed. The signal is put on a special channel reserved for such messages. The management module receives the signal, and then determines which of the multiple server blades are lost due to the flash update failure.
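The signalling path described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Blade` and `ManagementPathway` classes, the `SPECIAL_ADDRESS` value, and the message strings are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical reserved address for flash-failure messages (assumption).
SPECIAL_ADDRESS = 0xFF


@dataclass
class ManagementPathway:
    """Toy stand-in for the midplane or bus: records (address, payload) messages."""
    messages: list = field(default_factory=list)

    def send(self, address: int, payload: str) -> None:
        self.messages.append((address, payload))


@dataclass
class Blade:
    """Toy server blade whose service processor may lose its pathway address."""
    address: Optional[int]

    def flash(self, pathway: ManagementPathway, succeeded: bool) -> None:
        if succeeded:
            pathway.send(self.address, "FLASH_OK")
            return
        # A failed flash leaves the blade unaware of its own location,
        # so it can only report on the reserved special address.
        self.address = None
        pathway.send(SPECIAL_ADDRESS, "FLASH_FAILED")


pathway = ManagementPathway()
Blade(address=3).flash(pathway, succeeded=True)
Blade(address=5).flash(pathway, succeeded=False)

# The management module sees the failure only on the special address.
failures = [m for m in pathway.messages if m[0] == SPECIAL_ADDRESS]
```

Because the failure message carries no sender address, the management module still has to work out which blade sent it.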
19 Claims
1. A method comprising:
attempting to download computer code from a management module to a first data processing unit, the first data processing unit being one of a plurality of data processing units managed by the management module, the plurality of data processing units being coupled to the management module by a management pathway;
in response to the computer code failing to properly download and thus resulting in the first data processing unit being unable to define its management pathway location on the management pathway, transmitting a failure signal from the first data processing unit to the management module using a special address on the management pathway;
comparing management pathway locations of currently active data processing units with a list of all management pathway locations reserved for all of the plurality of data processing units, the currently active data processing units being from the plurality of data processing units; and
deducing the first data processing unit's management pathway location as being a management pathway location that is on the list of all management pathway locations but is not a management pathway location of one of the currently active data processing units.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A system comprising:
a management module;
a plurality of server blades coupled to midplane sockets in a multi-blade chassis;
a management pathway coupling the management module to the plurality of server blades;
a list of reserved management pathway locations that are reserved for the plurality of server blades on the management pathway; and
a presence detect device coupled to the management pathway, wherein if a download of computer code to a first server blade in the plurality of server blades fails and causes a management pathway location for the first server blade to become undefined for the first server blade, then the presence detect device detects a failure signal from the first server blade on a special address on the management pathway, thus resulting in the management module comparing the list of reserved management pathway locations with locations of server blades that are currently active on the management pathway to restore a management pathway location to the first server blade, wherein a failure to properly download the computer code to the first server blade results in the first server blade being unable to determine an address for a specific midplane socket in the multi-blade chassis.
Dependent claims: 13, 14, 15
16. A computer program product, residing on a non-transitory computer usable medium, comprising:
program code for attempting to download computer code from a management module to a first data processing unit, the first data processing unit being one of a plurality of data processing units managed by the management module, the plurality of data processing units being coupled to the management module by a management pathway;
program code for, in response to the computer code failing to properly download and thus resulting in the first data processing unit being unable to define its management pathway location on the management pathway, transmitting a failure signal from the first data processing unit to the management module using a special address on the management pathway;
program code for comparing management pathway locations of currently active data processing units with a list of all management pathway locations reserved for all of the plurality of data processing units, the currently active data processing units being from the plurality of data processing units; and
program code for deducing the first data processing unit's management pathway location as being a management pathway location that is on the list of all management pathway locations but is not a management pathway location of one of the currently active data processing units.
Dependent claims: 17, 18, 19
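The comparing and deducing steps recited in the independent claims amount to a set difference between the reserved locations and the active ones. The sketch below illustrates that idea only. The function name `deduce_lost_locations` and the integer addresses are hypothetical, and the sketch assumes every reserved location is populated by a data processing unit.

```python
def deduce_lost_locations(reserved, active):
    """Return reserved pathway locations with no currently active unit, sorted."""
    return sorted(set(reserved) - set(active))


# Hypothetical addresses for a six-slot chassis (illustrative values only).
reserved_locations = [1, 2, 3, 4, 5, 6]
active_locations = [1, 2, 3, 4, 6]

# Location 5 is reserved but not active, so it is deduced to be the lost unit.
lost = deduce_lost_locations(reserved_locations, active_locations)
print(lost)  # [5]
```

When several reserved locations are inactive, the function returns all of them as candidates.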
Specification