High density compute center resilient booting
First Claim
Patent Images
1. A method, comprising:
- initializing a plurality of processing systems;
communicating status information about the operational health of each of the processing systems capable of operation to a management module responsible for managing the processing systems; and
reinitializing one or more of the processing systems, if the management module determines that the one or more of the processing systems is operating in a degraded state based on the status information communicated to the management module from each of the processing systems capable of operation.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method to implement a resilient compute center. A plurality of processing systems is initialized. Each of the processing systems capable of operation communicates status information about its operational health to a management module responsible for managing the processing systems. The management module reinitializing any of the processing systems, if the management module determines that any of the processing systems is operating in a degraded state based on the status information communicated to the management module.
23 Citations
22 Claims
-
1. A method, comprising:
-
initializing a plurality of processing systems;
communicating status information about the operational health of each of the processing systems capable of operation to a management module responsible for managing the processing systems; and
reinitializing one or more of the processing systems, if the management module determines that the one or more of the processing systems is operating in a degraded state based on the status information communicated to the management module from each of the processing systems capable of operation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A processing blade, comprising
at least one processor to execute instructions; -
system memory coupled to the at least one processor;
a communication link to communicatively couple to a management module for managing a rack of processing blades including the processing blade; and
an error module configured to generate status information about the operational health of the processing blade and to communicate the status information to the management module via the communication link. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A system, comprising:
-
a chassis;
a management module supported by the chassis;
a communication plane coupled to the management module; and
a plurality of processing blades supported by the chassis and coupled to the communication plane, one of the processing blades including;
at least one processor to execute instructions;
system memory coupled to the at least one processor; and
a surrogate management module, the surrogate management module coupled to query the management module to determine an operational health of the management module and to assume management duties of the management module if the management module is disabled, the management module coupled to the communication plane to manage the plurality of processing blades. - View Dependent Claims (19, 20, 21, 22)
-
Specification