Computer system failure management with topology-based failure impact determinations
Abstract
The present invention provides for indicating devices that are impacted by the failure of another device. When workloads are subsequently allocated to devices, non-impacted devices are given priority over impacted devices as allocation targets.
28 Citations
16 Claims
1. A method comprising:
detecting a failure of a first device in a computer system;
identifying impacted and non-impacted devices of said computer system, said impacted devices being impacted by said failure, said non-impacted devices being less impacted as a group by said failure than said impacted devices as a group; and
assigning workloads to devices giving priority to said non-impacted devices over said impacted devices.
Dependent claims: 2, 3, 4, 5, 6, 7
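The three steps of claim 1 can be illustrated with a short Python sketch. It is one possible reading under stated assumptions, not the patented implementation: the `Device` class, its `impacted` flag and `free_slots` capacity field, and the `assign_workloads` function are hypothetical names, and the claim does not say how priority is realized. Here priority is modeled simply by trying non-impacted devices before impacted ones.

```python
from dataclasses import dataclass, field


@dataclass
class Device:
    name: str
    impacted: bool = False   # assumed flag: set when a failure elsewhere affects this device
    free_slots: int = 1      # assumed capacity: how many more workloads the device can take
    workloads: list = field(default_factory=list)


def assign_workloads(workloads, devices):
    """Place each workload, trying non-impacted devices before impacted ones."""
    # False sorts before True, so non-impacted devices come first.
    ordered = sorted(devices, key=lambda d: d.impacted)
    unplaced = []
    for workload in workloads:
        target = next((d for d in ordered if d.free_slots > 0), None)
        if target is None:
            unplaced.append(workload)   # no capacity left anywhere
            continue
        target.workloads.append(workload)
        target.free_slots -= 1
    return unplaced


devices = [
    Device("srv-a", impacted=True, free_slots=2),
    Device("srv-b", free_slots=1),
]
left_over = assign_workloads(["w1", "w2", "w3", "w4"], devices)
```

In this sketch impacted devices remain eligible targets; they are only used once the non-impacted devices are full, which matches "giving priority" rather than excluding them outright.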
8. A computer system comprising:
resource servers for running user workloads;
management servers for managing said resource servers;
network infrastructure devices for managing communications among said management and resource servers;
a change database in computer-readable storage media indicating direct dependency relationships of said resource servers on said network infrastructure devices and said management servers;
a computer executable failure monitor in computer-readable media for detecting a failure of one of said infrastructure devices or said management devices; and
a failure marker for updating said database to indicate which of said resource servers is impacted by said failure.
Dependent claims: 9, 10, 11
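The change database and failure marker of claim 8 can be pictured with a minimal Python sketch. Everything here is an assumption made for illustration: a plain dictionary stands in for the database, the device names are invented, and `mark_failure` is a hypothetical helper rather than anything named in the patent.

```python
# Hypothetical in-memory stand-in for the change database.
# Keys are resource servers; values are the network infrastructure devices
# and management servers each one depends on directly.
direct_deps = {
    "res-1": {"switch-1", "mgmt-1"},
    "res-2": {"switch-2", "mgmt-1"},
    "res-3": {"switch-2", "mgmt-2"},
}

# Impact flags stored alongside the dependency records.
impacted = {server: False for server in direct_deps}


def mark_failure(failed_device, deps, flags):
    """Flag every resource server that directly depends on the failed device."""
    for server, needs in deps.items():
        if failed_device in needs:
            flags[server] = True
    return flags


# A failure monitor would call the marker when it detects a failed device.
mark_failure("switch-2", direct_deps, impacted)
```

After the call, `res-2` and `res-3` are flagged as impacted while `res-1` is not, since only the first two list `switch-2` among their direct dependencies.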
12. Computer-readable storage media comprising:
a computer-executable failure monitor for detecting a failure of a device in a computer system;
a computer-executable database for indicating direct dependency relationships between devices of said computer system so that devices directly impacted by said failure can be identified;
a computer-executable topology generator for generating a dependency topology of said system from said database to identify devices indirectly impacted by said failure; and
a computer executable failure marker for updating said database to indicate which of said devices are impacted by said failure.
Dependent claims: 13, 14, 15, 16
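Claim 12 distinguishes devices directly impacted by a failure from those indirectly impacted, with the latter found through a dependency topology generated from the direct-dependency records. A minimal Python sketch of that idea follows. It assumes the topology is a directed graph and uses a breadth-first walk; the claim names neither, and `build_topology`, `impacted_by`, and the device names are hypothetical.

```python
from collections import defaultdict, deque


def build_topology(direct_deps):
    """Invert 'X depends on Y' records into 'Y is depended on by X' edges."""
    dependents = defaultdict(set)
    for device, needs in direct_deps.items():
        for needed in needs:
            dependents[needed].add(device)
    return dependents


def impacted_by(failed, dependents):
    """Return (directly impacted, indirectly impacted) device sets."""
    direct = set(dependents.get(failed, ()))
    seen = set(direct)
    queue = deque(direct)
    while queue:
        current = queue.popleft()
        for nxt in dependents.get(current, ()):
            if nxt not in seen and nxt != failed:
                seen.add(nxt)
                queue.append(nxt)
    # Anything reached beyond the first hop is only indirectly impacted.
    return direct, seen - direct


# Assumed direct-dependency records: each key depends on the devices in its set.
direct_deps = {
    "mgmt-1": {"switch-1"},
    "res-1": {"mgmt-1"},
    "res-2": {"mgmt-1", "switch-2"},
    "res-3": {"switch-2"},
}
topology = build_topology(direct_deps)
direct, indirect = impacted_by("switch-1", topology)
```

With these records, a failure of `switch-1` directly impacts `mgmt-1` and indirectly impacts `res-1` and `res-2`, while `res-3` is unaffected. A failure marker could then write both sets back to the database.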
Specification