Hierarchical fault management in computer systems
Abstract
Computer systems and methods of data processing are disclosed in which hierarchical levels of fault/event management are provided that intelligently monitor hardware and software and proactively take action in accordance with a defined fault policy. A fault policy based on a defined hierarchy ensures that for each particular type of failure the most appropriate action is taken. In one embodiment, a master Software Resiliency Manager (SRM) serves as the top hierarchical level fault/event manager, with one or more slave SRMs serving as the next hierarchical level fault/event manager. The software applications resident on each board can also include sub-processes (e.g., local resiliency managers or LRMs) that serve as the lowest hierarchical level fault/event managers.
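The three-level hierarchy described here, combined with the escalation rule recited in claim 1, can be illustrated with a minimal Python sketch. It is not the patented implementation. The class names, the `scope` and `affected` fields, and the resource labels are all hypothetical. The sketch also assumes that each manager's scope includes the resources of its subordinates, so "affects resources managed by other event managers" reduces to "affects something outside my scope".

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Event:
    kind: str
    # Resources the response to this event would touch (hypothetical field).
    affected: Set[str]


@dataclass
class EventManager:
    name: str
    # Resources this manager is responsible for, assumed to include
    # the resources of every manager below it.
    scope: Set[str]
    parent: Optional["EventManager"] = None
    handled: List[str] = field(default_factory=list)

    def handle(self, event: Event) -> "EventManager":
        """Handle the event locally, or escalate it one level up."""
        outside_scope = not event.affected <= self.scope
        if outside_scope and self.parent is not None:
            # The response would affect another manager's resources,
            # so pass the event to the next higher level.
            return self.parent.handle(event)
        self.handled.append(event.kind)
        return self


# Top level: master SRM. Mid level: slave SRM. Lowest level: LRM.
master = EventManager("master-SRM", {"board1.app", "board1.cpu", "board2.cpu"})
slave = EventManager("slave-SRM-1", {"board1.app", "board1.cpu"}, parent=master)
lrm = EventManager("LRM-app", {"board1.app"}, parent=slave)

events = [
    Event("restart-app", {"board1.app"}),
    Event("reset-board", {"board1.app", "board1.cpu"}),
    Event("fail-over", {"board1.cpu", "board2.cpu"}),
]
# Every event is first detected at the lowest level.
handlers = [lrm.handle(e).name for e in events]
print(handlers)
```

In this sketch an application restart stays with the LRM, a board reset is escalated to the slave SRM, and a fail-over that touches a second board reaches the master SRM. A real fault policy would also decide which action to take at each level, which this example leaves out.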
12 Claims
1. A hierarchical event management system for use in a computer system comprising:
a top hierarchical level event manager; and
one or more lower level event managers, wherein upon detection of an event, lower level event managers determine whether responses to the event will affect resources managed by other event managers and if so, the lower level event manager escalates the event to the next higher level event manager, wherein one or more lower level event managers comprise one or more mid-level event managers and one or more lowest level event managers.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
Specification