Managing a fault tolerant system
First Claim
Patent Images
1. A method of monitoring a fault tolerant system, comprising:
- maintaining an abstract model of the fault tolerant system;
monitoring operation of the fault tolerant system;
applying fault events received from the fault tolerant system to the abstract model; and
reporting one or more changes in the abstract model to a component in the fault tolerant system, wherein maintaining an abstract model of the fault tolerant system comprises;
receiving a configuration event indicating the removal of at least one component to the fault tolerant system; and
in response, removing at least one corresponding component from the abstract model; and
wherein removing at least one corresponding component from the abstract model comprises;
dissolving the logical association between the at least one corresponding component and one or more components dependent on the at least one corresponding component; and
recalculating a state of the one or more components dependent on the at least one corresponding component.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for managing a fault tolerant system are disclosed. In one implementation a system for managing a fault tolerant system comprises a configuration manager that receives configuration events from the fault tolerant system, a fault normalizer that receives fault events from the fault tolerant system; and a fault tolerance logic engine that constructs a model of the fault tolerant system based on inputs from the configuration manager and generates reporting events in response to inputs from the fault normalizer.
-
Citations
26 Claims
-
1. A method of monitoring a fault tolerant system, comprising:
-
maintaining an abstract model of the fault tolerant system;
monitoring operation of the fault tolerant system;applying fault events received from the fault tolerant system to the abstract model; and reporting one or more changes in the abstract model to a component in the fault tolerant system, wherein maintaining an abstract model of the fault tolerant system comprises; receiving a configuration event indicating the removal of at least one component to the fault tolerant system; and in response, removing at least one corresponding component from the abstract model; and wherein removing at least one corresponding component from the abstract model comprises; dissolving the logical association between the at least one corresponding component and one or more components dependent on the at least one corresponding component; and recalculating a state of the one or more components dependent on the at least one corresponding component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of monitoring a fault tolerant system, comprising:
-
maintaining an abstract model of the fault tolerant system; monitoring operation of the fault tolerant system;
applying fault events received from the fault tolerant system to the abstract model; andreporting one or more changes in the abstract model to a component in the fault tolerant system, wherein maintaining an abstract model of the fault tolerant system comprises; receiving a configuration event indicating an addition or a removal of at least one component to a logical group in the fault tolerant system; and in response, updating the logical group to reflect the addition or removal of the at least one component; and
recalculating a state of a group of components dependent on the at least one component, and wherein recalculating a state of one or more components dependent on the at least one component comprises propagating a failure state associated with the at least one component to a group of components dependent on the at least one component;maintaining a failure indicator that represents one or more failure parameters in the group of components dependent on the at least one component; and setting a group state parameter to indicate a group failure if the failure indicator exceeds a threshold. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
Specification