System and method for fault detection and recovery
First Claim
1. A method for automatically detecting and recovering from a fault in a microprocessor-based system, comprising:
capturing the fault, by the microprocessor-based system, as an event;
processing the event, by the microprocessor-based system, including thresholding the event and co-relating the event to a cause;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining a recovery action, by the microprocessor-based system, as a function of the thresholding, the co-relating, and the elapsed time the microprocessor-based system has been running, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Abstract
An apparatus and method for automatically detecting and recovering from a fault in a microprocessor-based system. The apparatus and method utilize a leaky bucket routine and an event handler procedure. The method may further use object-oriented techniques that abstract the differences between hardware and software faults, allowing a common framework to be developed.
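The abstract names a leaky bucket routine without giving its details. As a rough illustration only, a leaky bucket used for fault thresholding might look like the sketch below. The class name, the `capacity` and `leak_per_second` parameters, and the rule that the threshold is crossed when the level exceeds the capacity are all assumptions made for this example, not details taken from the patent.

```python
class LeakyBucket:
    """Counts fault events; the count drains at a steady rate over time."""

    def __init__(self, capacity: float, leak_per_second: float) -> None:
        self.capacity = capacity                # hypothetical threshold level
        self.leak_per_second = leak_per_second  # hypothetical drain rate
        self.level = 0.0
        self.last_time = 0.0

    def add_event(self, now: float) -> bool:
        """Record one fault at time `now` (seconds); True if the threshold is crossed."""
        elapsed = max(0.0, now - self.last_time)
        # Drain the bucket for the time that has passed since the last event.
        self.level = max(0.0, self.level - elapsed * self.leak_per_second)
        self.last_time = now
        self.level += 1.0
        return self.level > self.capacity


# A burst of faults fills the bucket faster than it drains.
burst_bucket = LeakyBucket(capacity=3, leak_per_second=0.5)
burst = [burst_bucket.add_event(t) for t in (0.0, 0.1, 0.2, 0.3)]

# The same number of faults spread far apart never crosses the threshold.
sparse_bucket = LeakyBucket(capacity=3, leak_per_second=0.5)
sparse = [sparse_bucket.add_event(t) for t in (0.0, 10.0, 20.0, 30.0)]
```

The point of the leak is that isolated faults are forgotten over time, while a burst of faults in a short window trips the threshold.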
18 Citations
22 Claims
1. A method for automatically detecting and recovering from a fault in a microprocessor-based system, comprising:
capturing the fault, by the microprocessor-based system, as an event;
processing the event, by the microprocessor-based system, including thresholding the event and co-relating the event to a cause;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining a recovery action, by the microprocessor-based system, as a function of the thresholding, the co-relating, and the elapsed time the microprocessor-based system has been running, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
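The steps of claim 1 can be read as a small pipeline: capture, threshold, correlate, measure uptime, choose an action, perform it. The sketch below is one hypothetical way to wire those steps together. The fault codes, the `CAUSES` table, the simple per-code counter used for thresholding, the one-hour uptime cutoff, and the policy in `choose_recovery` are all invented for illustration; the claim does not specify any of them.

```python
from dataclasses import dataclass


@dataclass
class Event:
    source: str       # component that reported the fault
    code: str         # fault code
    timestamp: float  # seconds on the system clock


# Hypothetical mapping from fault code to cause (not from the patent).
CAUSES = {"alloc_failed": "memory_leak", "watchdog": "task_hang"}


def choose_recovery(over_threshold: bool, cause: str, uptime_s: float) -> str:
    """Illustrative policy only: pick an action from threshold, cause, and uptime."""
    if not over_threshold:
        return "none"  # assumption: below the threshold the event is only recorded
    if cause == "memory_leak" and uptime_s >= 3600:
        return "cleanup_memory"
    return "restart"


class FaultHandler:
    def __init__(self, threshold: int, boot_time: float) -> None:
        self.threshold = threshold
        self.boot_time = boot_time
        self.counts = {}     # per-code fault counts
        self.performed = []  # actions are recorded here instead of executed

    def handle(self, source: str, code: str, now: float) -> str:
        event = Event(source, code, now)                  # capture the fault
        self.counts[code] = self.counts.get(code, 0) + 1  # threshold the event
        over = self.counts[code] >= self.threshold
        cause = CAUSES.get(code, "unknown")               # correlate to a cause
        uptime = now - self.boot_time                     # elapsed running time
        action = choose_recovery(over, cause, uptime)
        if action != "none":
            self.performed.append((action, event.source))  # "perform" the action
            self.counts[code] = 0
        return action


handler = FaultHandler(threshold=2, boot_time=0.0)
first = handler.handle("heap", "alloc_failed", 7200.0)
second = handler.handle("heap", "alloc_failed", 7201.0)

fresh = FaultHandler(threshold=1, boot_time=0.0)
early = fresh.handle("scheduler", "watchdog", 30.0)
```

Here the first allocation failure stays under the threshold, the second triggers a memory cleanup because the system has been up for two hours, and a watchdog fault shortly after boot leads to a restart.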
11. A method for automatically detecting and recovering from a fault in a microprocessor-based system defined as a hierarchical set of objects to model software, hardware, and external entities the system communicates with, the method comprising:
capturing, by the microprocessor-based system, the fault as an event;
processing, by the microprocessor-based system, the event including thresholding the event and co-relating the event to a cause, the thresholding using a leaky bucket algorithm and error counting;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining, by the microprocessor-based system, a recovery action as a function of the thresholding, the co-relating, and the elapsed time the microprocessor-based system has been running, relationships imposed by the hierarchical set of objects, event histories, and system state information, wherein root cause correlation of events occurs across all domains and hierarchies of the system to determine recovery actions that provide for specific recoveries and system escalations, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Dependent claims: 12
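Claim 11 ties recovery to a hierarchy of objects, with specific recoveries at a low level and escalations higher up. One hypothetical way to picture this is a tree in which each object counts its own errors and hands the problem to its parent once its count passes a threshold. The `Node` class, the thresholds, and the choice of `cleanup_memory` for a local recovery versus `restart` for an escalated one are assumptions for this sketch. It uses plain error counting only and leaves out the leaky bucket, event histories, and system state the claim also mentions.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    name: str
    threshold: int                 # errors tolerated before escalating
    parent: Optional[Node] = None
    errors: int = 0


def report_fault(node: Node) -> tuple[str, str]:
    """Return (action, target name) for a fault reported against `node`."""
    current = node
    while True:
        current.errors += 1
        if current.errors <= current.threshold:
            # Within budget: recover locally, or restart the ancestor reached.
            action = "cleanup_memory" if current is node else "restart"
            return (action, current.name)
        current.errors = 0
        if current.parent is None:
            return ("restart", current.name)  # nothing left to escalate to
        current = current.parent              # escalate one level up


# Hypothetical three-level hierarchy: system -> card -> port.
system = Node("system", threshold=1)
card = Node("card", threshold=1, parent=system)
port = Node("port", threshold=2, parent=card)

results = [report_fault(port) for _ in range(6)]
```

With these thresholds, repeated port faults are first handled at the port, then escalate to a card restart, and eventually to a system restart.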
13. A method of automatically detecting and recovering from a fault in a microprocessor-based system, comprising:
capturing, by the microprocessor-based system, the fault as an event;
processing, by the microprocessor-based system, the event including thresholding the event and co-relating the event to a cause selected from a list of potential causes, the list of potential causes including potential hardware causes and potential software causes;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining, by the microprocessor-based system, a recovery action as a function of the thresholding and the co-relating and the elapsed time the microprocessor-based system has been running, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Dependent claims: 14, 15, 16
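Claim 13 selects the cause from a list that mixes hardware and software candidates. A minimal sketch of such a selection, assuming each candidate cause is described by a set of symptom codes and the best match is the one sharing the most codes with recent events, is shown below. The cause names, symptom codes, and the overlap-based scoring are hypothetical and chosen only to make the idea concrete.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cause:
    name: str
    kind: str            # "hardware" or "software"
    symptoms: frozenset  # event codes this cause is assumed to produce


# Hypothetical list of potential causes covering both kinds.
POTENTIAL_CAUSES = [
    Cause("failed_memory_module", "hardware", frozenset({"ecc_error", "alloc_failed"})),
    Cause("memory_leak", "software", frozenset({"alloc_failed", "heap_low"})),
    Cause("hung_task", "software", frozenset({"watchdog"})),
]


def correlate(recent_codes):
    """Return the cause whose symptoms overlap most with the recent codes, or None."""
    seen = set(recent_codes)
    best = max(POTENTIAL_CAUSES, key=lambda cause: len(cause.symptoms & seen))
    return best if best.symptoms & seen else None


software_hit = correlate(["alloc_failed", "heap_low"])
hardware_hit = correlate(["ecc_error"])
no_hit = correlate(["link_down"])
```

Because both kinds of cause sit in one list, the same matching step can attribute an allocation failure to either a software leak or a failing memory module, depending on what other events accompany it.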
17. A method of automatically detecting and recovering from a fault in a microprocessor-based system, comprising:
capturing, by the microprocessor-based system, the fault as an event;
processing, by the microprocessor-based system, the event including thresholding the event and co-relating the event to a cause selected from a list of potential causes, the list of potential causes including potential causes internal to the microprocessor-based system and potential causes external to the microprocessor-based system;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining, by the microprocessor-based system, a recovery action as a function of the thresholding and the co-relating and the elapsed time the microprocessor-based system has been running, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Dependent claims: 18, 19, 20
21. A method for recovering from a fault in a microprocessor-based system, the method comprising:
capturing, by the microprocessor-based system, the fault as an event;
determining, by the microprocessor-based system, an elapsed time the microprocessor-based system has been running;
determining, by the microprocessor-based system, a recovery action as a function of the elapsed time the microprocessor-based system has been running, and the recovery action is used to perform at least one of a restart and a cleanup of memory; and
performing, by the microprocessor-based system, the recovery action.
Dependent claims: 22
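Claim 21 makes the recovery action depend on elapsed running time alone. The sketch below shows one way uptime could be measured and fed into such a decision. The 24-hour cutoff and the direction of the policy (restart when recently started, clean up memory when long-running) are assumptions for illustration; the claim only requires that the action be a function of the elapsed time.

```python
import time

# Assumption: the moment this module loads stands in for system start.
BOOT_TIME = time.monotonic()


def uptime_seconds(now=None) -> float:
    """Elapsed running time, using a monotonic clock so wall-clock changes do not matter."""
    if now is None:
        now = time.monotonic()
    return now - BOOT_TIME


def recovery_for_uptime(uptime_s: float, long_running_s: float = 24 * 3600) -> str:
    """Hypothetical policy: long-running systems get a memory cleanup, others a restart."""
    return "cleanup_memory" if uptime_s >= long_running_s else "restart"


after_one_minute = recovery_for_uptime(60)
after_two_days = recovery_for_uptime(48 * 3600)
current_choice = recovery_for_uptime(uptime_seconds())
```

A monotonic clock is used here because it cannot jump backwards when the wall-clock time is adjusted, which keeps the computed uptime non-negative.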
Specification