Method of handling errors in software
First Claim
1. In a data processing system containing a plurality of individually identifiable data processing modules which allow the data processing system to execute data processing operations, a method of recovery from faults occurring in the modules comprising the steps of:
- detecting a presence of a fault caused by one of the data processing modules during the execution of one of the data processing operations; and
performing a fault processing routine in the data processing system including the substeps ofidentifying as a faulting module the one of the data processing modules that caused the detected fault,identifying a nature of the fault,determining, from the nature of the fault, whether the data processing system is capable of continuing operation reliably despite the presence of the fault,disabling the faulting module from further operation with the data processing system if the data processing system is not capable of continuing operation reliably because of the fault, andpreventing the faulting module from causing additional faults without disabling the faulting module if the data processing system is capable of continuing operation reliably despite the fault.
2 Assignments
0 Petitions
Accused Products
Abstract
The software error handling determines the nature of the fault and takes different action depending upon the nature of the fault. If the fault prevents the data processing system from continued reliable operation, then the element causing the fault is immediately disabled. Otherwise, the element which is the source of the fault is treated so that it does no harm to the system and causes no further faults. The element can then be completely handled during normal software status checks.
130 Citations
31 Claims
-
1. In a data processing system containing a plurality of individually identifiable data processing modules which allow the data processing system to execute data processing operations, a method of recovery from faults occurring in the modules comprising the steps of:
-
detecting a presence of a fault caused by one of the data processing modules during the execution of one of the data processing operations; and performing a fault processing routine in the data processing system including the substeps of identifying as a faulting module the one of the data processing modules that caused the detected fault, identifying a nature of the fault, determining, from the nature of the fault, whether the data processing system is capable of continuing operation reliably despite the presence of the fault, disabling the faulting module from further operation with the data processing system if the data processing system is not capable of continuing operation reliably because of the fault, and preventing the faulting module from causing additional faults without disabling the faulting module if the data processing system is capable of continuing operation reliably despite the fault. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method of recovering from faults occurring in a data processing system comprising the steps of:
-
detecting a presence of a fault in a first data processor of a pair of data processors in the data processing system, each of said data processors including a central processing unit, (CPU), means for controlling a memory, and a system bus interconnecting said CPU and said memory control means; performing a fault processing routine in the data processing system including the substeps of; identifying as a faulting data processor one of the pair of data processors in which the fault occurred; identifying a nature of the fault; determining from the nature of the fault whether the faulting data processor is capable of continuing operation reliable despite the presence of the fault; disabling the pair of data processors from further operation with the data processing system if the faulting data processor is not capable of continuing operation reliably because of the fault; and preventing the faulting data processor from causing additional faults without disabling the faulting data processor if the data processing system is capable of continuing operation reliably despite the fault. - View Dependent Claims (24, 25, 26)
-
-
27. A method of recovering from faults occurring in a data processing system comprising the steps of:
-
detecting a presence of a fault in a first zone of a dual zone data processing system with each zone including a pair of data processors, and each of said data processors including a CPU, means for controlling a memory, and a system bus interconnecting said CPU and said memory control means; performing a fault processing routine in the data processing system including the substeps of; identifying as a faulting zone the zone of the dual zone data processing system in which the fault occurred; identifying a nature of the fault; determining from the nature of the fault whether the data processing system is capable of continuing operation reliably despite the presence of a fault; disabling the faulting zone from further operation with the data processing system if the faulting zone is not capable of reliably continuing operation with the data processing system because of the fault in the faulting zone; and preventing the faulting zone from causing additional faults without disabling the faulting zone if the data processing system is capable of reliably continuing operation with the faulting zone despite the fault in the faulting zone. - View Dependent Claims (28, 29, 30, 31)
-
Specification