Method and apparatus for indicating the severity of a fault within a computer system
First Claim
1. In an electrical apparatus having a plurality of hardware elements and a number of error detection circuits, the number of error detection circuits being coupled to predetermined ones of the plurality of hardware elements, each of the number of error detection circuits continuously monitoring a corresponding one of the predetermined ones of the plurality of hardware elements and providing a corresponding error bit whenever an error is detected thereby, each of the corresponding error bits being serviced by a support controller, the improvement of the electrical apparatus comprising:
- a. receiving means coupled to at least one of the number of error detection circuits for receiving a corresponding error bit from each of said at least one of the number of error detection circuits; and
b. determining means coupled to said receiving means for determining which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced immediately by the support controller, and which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced at a predetermined time later by the support controller.
9 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for efficiently identifying and indicating the severity of the fault within a computer system. In an exemplary embodiment of the present invention, the circuitry of a computer system may be divided into a number of groups. Each group may contain circuitry which may result in the same fault type. For example, predetermined circuitry which, when a fault is detected therein, may have a minimal affect on the normal operation of the computer system may be provided in a first group. Similarly, predetermined circuitry which, when a fault is detected therein, may have an immediate affect on the normal operation of the computer system may be provided in a second group. Each group may provide an error priority signal to a support controller. The support controller may interpret the number of error priority signals provided by the number of groups and may determine the appropriate time to take corrective action thereon.
-
Citations
48 Claims
-
1. In an electrical apparatus having a plurality of hardware elements and a number of error detection circuits, the number of error detection circuits being coupled to predetermined ones of the plurality of hardware elements, each of the number of error detection circuits continuously monitoring a corresponding one of the predetermined ones of the plurality of hardware elements and providing a corresponding error bit whenever an error is detected thereby, each of the corresponding error bits being serviced by a support controller, the improvement of the electrical apparatus comprising:
-
a. receiving means coupled to at least one of the number of error detection circuits for receiving a corresponding error bit from each of said at least one of the number of error detection circuits; and b. determining means coupled to said receiving means for determining which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced immediately by the support controller, and which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced at a predetermined time later by the support controller. - View Dependent Claims (2, 3)
-
-
4. A data processing system for executing a number of sequential instructions including a current instruction, comprising:
-
a. a first group of hardware elements and a second group of hardware elements wherein said first group of hardware elements further includes a first error detecting means and said second group of hardware elements further includes a second error detection means, said first error detection means continuously monitoring preselected ones of said first group of hardware elements and detecting a number of predefined errors in the preselected ones of said first group of hardware elements, and said second error detecting means continuously monitoring predefined ones of said second group of hardware elements and detecting a number of predefined errors in the preselected ones of said second group of hardware elements; and b. servicing means coupled to said first group of hardware element and to said second group of hardware elements for servicing said number of predefined errors detected by said first error detecting means and said number of predefined errors detected by said second error detecting means, said servicing means servicing said predefined errors detected in said first group of hardware elements before servicing said predefined errors detected in said second group of hardware elements. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A data processing system for executing a number of sequential instructions including a current instruction, comprising:
-
a. a support controller; b. a first group of hardware element; c. a number of first error detecting means coupled to preselected ones of said first group of hardware elements, said number of first error detecting means for continuously monitoring the preselected ones of said first group of hardware elements and detecting a number of corresponding first errors in said preselected ones of said first group of hardware elements; d. first error storage means coupled to said number of first error detecting means for storing said number of corresponding first errors; e. first disabling means coupled to said first error storage means for disabling said first error storage means once a first one of said number of corresponding first errors is detected; f. first providing means coupled to said first error storage means and further coupled to said support controller for providing a first fault type to said support controller when said first one of said number of corresponding first errors is detected; g. a second group of hardware elements; h. a number of second error detecting means coupled to preselected ones of said second group of hardware elements, said number of second error detecting means for continuously monitoring the preselected ones of said second group of hardware elements and detecting a number of corresponding second errors in said preselected ones of said second group of hardware elements; i. second error storage means coupled to said number of second error detecting means for storing said number of corresponding second errors; j. second disabling means coupled to said second error storage means for disabling said second error storage means once a first one of said number of corresponding second errors is detected; k. second providing means coupled to said second error storage means and further coupled to said support controller for providing a second fault type to said support controller when the first one of said number of corresponding second errors is detected; and l. servicing means coupled to said support controller wherein said servicing means services said first fault type before servicing said second fault type. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. In an electrical apparatus having a plurality of hardware elements and a number of error detection circuits, the number of error detection circuits being coupled to predetermined ones of the plurality of hardware elements, each of the number of error detection circuits continuously monitoring a corresponding one of the predetermined ones of the plurality of hardware elements and providing a corresponding error bit whenever an error is detected thereby, each of the corresponding error bits being serviced by a support controller, the improvement of the electrical apparatus comprising:
-
a. a receiving circuit coupled to at least one of the number of error detection circuits for receiving a corresponding error bit from each of said at least one of the number of error detection circuits; and b. determining circuit coupled to said receiving circuit for determining which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced immediately by the support controller, and which of the corresponding error bits provided by said at least one of the number of error detection circuits are to be serviced at a predetermined time later by the support controller. - View Dependent Claims (25, 26)
-
-
27. A data processing system for executing a number of sequential instructions including a current instruction, comprising:
-
a. a first group of hardware elements and a second group of hardware elements wherein said first group of hardware elements further includes a first error detecting circuit and said second group of hardware elements further includes a second error detection circuit, said first error detection circuit continuously monitoring preselected ones of said first group of hardware elements and detecting a number of predefined errors in the preselected ones of said first group of hardware elements each of said, and said second error detecting circuit continuously monitoring preselected ones of said second group of hardware elements and detecting a number of predefined errors in the preselected ones of said second group of hardware elements; and b. a servicing circuit coupled to said first group of hardware element and to said second group of hardware elements for servicing said number of predefined errors detected by said first error detecting circuit and said number of predefined errors detected by said second error detecting circuit, said servicing circuit servicing said predefined errors detected in said first group of hardware elements before servicing said predefined errors detected in said second group of hardware elements. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A data processing system for executing a number of sequential instructions including a current instruction, comprising:
-
a. a support controller; b. a first group of hardware elements; c. a number of first error detecting circuits coupled to preselected ones of said first group of hardware elements, said number of first error detecting circuits for continuously monitoring a corresponding one of the preselected ones of said first group of hardware elements and detecting a number of corresponding first errors in said preselected ones of said first group of hardware elements; d. a first error storage circuit coupled to said number of first error detecting circuits for storing said number of corresponding first errors; e. a first disabling circuit coupled to said first error storage circuit for disabling said first error storage circuit once a first one of said number of corresponding first errors is detected; f. a first providing circuit coupled to said first error storage circuit and further coupled to said support controller for providing a first fault type to said support controller when said first one of said number of corresponding first errors is detected; g. a second group of hardware elements; h. a number of second error detecting circuits coupled to preselected ones of said second group of hardware elements, said number of second error detecting circuits for continuously monitoring a corresponding one of the preselected ones of said second group of hardware elements and detecting a number of corresponding second errors in said preselected ones of said second group of hardware elements; i. a second error storage circuit coupled to said number of second error detecting circuit for storing said number of corresponding second errors; j. a second disabling circuit coupled to said second error storage circuit for disabling said second error storage circuit once a first one of said number of corresponding second errors is detected; k. a second providing circuit coupled to said second error storage circuit and further coupled to said support controller for providing a second fault type to said support controller when the first one of said number of corresponding second errors is detected; and l. a servicing circuit coupled to said support controller wherein said servicing circuit services said first fault type before servicing said second fault type. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
-
-
47. A method for servicing errors within a computer system, the computer system having a number of hardware elements therein, each of the number of hardware elements capable of experiencing an error, and each of the errors having a predefined effect on the operation of the computer system, the number of hardware elements being partitioned into at least two groups wherein an error detected in any of the number of hardware elements of a particular one of the at least two groups having the same predefined effect on the operation of the computer system, the method comprising the steps of:
-
a. providing an error detecting means for each of said at least two groups for continuously monitoring the number of hardware elements in said corresponding one of said at least two groups and detecting errors in said corresponding one of said at least two groups; b. providing a predefined condition for each of said at least two groups; c. executing a program on the computer system; d. detecting an error in at least one of said at least two groups via said error detecting means of step (a); e. determining what group produced the detected error; and f. servicing the detected error when said predefined condition of said corresponding group is satisfied.
-
-
48. A method for servicing errors within a computer system, the computer system having a number of hardware elements therein, each of the number of hardware elements capable of experiencing an error, and each of the errors having a predefined effect on the operation of the computer system, the number of hardware elements being partitioned into a first group of hardware elements and a second group of hardware elements wherein an error in any of the number of hardware elements of the first group of hardware elements causes a critical error in the operation of the computer system and an error in any of the number of hardware elements of the second group of hardware elements causes a passive error in the operation of the computer system, the method comprising the steps of:
-
a. executing a program on the computer system; b. detecting an error; c. determining if the detected error was produced by the first group of hardware elements or by the second group of hardware elements; d. aborting the operation of the computer system and servicing the detected error if said determining step (c) determined that the detected error was produced by the first group of hardware elements; and e. servicing the detected error a predetermined time later if said determining step (c) determined that the detected error was produced by said second group of hardware elements.
-
Specification