Fault recovery system and method for adaptively updating order of command executions according to past results
Abstract
In a memory a number of entries are defined for mapping reference symptom levels of a server to fault restoration commands and to priority levels. In response to a status report from a fault detector indicating an operating state of the server, one of the commands is selected according to the priority levels corresponding to the reported state. The selected command is executed, and a result of the execution is estimated. In response to a subsequent status report, a comparison is made between the estimated result and an operating state indicated in the report. The priority levels are updated according to the comparison result. In a modification, status variables are mapped to the commands. A command is selected according to the status variables of entries to which a reported state corresponds. A success value is determined based on a result of execution of the command. The status variable of the selected command is updated with the determined success value.
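The priority-based variant described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the `Entry` fields, the convention that a higher number means higher priority, the assumed estimate that every command restores a "normal" state, and the fixed `step` adjustment are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    symptom: str              # reference symptom level, e.g. "overload"
    command: str              # fault restoration command
    priority: int             # higher value is tried first (assumed convention)
    expected: str = "normal"  # estimated state after execution (assumption)


def select(entries, reported_state):
    """Return the highest-priority entry matching the reported state, or None."""
    candidates = [e for e in entries if e.symptom == reported_state]
    return max(candidates, key=lambda e: e.priority) if candidates else None


def update(entry, subsequent_state, step=1):
    """Compare the estimate with the next report and adjust the priority."""
    if subsequent_state == entry.expected:
        entry.priority += step   # estimate confirmed: promote the command
    else:
        entry.priority -= step   # estimate missed: demote the command


# Hypothetical table: two commands mapped to the same symptom level.
table = [
    Entry("overload", "restart_service", priority=3),
    Entry("overload", "reboot_server", priority=2),
]

chosen = select(table, "overload")       # restart_service is tried first
update(chosen, "overload", step=2)       # fault persists, so it is demoted
next_choice = select(table, "overload")  # reboot_server now ranks higher
```

In this sketch a failed command simply drops below its alternative, so the order in which commands are tried adapts to past results.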
33 Claims
1. A fault recovery system comprising:
fault detecting means for transmitting a status report indicating an operating condition of a data processing apparatus;
command execution means for executing a command in said data processing apparatus; and
decision means, responsive to a status report from the fault detecting means, for selecting one of a plurality of fault restoration commands according to an operating condition indicated in the status report and according to a command selection order, and transmitting the selected command to said command execution means, said decision means being further responsive to a subsequent status report from said fault detecting means, for evaluating the result of execution of said transmitted command according to an operating condition indicated in the subsequent status report, and updating said command selection order according to the evaluated result.
2. A fault recovery system comprising:
fault detecting means for transmitting a status report indicating an operating condition of a data processing apparatus;
command execution means for executing a command in said data processing apparatus;
a store for defining a plurality of entries and mapping in each of said entries one of a plurality of reference fault conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of priority levels; and
decision means, responsive to a status report from said fault detecting means, for detecting one of said reference fault conditions which corresponds to a fault condition indicated in the status report, selecting one of said fault restoration commands according to the priority levels of the entries which correspond to the detected reference fault condition, transmitting the selected command to said command execution means, and estimating a result of the execution of the transmitted command;
said decision means being further responsive to a subsequent status report from said fault detecting means for making a comparison between said estimated result and a fault condition indicated in the subsequent status report and updating said priority levels according to a result of the comparison.
6. A fault recovery system comprising:
fault detecting means for transmitting a status report indicating a value of an operating condition of a data processing apparatus;
command execution means for executing a command in said data processing apparatus;
a store for defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of status variables; and
decision means, responsive to a first status report from said fault detecting means, for detecting one of said reference values which corresponds to a value indicated in the status report, making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value, selecting a fault restoration command from the entries in which the detected reference value is contained according to said estimated value, and transmitting the selected command to said command execution means, said decision means being further responsive to a second, subsequent status report from said fault detecting means for subtracting the value of the first status report from a value indicated in said second status report to produce a success value, and updating said calculated status variable with the success value.
10. A fault recovery system comprising:
fault detecting means for transmitting a status report indicating a value of an operating condition of a data processing apparatus;
command execution means for executing a command in said data processing apparatus;
a store for defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands, one of a plurality of priority levels, and one of a plurality of status variables; and
decision means, responsive to a first status report from said fault detecting means, for detecting one of said reference values which corresponds to a value indicated in the status report, making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value, selecting a fault restoration command from said entries in which the detected reference value is contained according to said estimated value and the priority levels of said entries, and transmitting the selected command to said command execution means, said decision means being further responsive to a second, subsequent status report from said fault detecting means for subtracting the value of the first status report from a value indicated in said second status report to produce a success value, and updating said calculated status variable with the success value.
14. A fault recovery method for a data processing apparatus to which fault detecting means is connected for transmitting a status report indicating an operating condition of the data processing apparatus and to which a command execution means is further connected for executing a command in said data processing apparatus, the method comprising the steps of:
a) responsive to a status report from the fault detecting means, selecting one of a plurality of fault restoration commands according to an operating condition indicated in the status report and according to a command selection order;
b) transmitting the selected command to said command execution means; and
c) responsive to a subsequent status report from said fault detecting means, evaluating the result of execution of said transmitted command according to an operating condition indicated in the subsequent status report, and updating said command selection order according to the evaluated result.
15. A fault recovery method for a data processing apparatus to which fault detecting means is connected for transmitting a status report indicating an operating condition of the data processing apparatus and to which a command execution means is further connected for executing a command in said data processing apparatus, the method comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference fault conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of priority levels;
b) responsive to a status report from said fault detecting means, detecting one of said reference fault conditions which corresponds to a fault condition indicated in the status report;
c) selecting one of said fault restoration commands according to the priority levels of the entries which correspond to the detected reference fault condition and transmitting the selected command to said command execution means;
d) estimating a result of the execution of the transmitted command; and
e) responsive to a subsequent status report from said fault detecting means for making a comparison between said estimated result and a fault condition indicated in the subsequent status report and updating said priority levels according to a result of the comparison.
18. A fault recovery method for a data processing apparatus to which fault detecting means is connected for transmitting a status report indicating an operating condition of the data processing apparatus and to which a command execution means is further connected for executing a command in said data processing apparatus, the method comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of status variables;
b) responsive to a first status report from said fault detecting means, detecting one of said reference values which corresponds to a value indicated in the status report;
c) making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value;
d) selecting a fault restoration command from the entries in which the detected reference value is contained according to said estimated value, and transmitting the selected command to said command execution means; and
e) responsive to a second, subsequent status report from said fault detecting means for subtracting the value of the first status report from a value indicated in said second status report to produce a success value and updating said calculated status variable with the success value.
21. A fault recovery method for a data processing apparatus to which fault detecting means is connected for transmitting a status report indicating an operating condition of the data processing apparatus and to which a command execution means is further connected for executing a command in said data processing apparatus, the method comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands, one of a plurality of priority levels, and one of a plurality of status variables;
b) responsive to a first status report from said fault detecting means, detecting one of said reference values which corresponds to a value indicated in the status report;
c) making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value;
d) selecting a fault restoration command from said entries in which the detected reference value is contained according to said estimated value and the priority levels of said entries and transmitting the selected command to said command execution means; and
e) responsive to a second, subsequent status report from said fault detecting means, subtracting the value of the first status report from a value indicated in said second status report to produce a success value, and updating said calculated status variable with the success value.
24. A storage medium containing a computer-executable program for recovering a data processing apparatus from failure, wherein fault detecting means is connected to the data processing apparatus for transmitting a status report indicating an operating condition of the data processing apparatus and wherein command execution means is further connected for executing a command in said data processing apparatus, the program comprising the steps of:
a) responsive to a status report from the fault detecting means, selecting one of a plurality of fault restoration commands according to an operating condition indicated in the status report and according to a command selection order;
b) transmitting the selected command to said command execution means; and
c) responsive to a subsequent status report from said fault detecting means, evaluating the result of execution of said transmitted command according to an operating condition indicated in the subsequent status report, and updating said command selection order according to the evaluated result.
25. A storage medium containing a computer-executable program for recovering a data processing apparatus from failure, wherein fault detecting means is connected to the data processing apparatus for transmitting a status report indicating an operating condition of the data processing apparatus and wherein command execution means is further connected for executing a command in said data processing apparatus, the program comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference fault conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of priority levels;
b) responsive to a status report from said fault detecting means, detecting one of said reference fault conditions which corresponds to a fault condition indicated in the status report;
c) selecting one of said fault restoration commands according to the priority levels of the entries which correspond to the detected reference fault condition and transmitting the selected command to said command execution means;
d) estimating a result of the execution of the transmitted command; and
e) responsive to a subsequent status report from said fault detecting means for making a comparison between said estimated result and a fault condition indicated in the subsequent status report and updating said priority levels according to a result of the comparison.
28. A storage medium containing a computer-executable program for recovering a data processing apparatus from failure, wherein fault detecting means is connected to the data processing apparatus for transmitting a status report indicating an operating condition of the data processing apparatus and wherein command execution means is further connected for executing a command in said data processing apparatus, the program comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands and one of a plurality of status variables;
b) responsive to a first status report from said fault detecting means, detecting one of said reference values which corresponds to a value indicated in the status report;
c) making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value;
d) selecting a fault restoration command from the entries in which the detected reference value is contained according to said estimated value, and transmitting the selected command to said command execution means; and
e) responsive to a second, subsequent status report from said fault detecting means for subtracting the value of the first status report from a value indicated in said second status report to produce a success value and updating said calculated status variable with the success value.
31. A storage medium containing a computer-executable program for recovering a data processing apparatus from failure, wherein fault detecting means is connected to the data processing apparatus for transmitting a status report indicating an operating condition of the data processing apparatus and wherein command execution means is further connected for executing a command in said data processing apparatus, the program comprising the steps of:
a) defining a plurality of entries and mapping in each of said entries one of a plurality of reference values of operating conditions of said data processing apparatus to one of a plurality of fault restoration commands, one of a plurality of priority levels, and one of a plurality of status variables;
b) responsive to a first status report from said fault detecting means, detecting one of said reference values which corresponds to a value indicated in the status report;
c) making an estimation of success or failure of command execution by calculating a sum of a status variable mapped to the detected reference value and the value of the first status report and comparing said sum with said detected reference value to produce an estimated value;
d) selecting a fault restoration command from said entries in which the detected reference value is contained according to said estimated value and the priority levels of said entries and transmitting the selected command to said command execution means; and
e) responsive to a second, subsequent status report from said fault detecting means, subtracting the value of the first status report from a value indicated in said second status report to produce a success value, and updating said calculated status variable with the success value.
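The status-variable variant recited in claims 6, 18 and 28 can be read, under some assumptions, as the sketch below. The claims do not say whether a higher reported value is better or worse, how the sum is "compared" with the reference value, or whether the status variable is overwritten or accumulated. The sketch assumes that lower values are healthier, that the comparison is a signed difference, that the command with the lowest estimate is selected, and that the success value overwrites the status variable. All names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    reference: int   # reference value of the operating condition
    command: str     # fault restoration command
    status: int      # status variable: last observed change (assumed meaning)


def estimate(entry, first_value):
    """Sum the status variable and the reported value, then compare with the reference."""
    return (entry.status + first_value) - entry.reference


def select(entries, reference, first_value):
    """Among entries holding the detected reference value, take the lowest estimate."""
    candidates = [e for e in entries if e.reference == reference]
    return min(candidates, key=lambda e: estimate(e, first_value))


def update(entry, first_value, second_value):
    """Success value = second report minus first report; store it in the entry."""
    entry.status = second_value - first_value
    return entry.status


# Hypothetical table: two commands mapped to the same reference value.
table = [
    Entry(reference=80, command="restart_service", status=-30),
    Entry(reference=80, command="clear_cache", status=-10),
]

first = 90
chosen = select(table, 80, first)        # restart_service has the lower estimate
success = update(chosen, first, 95)      # value rose from 90 to 95
next_choice = select(table, 80, first)   # clear_cache is now preferred
```

Under this reading the status variable acts as a memory of how much each command changed the reported value the last time it ran, so a command that made things worse is estimated to fail and is passed over.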
Specification