Hardware-error tolerant computing
Abstract
Embodiments include a computing system, a device, and a method. A computing system includes a processor subsystem having an adjustable operating parameter. The computing system also includes an information store operable to save a sequence of instructions. The computing system further includes a controller module. The controller module includes a monitor circuit for detecting an incidence of an operating-parameter-caused error corresponding to an execution of an instruction of the sequence of instructions by the processor subsystem. The controller further includes a control circuit for adjusting the adjustable operating parameter based upon an error-tolerant performance criterion.
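The division of labor described in the abstract (a monitor that detects errors, a controller that adjusts the operating parameter), together with the rollback to a checkpoint recited in the claims, can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the patented implementation: `SimulatedProcessor`, its linear error model, the `safe_voltage` threshold, and the fixed 0.01 back-off step are all hypothetical, and the simple back-off rule stands in for the claimed error-tolerant performance criterion rather than implementing it.

```python
import random
from dataclasses import dataclass


@dataclass
class SimulatedProcessor:
    """Toy stand-in for the processor subsystem (hypothetical model)."""
    voltage: float = 1.00        # the adjustable physical operating parameter
    safe_voltage: float = 0.90   # assumed: errors appear only below this value

    def error_probability(self) -> float:
        # Assumed model: error rate grows linearly as voltage drops
        # below safe_voltage, clamped to the range [0, 1].
        return min(1.0, max(0.0, (self.safe_voltage - self.voltage) * 5.0))


def run_with_recovery(instructions, cpu, rng, checkpoint_interval=4):
    """Run callables in order; roll back to the last checkpoint on an error.

    Returns (results, error_count).
    """
    results = []
    checkpoint = 0   # index of the first instruction after the last checkpoint
    errors = 0
    pc = 0
    while pc < len(instructions):
        if rng.random() < cpu.error_probability():
            # Monitor role: an operating-parameter-caused error is detected.
            errors += 1
            # Recovery role: discard work done since the checkpoint.
            del results[checkpoint:]
            pc = checkpoint
            # Control role (placeholder rule): raise voltage by a fixed step.
            cpu.voltage = min(cpu.voltage + 0.01, 1.0)
            continue
        results.append(instructions[pc]())
        pc += 1
        if pc % checkpoint_interval == 0:
            checkpoint = pc   # take a new checkpoint
    return results, errors
```

A caller might construct `SimulatedProcessor(voltage=0.80)`, pass a list of zero-argument callables and a seeded `random.Random`, and compare the returned results with an error-free run. In this sketch every rollback raises the voltage, so the error probability eventually reaches zero and the loop terminates.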
127 Citations
30 Claims
1. A computing system comprising:
a processor subsystem having an adjustable physical operating parameter;
an information store operable to save a sequence of instructions; and
a controller module including:
a monitor circuit for detecting an incidence of an operating-parameter-caused error corresponding to an execution of an instruction of the sequence of instructions by the processor subsystem;
a recovery circuit operable to rollback an execution of the sequence of instructions to a checkpoint in response to the detected operating-parameter-caused error; and
a control circuit for adjusting the adjustable physical operating parameter based upon an error-tolerant performance criterion corresponding to an incidence of the detected operating-parameter-caused error, to substantially minimize a time to complete an execution of the sequence of instructions by adjusting the adjustable physical operating parameter including adjusting a processor subsystem voltage in response to the error-tolerant performance criterion, wherein the error-tolerant performance criterion corresponds in substantial conformity to 0 = (Δ time to complete an execution of the sequence of instructions) divided by (Δ adjustable physical operating parameter).
Dependent claims: 2-20.

21. A method implemented in a computerized system, the method comprising:
detecting an incidence of a processor-operating-parameter-caused error corresponding to an execution of an instruction of a sequence of instructions by a processor subsystem having an adjustable physical operating parameter;
rollbacking the execution of the instruction of the sequence to a checkpoint in response to the detected incidence of the operating-parameter-caused error; and
changing the adjustable processor physical operating parameter based upon an error-tolerant performance criterion corresponding to an incidence of the detected operating-parameter-caused error, to substantially minimize a time required to successfully complete an execution of the sequence of instructions by adjusting the adjustable physical operating parameter including adjusting a processor subsystem voltage in response to the error-tolerant performance criterion, wherein the error-tolerant performance criterion corresponds in substantial conformity to 0 = (Δ time to complete an execution of the sequence of instructions) divided by (Δ adjustable physical operating parameter).
Dependent claims: 22-28.

29. A device comprising:
means for detecting an incidence of a processor-operating-parameter-caused error corresponding to an execution of an instruction of the sequence of instructions by a processor subsystem having an adjustable physical operating parameter;
means for changing the adjustable processor physical operating parameter based upon an error-tolerant performance criterion corresponding to an incidence of the detected operating-parameter-caused error, to substantially minimize a time required to successfully complete an execution of the sequence of instructions by adjusting the adjustable physical operating parameter including adjusting a processor subsystem voltage in response to the error-tolerant performance criterion, wherein the error-tolerant performance criterion corresponds in substantial conformity to 0 = (Δ time to complete an execution of the sequence of instructions) divided by (Δ adjustable physical operating parameter); and
means for restoring an execution of the sequence of program instructions to a checkpoint based upon the detected processor-operating-parameter-caused error in response to the detected incidence of the operating-parameter-caused error.

30. A method implemented in a computerized system that includes a processor system having an adjustable physical operating parameter, the method comprising:
detecting an incidence of a processor error corresponding to an execution by the processor system of an instruction of a sequence of instructions;
rollbacking the execution of the instruction of the sequence to a checkpoint in response to the detected incidence of the operating-parameter-caused error; and
changing the adjustable processor physical operating parameter in response to the detected incidence of error and in response to an error-tolerant performance criterion corresponding to an incidence of the detected operating-parameter-caused error, to substantially minimize a time required to successfully complete an execution of the sequence of instructions by adjusting the adjustable physical operating parameter, wherein the error-tolerant performance criterion corresponds in substantial conformity to 0 = (Δ time to complete an execution of the sequence of instructions) divided by (Δ adjustable physical operating parameter).

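The criterion recited in each independent claim, 0 = (Δ time to complete) divided by (Δ operating parameter), reads as a stationarity condition: the parameter sits where a small change in it no longer changes the completion time. The sketch below illustrates that reading numerically. The timing model `time_to_complete` (a/p for error-free run time plus b*p for rollback overhead), the finite-difference step, and the gain of 0.1 are illustrative assumptions, not taken from the patent. For this assumed model the ratio is zero at p = sqrt(a/b).

```python
import math


def time_to_complete(p, a=4.0, b=1.0):
    """Hypothetical model of completion time versus operating parameter p.

    a / p : assumed error-free run time, shrinking as p is pushed harder
    b * p : assumed rollback overhead, growing as errors become frequent
    """
    return a / p + b * p


def tune_parameter(measure, p, step=0.01, tolerance=1e-6, max_iters=10_000):
    """Adjust p until (delta time) / (delta parameter) is approximately zero.

    `measure` maps a parameter value to a completion time. The ratio is
    estimated with a central finite difference of half-width `step`.
    """
    for _ in range(max_iters):
        ratio = (measure(p + step) - measure(p - step)) / (2 * step)
        if abs(ratio) < tolerance:
            break            # criterion met: the ratio is substantially zero
        p -= 0.1 * ratio     # move against the slope; 0.1 is an assumed gain
    return p


if __name__ == "__main__":
    p_star = tune_parameter(time_to_complete, p=1.0)
    print(f"tuned parameter: {p_star:.4f}")
    print(f"closed form sqrt(a/b): {math.sqrt(4.0 / 1.0):.4f}")
```

In a real system the `measure` callable would be replaced by observed completion times, which are noisy, so a practical controller would likely need averaging or a coarser tolerance than this sketch uses.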
Specification