Fault resilient/fault tolerant computing
Abstract
Data transfer to computing elements is synchronized in a computer system that includes the computing elements and controllers that provide data from data sources to the computing elements. A request for data made by a computing element is intercepted and transmitted to the controllers. At least a first controller responds by transmitting requested data to the computing element and by indicating how a second controller will respond to the intercepted request.
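The request path described in the abstract can be illustrated with a minimal sketch. Everything named here (`Controller`, `Response`, `intercept_and_forward`, the `peer_will_send_data` flag) is hypothetical and not taken from the patent. In particular, the sketch assumes that "indicating how a second controller will respond" means stating whether the second controller will also send data, which is only one possible reading.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Response:
    controller: str                      # name of the responding controller
    data: Optional[bytes]                # requested data, or None if none is sent
    peer_will_send_data: Optional[bool]  # first controller's statement about its peer


class Controller:
    """Hypothetical controller backed by an in-memory data source."""

    def __init__(self, name: str, store: dict):
        self.name = name
        self.store = store

    def respond(self, key: str, is_first: bool) -> Response:
        if is_first:
            # Assumption: the first controller supplies the data and announces
            # that the second controller will acknowledge without sending data.
            return Response(self.name, self.store[key], peer_will_send_data=False)
        # The second controller acknowledges only and makes no statement.
        return Response(self.name, None, peer_will_send_data=None)


def intercept_and_forward(key: str, controllers: list) -> tuple:
    """Intercept a computing element's request and send it to every controller."""
    responses = [c.respond(key, is_first=(i == 0)) for i, c in enumerate(controllers)]
    first, second = responses[0], responses[1]
    # Check that the second controller behaved as the first one announced.
    consistent = first.peer_will_send_data == (second.data is not None)
    return first.data, consistent
```

With two controllers sharing the same store, `intercept_and_forward("block7", [ioc1, ioc2])` returns the stored bytes together with a flag showing whether the second controller's behavior matched the first controller's announcement.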
20 Claims
1. A method of handling faults in a computer system, the computer system including computing elements, controllers that provide data from data sources to the computing elements, error reporting elements and error processing elements, the method comprising:
intercepting a request for data by a computing element;
transmitting the intercepted request to the controllers;
having at least one of the controllers respond by transmitting the requested data to the computing element;
detecting, through an error reporting element that comprises a computing element or a controller, an error condition and transmitting information about the error condition as an error message to error processing elements connected to the error reporting element, the error processing elements including at least two of the controllers, and retransmitting the error message, through at least one error processing element, to other error processing elements connected to the at least one error processing element.
2. A method of handling faults in a computer system, the computer system including computing elements, controllers that provide data from data sources to the computing elements, error reporting elements and error processing elements, the method comprising:
intercepting a request for data by a computing element;
transmitting the intercepted request to the controllers;
having at least one of the controllers respond by transmitting the requested data to the computing element;
detecting, through error reporting elements that comprise the computing elements or the controllers, an error condition and transmitting information about the error condition as error messages to error processing elements connected to the error reporting elements, the error processing elements including at least two of the controllers, and combining, through at least one error processing element, information from related error messages from multiple error reporting elements and using the combined information in identifying a source of the error condition.
Dependent claims: 3, 4, 5, 8, 9, 10, 11, 12
6. A computer system including:
computing elements, controllers that provide data from data sources to the computing elements, error reporting elements that include the computing elements, and error processing elements that include at least two of the controllers, wherein:
an error reporting element is configured to detect an error condition and transmit information about the error condition as an error message to error processing elements connected to the error reporting element, and at least one error processing element is configured to retransmit the error message to the other error processing elements connected to the at least one error processing element.
Dependent claims: 16, 17, 18, 19, 20
7. A computer system including:
computing elements, controllers that provide data from data sources to the computing elements, error reporting elements that include the computing elements or the controllers, and error processing elements that include at least two of the controllers, wherein:
error reporting elements are configured to detect an error condition and transmit information about the error condition as error messages to error processing elements connected to the error reporting elements, and at least one error processing element is configured to combine information from related error messages from multiple error reporting elements and use the combined information in identifying a source of the error condition.
Dependent claims: 13, 14, 15
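The error handling recited in the independent claims (retransmitting an error message among connected error processing elements, and combining related messages to identify a source) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the names `ErrorMessage`, `ErrorProcessor`, `msg_id` and `suspect` are hypothetical, duplicate suppression by message id is an assumed mechanism, and the majority vote used to combine messages is a stand-in for whatever combination rule the specification actually describes.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ErrorMessage:
    msg_id: int    # hypothetical unique id, used to suppress duplicates
    reporter: str  # error reporting element that detected the condition
    suspect: str   # element the reporter believes is at fault


@dataclass
class ErrorProcessor:
    """Hypothetical error processing element (for example, a controller)."""
    name: str
    neighbors: list = field(default_factory=list)  # connected error processors
    seen: dict = field(default_factory=dict)       # msg_id -> ErrorMessage

    def receive(self, msg: ErrorMessage) -> None:
        if msg.msg_id in self.seen:
            return  # already handled, so the retransmission stops here
        self.seen[msg.msg_id] = msg
        # Retransmit to every connected error processing element.
        for peer in self.neighbors:
            peer.receive(msg)

    def identify_source(self):
        # Assumed combination rule: the most frequently named suspect across
        # all related messages is taken to be the source of the error.
        votes = Counter(m.suspect for m in self.seen.values())
        return votes.most_common(1)[0][0] if votes else None
```

For example, with three processors connected in a chain, a message delivered to either end reaches all three, and each processor can then call `identify_source()` on the same combined set of messages.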
Specification