AUTOMATED TEST AND REPAIR METHOD AND APPARATUS APPLICABLE TO COMPLEX, DISTRIBUTED SYSTEMS
First Claim
1. A system to automatically test and repair a complex, distributed target system including hardware and software, the automated test and repair system comprising:
a knowledge base memory storing information about the target system, including information about the network topology of the target system, system events and system faults;
one or more computer processors including specialized hardware and software implementing a system status module, a decision module, and a user interface module, all modules being in operative communication with the knowledge base memory;
a communications interface between the target system and system status module enabling the system status module to detect faults in the target system, determine the underlying cause or causes of a fault, and predict potential future faults in the target system based upon information stored in the knowledge base memory;
a decision module in operative communication with the system status module enabling the decision module to identify an appropriate response to a fault detected by the system status module, the response potentially including an automated repair of the fault depending upon the severity of the fault; and
a user interface module in operative communication with the decision module, the user interface module including a display presenting repair actions taken by the decision module.
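The module arrangement recited in claim 1 can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the class names, the dependency-map form of the topology, the three severity levels, and in particular the policy that faults at or below a configurable severity are repaired automatically while more severe faults are escalated. The claim only says that automated repair depends on severity; it does not say in which direction.

```python
from dataclasses import dataclass, field
from enum import IntEnum


class Severity(IntEnum):
    # Hypothetical severity scale; the claim does not define one.
    LOW = 1
    MEDIUM = 2
    HIGH = 3


@dataclass
class Fault:
    component: str
    severity: Severity


@dataclass
class KnowledgeBase:
    # Assumed layout: topology maps a component to the components it depends on.
    topology: dict = field(default_factory=dict)
    events: list = field(default_factory=list)
    faults: list = field(default_factory=list)


class SystemStatusModule:
    def __init__(self, kb: KnowledgeBase) -> None:
        self.kb = kb

    def detect(self, failing: dict) -> list:
        """Record one Fault per failing component (name -> Severity)."""
        faults = [Fault(name, sev) for name, sev in failing.items()]
        self.kb.faults.extend(faults)
        return faults

    def root_causes(self, component: str, failing: dict) -> list:
        """Follow failing dependencies of a failing component; a failing
        node with no failing dependencies is treated as a root cause."""
        roots, seen, stack = [], set(), [component]
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            bad_deps = [d for d in self.kb.topology.get(node, []) if d in failing]
            if bad_deps:
                stack.extend(bad_deps)
            else:
                roots.append(node)
        return sorted(roots)


class DecisionModule:
    def __init__(self, auto_repair_max: Severity = Severity.MEDIUM) -> None:
        # Assumed policy: auto-repair up to this severity, escalate above it.
        self.auto_repair_max = auto_repair_max
        self.actions = []

    def respond(self, fault: Fault) -> str:
        if fault.severity <= self.auto_repair_max:
            action = f"auto-repair {fault.component}"
        else:
            action = f"escalate {fault.component} to operator"
        self.actions.append(action)
        return action


def render_actions(decision: DecisionModule) -> str:
    """Stand-in for the user interface display: a numbered list of actions."""
    return "\n".join(f"{i}. {a}" for i, a in enumerate(decision.actions, 1))
```

For example, with a topology in which `web` depends on `db` and `db` depends on `disk`, and with `web` and `db` both failing, `root_causes("web", ...)` points at `db`, and the decision module repairs the low-severity `web` fault while escalating the high-severity `db` fault.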
Abstract
An intelligent system for automatically monitoring, diagnosing, and repairing complex hardware and software systems is presented. A number of functional modules enable the system to collect relevant data from both hardware and software components, analyze the incoming data to detect faults, further monitor sensor data and historical knowledge to predict potential faults, determine an appropriate response to fix the faults, and finally automatically repair the faults when appropriate. The system leverages both software and hardware modules to interact with the complex system being monitored. Additionally, the lessons learned on one system can be applied to better understand events occurring on the same or similar systems.
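The abstract mentions predicting potential faults from sensor data and historical knowledge without naming a technique. One simple possibility, offered only as an assumption, is to extrapolate a sensor trend linearly toward a threshold at which faults were observed in the past. The function name, the threshold parameter, and the choice of linear extrapolation are all hypothetical.

```python
from typing import Optional, Sequence


def predict_steps_to_fault(samples: Sequence[float],
                           fault_threshold: float) -> Optional[float]:
    """Estimate how many sampling steps remain before the reading reaches
    fault_threshold, assuming the recent trend continues linearly.

    fault_threshold stands in for "historical knowledge": a level at which
    faults were seen before. Returns None when there are too few samples or
    the reading is not rising toward the threshold.
    """
    if len(samples) < 2:
        return None
    if samples[-1] >= fault_threshold:
        return 0.0  # already at or past the historical fault level
    # Average change per step over the observed window.
    slope = (samples[-1] - samples[0]) / (len(samples) - 1)
    if slope <= 0:
        return None
    return (fault_threshold - samples[-1]) / slope
```

A status module could call this on each monitored sensor and raise a predicted-fault event when the estimate drops below some lead time chosen by the operator.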
8 Claims
Specification