Adaptive problem determination and recovery in a computer system
First Claim
1. A computer-based method for providing problem determination and error recovery features to a computing environment, the method comprising:
- receiving information regarding a status of the computing environment;
identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and
applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed.
3 Assignments
0 Petitions
Accused Products
Abstract
A method, computer program product, and data processing system for recognizing, tracing, diagnosing, and repairing problems in an autonomic computing system is disclosed. Rules and courses of actions to follow in logging data, in diagnosing faults (or threats of faults), and in treating faults (or threats of faults) are formulated using an adaptive inference and action system. The adaptive inference and action system includes techniques for conflict resolution that generate, prioritize, modify, and remove rules based on environment-specific information, accumulated time-sensitive data, actions taken, and the effectiveness of those actions. Thus, the present invention enables a dynamic, autonomic computing system to formulate its own strategy for self-administration, even in the face of changes in the configuration of the system.
-
Citations
63 Claims
-
1. A computer-based method for providing problem determination and error recovery features to a computing environment, the method comprising:
-
receiving information regarding a status of the computing environment;
identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and
applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A computer program product in a computer-readable medium comprising functional descriptive material that, when executed by a computer, enables the computer to perform acts including:
-
receiving information regarding a status of the computing environment;
identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and
applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. A data processing system comprising:
-
means for receiving information regarding a status of the computing environment;
means for identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and
means for applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed. - View Dependent Claims (44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63)
-
Specification