Autonomic program error detection and correction
First Claim
1. A method for autonomically diagnosing and correcting error conditions in a computing system of interrelated components and resources, the method comprising the steps:
- for each one of the components, reporting error conditions in a log file using both uniform conventions for naming dependent ones of the interrelated components and resources and also a common error reporting format;
detecting error conditions arising from individual ones of the interrelated components;
responsive to detecting an error condition in a specific one of the components, parsing a log associated with said specific one of the components to determine whether said error condition arose from a fault in one of the interrelated components and resources named in said associated log, and further parsing a log associated with said one of the interrelated components and resources to identify a cause for said fault; and
, correcting said fault.
1 Assignment
0 Petitions
Accused Products
Abstract
An autonomic system for diagnosing and correcting error conditions among interrelated components and resources. The system can include one or more commonly formatted log files utilizing standardized naming conventions for the interrelated components and resources. Each of the commonly formatted log files can include an association with one of the interrelated components and resources. An autonomic system administrator can be coupled to each of the interrelated components and resources and can be configured to parse the log files to identify both error conditions arising in associated ones of the interrelated components and resources, and also dependent ones of the interrelated components and resources giving rise to the identified error conditions. Preferably, the autonomic system can further include a codebase of analysis code and code insertion logic coupled to the autonomic system administrator and programmed to insert portions of the analysis code in selected ones of the interrelated components and resources.
-
Citations
15 Claims
-
1. A method for autonomically diagnosing and correcting error conditions in a computing system of interrelated components and resources, the method comprising the steps:
-
for each one of the components, reporting error conditions in a log file using both uniform conventions for naming dependent ones of the interrelated components and resources and also a common error reporting format;
detecting error conditions arising from individual ones of the interrelated components;
responsive to detecting an error condition in a specific one of the components, parsing a log associated with said specific one of the components to determine whether said error condition arose from a fault in one of the interrelated components and resources named in said associated log, and further parsing a log associated with said one of the interrelated components and resources to identify a cause for said fault; and
,correcting said fault. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An autonomic system for diagnosing and correcting error conditions among interrelated components and resources comprising:
-
a plurality of commonly formatted log files utilizing standardized naming conventions for the interrelated components and resources, each of said commonly formatted log files having an association with one of the interrelated components and resources; and
,an autonomic system administrator coupled to each of the interrelated components and resources and configured to parse said log files to identify both error conditions arising in associated ones of the interrelated components and resources, and also dependent ones of the interrelated components and resources giving rise to the identified error conditions. - View Dependent Claims (8, 9)
-
-
10. A machine readable storage having stored thereon a computer program for autonomically diagnosing and correcting error conditions in a computing system of interrelated components and resources, the computer program comprising a routine set of instructions for causing the machine to perform the steps:
-
for each one of the components, reporting error conditions in a log file using both uniform conventions for naming dependent ones of the interrelated components and resources and also a common error reporting format;
detecting error conditions arising from individual ones of the interrelated components;
responsive to detecting an error condition in a specific one of the components, parsing a log associated with said specific one of the components to determine whether said error condition arose from a fault in one of the interrelated components and resources named in said associated log, and further parsing a log associated with said one of the interrelated components and resources to identify a cause for said fault; and
, correcting said fault. - View Dependent Claims (11, 12, 13, 14, 15)
-
Specification