Apparatus and method for event correlation and problem reporting
First Claim
2. A computer readable media containing code thereon, said code providing instructions to a computer system to determine the effects of one or more events in a system of managed components by executing the steps of:
- creating one or more configuration non-specific representations of types of managed components;
creating one or more configuration non-specific representations of events of said types of managed components;
creating configuration non-specific representations of relations along which the events and/or effects of said events propagate amongst the types of managed components, producing a data structure for determining the effects of an event by combining one or more of said configuration non-specific representations based on information of specific instances of managed components in the system; and
utilizing said data structure to determine the corresponding effects on one or more managed components caused by the one or more events.
9 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method on a computer readable media is provided for determining the source of a problem in a complex system of managed components based upon symptoms. The problem source identification process is split into different activities. Explicit configuration non-specific representations of types of managed components, their problems, symptoms and the relations along which the problems or symptoms propagate are created that can be manipulated by executable computer code. A data structure is produced for determining the source of a problem by combining one or more of the representations based on information of specific instances of managed components in the system. Computer code is then executed which uses the data structure to determine the source of the problem from one or more symptoms.
52 Citations
29 Claims
-
2. A computer readable media containing code thereon, said code providing instructions to a computer system to determine the effects of one or more events in a system of managed components by executing the steps of:
-
creating one or more configuration non-specific representations of types of managed components;
creating one or more configuration non-specific representations of events of said types of managed components;
creating configuration non-specific representations of relations along which the events and/or effects of said events propagate amongst the types of managed components, producing a data structure for determining the effects of an event by combining one or more of said configuration non-specific representations based on information of specific instances of managed components in the system; and
utilizing said data structure to determine the corresponding effects on one or more managed components caused by the one or more events. - View Dependent Claims (3, 4, 5, 6, 7)
-
-
8-1. The computer readable media as recited in claim 9, wherein the mismatch measure possesses a different weight to absence of a symptom data value than to presence of a symptom data value.
-
9. A computer readable media containing code thereon, said code providing instructions to a computer system for detecting in a distributed system problems among a plurality of problems which generate a plurality of symptoms, said problems and symptoms being mapped in a matrix of values by executing the steps of:
-
monitoring a plurality of symptom data values representing said plurality of symptoms;
determining a mismatch measure between each of a plurality of groups of said values in said matrix and said plurality of symptom data values; and
selecting one of said plurality of problems corresponding to one of said plurality of groups having the smallest mismatch measure. - View Dependent Claims (1, 8, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 27, 28, 29)
-
-
15. The computer readable media wherein the mismatch measure is a predetermined value.
-
22. A computer readable media containing code thereon, said code providing instructions to a computer system for analyzing events in a system by executing the steps of:
-
(1) partitioning a system domain representative of the system into a plurality of subdomains, each of said subdomains generating domain events, each domain event comprising one of the events in the system;
(2) providing a matrix of values for each subdomain, wherein each value corresponds to a mapping between one of said domain events and one of a plurality of likely other events in said system;
(3) monitoring for each subdomain event data values representing said domain events generated by said subdomain;
(4) determining for selected ones of said subdomains a mismatch measure between each of said matrix of values and said event data values for said subdomain and selecting the event having the smallest mismatch measure as the most likely cause event; and
(5) combining said selected mostly likely cause event in each of said selected subdomains to determine one or more likely cause events in said system. - View Dependent Claims (23, 24, 25, 26)
-
Specification