Apparatus and method for analyzing and correlating events in a system using a causality matrix
Abstract
An apparatus and method are provided for efficiently determining the source of problems in a complex system based on observable events. The problem identification process is split into two separate activities: (1) generating efficient codes for problem identification and (2) decoding the problems at runtime. Various embodiments of the invention contemplate creating a causality matrix which relates observable symptoms to likely problems in the system, reducing the causality matrix into a minimal codebook by eliminating redundant or unnecessary information, monitoring the observable symptoms, and decoding problems by comparing the observable symptoms against the minimal codebook using various best-fit approaches. The minimal codebook also identifies the observable symptoms whose monitoring yields the greatest benefit compared with the others. By defining a distance measure between symptoms and codes in the codebook, the invention can tolerate lost or spurious symptoms without failure. Changing the radius of the codebook allows the ambiguity of problem identification to be adjusted easily. The invention also allows probabilistic and temporal correlations to be monitored.
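The decoding idea in the abstract can be illustrated with a small sketch. Everything named here is hypothetical: the problems `p1` to `p3`, the five symptoms, the bit patterns, and the `tolerance` parameter. Hamming distance is assumed as the distance measure, since the abstract does not fix a particular one. Each problem's code is the set of symptoms it is expected to produce, and decoding returns every problem whose code lies within the tolerance of the observed symptom vector.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical codebook: one code per problem, one bit per monitored symptom.
SYMPTOMS: Tuple[str, ...] = ("s1", "s2", "s3", "s4", "s5")
CODEBOOK: Dict[str, Tuple[int, ...]] = {
    "p1": (1, 1, 0, 0, 0),
    "p2": (0, 0, 1, 1, 0),
    "p3": (1, 0, 1, 0, 1),
}


def hamming(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions at which two equal-length symptom vectors differ."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(x != y for x, y in zip(a, b))


def decode(
    observed: Sequence[int],
    codebook: Dict[str, Tuple[int, ...]],
    tolerance: int,
) -> List[str]:
    """Return the problems whose code is within `tolerance` of the
    observed symptoms, closest match first (ties broken by name)."""
    scored = sorted(
        (hamming(code, observed), name) for name, code in codebook.items()
    )
    return [name for distance, name in scored if distance <= tolerance]


if __name__ == "__main__":
    # Symptom s2 was lost in transit; p1 is still the best fit.
    print(decode((1, 0, 0, 0, 0), CODEBOOK, tolerance=1))
```

With a tolerance of 0 only exact matches are reported, so a single lost or spurious symptom yields no diagnosis. Raising the tolerance admits more candidate problems, which is one way to picture the trade-off between robustness and ambiguity that the abstract describes.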
31 Claims
1. A computer-implemented method for use in analyzing events in a system having a plurality of components arranged in a particular configuration, each component belonging to one of a plurality of component classes, the method comprising the steps of:
(1) creating a first representation which defines a set of events which may occur for each class of components in the system independently of the particular configuration of the system;
(2) creating a second representation which defines propagations of events across one or more of the component classes in the system independently of the particular configuration of the system;
(3) creating a configuration specification for the system which defines instances of components specific to the configuration of the system;
(4) converting the first and second representations into a causality mapping based on the configuration specification, wherein the causality mapping comprises a mapping between events in the system and likely causes thereof; and
(5) storing the causality mapping resulting from step (4) in a computer storage device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
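Steps (1) through (4) of claim 1 can be pictured with a minimal sketch. The component classes (`Link`, `Router`, `Host`), event names, relationship names, and the data layout are all assumptions made for illustration; the claim does not prescribe any of them. The class-level events and propagation rules are written once, without reference to a topology, and are then instantiated over a specific configuration to yield a mapping from each event to the events that may cause it.

```python
from typing import Dict, List, Set, Tuple

Event = Tuple[str, str]  # (instance name, event name)

# (1) First representation: events that may occur for each class (hypothetical).
CLASS_EVENTS: Dict[str, List[str]] = {
    "Link": ["down"],
    "Router": ["unreachable"],
    "Host": ["timeout"],
}

# (2) Second representation: class-level propagation rules, written as
# (source class, source event, relationship, target class, target event).
PROPAGATIONS: List[Tuple[str, str, str, str, str]] = [
    ("Link", "down", "connects", "Router", "unreachable"),
    ("Router", "unreachable", "serves", "Host", "timeout"),
]

# (3) Configuration specification: instances and how they are related.
INSTANCES: Dict[str, str] = {
    "link1": "Link", "r1": "Router", "h1": "Host", "h2": "Host",
}
RELATIONS: List[Tuple[str, str, str]] = [
    ("link1", "connects", "r1"),
    ("r1", "serves", "h1"),
    ("r1", "serves", "h2"),
]


def build_causality_mapping(
    class_events: Dict[str, List[str]],
    propagations: List[Tuple[str, str, str, str, str]],
    instances: Dict[str, str],
    relations: List[Tuple[str, str, str]],
) -> Dict[Event, Set[Event]]:
    """(4) Instantiate the class-level rules over the configuration and
    return, for each event, the set of events that may cause it."""
    # Direct cause -> effects edges between instance-level events.
    direct: Dict[Event, Set[Event]] = {}
    for src, rel, dst in relations:
        for s_cls, s_ev, p_rel, t_cls, t_ev in propagations:
            if (instances[src], rel, instances[dst]) == (s_cls, p_rel, t_cls):
                direct.setdefault((src, s_ev), set()).add((dst, t_ev))

    all_events: List[Event] = [
        (inst, ev) for inst, cls in instances.items() for ev in class_events[cls]
    ]
    causes: Dict[Event, Set[Event]] = {event: set() for event in all_events}

    # Follow propagation chains so that indirect causes are recorded too.
    for root in all_events:
        stack = list(direct.get(root, ()))
        seen: Set[Event] = set()
        while stack:
            effect = stack.pop()
            if effect in seen or effect == root:
                continue
            seen.add(effect)
            causes.setdefault(effect, set()).add(root)
            stack.extend(direct.get(effect, ()))
    return causes


if __name__ == "__main__":
    mapping = build_causality_mapping(
        CLASS_EVENTS, PROPAGATIONS, INSTANCES, RELATIONS
    )
    for event, event_causes in mapping.items():
        print(event, "<-", sorted(event_causes))
```

Because the rules refer only to classes, adding a host to `INSTANCES` and `RELATIONS` changes the resulting mapping without touching `CLASS_EVENTS` or `PROPAGATIONS`, which is the separation the claim draws between the representations and the configuration specification.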
15. Apparatus for use in analyzing events in a system having a plurality of components arranged in a particular configuration, each component belonging to one of a plurality of component classes, the apparatus comprising:
means for converting (a) first representations which define a set of events which may occur for each class of components in the system independently of the particular configuration of the system, (b) second representations which define a propagation of events across one or more of the component classes in the system independently of the particular configuration of the system, and (c) a configuration specification for the system which defines instances of components specific to the configuration of the system into a causality mapping comprising a mapping between events in the system and likely causes thereof; and
means for storing the causality mapping.
Dependent claims: 16, 17, 18
19. A computer-implemented method for use in analyzing events in a system having a plurality of components arranged in a particular configuration, each component belonging to one of a plurality of component classes, the method comprising the steps of:
(1) defining a set of events which can occur for each class of components in the system independently of the particular configuration of the system;
(2) defining propagations of events across one or more of the component classes in the system independently of the particular configuration of the system;
(3) defining a configuration specification for the system which defines instances of components specific to the configuration of the system;
(4) in a computer, using the configuration specification to convert the defined set of events and the defined propagations of events into a causality representation comprising a mapping between events in the system and likely causes thereof; and
(5) storing the causality representation in a computer storage device.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27, 28
29. A machine programmed with a computer program which receives (i) a set of events which can occur for each of a plurality of classes of components in a system independently of any particular configuration of the system, (ii) a set of propagations of events across one or more component classes in the system independent of any particular configuration of the system, and (iii) a configuration specification which defines a particular system configuration, wherein the computer program converts the set of events and the set of propagations of events into a causality mapping on the basis of the particular system configuration, wherein the causality mapping comprises a mapping between events in the system and likely causes thereof.
31. A computer-implemented method for use in analyzing events in a system having a plurality of components arranged in a particular configuration, each component belonging to one of a plurality of component classes, the method comprising the steps of:
(1) defining a set of events which can occur for each class of components in the system independently of the particular configuration of the system;
(2) defining propagations of events across one or more of the component classes in the system independently of the particular configuration of the system;
(3) providing a configuration specification for the system which defines instances of components specific to the configuration of the system;
(4) in a computer, using the configuration specification provided in step (3) to convert the set of events defined in step (1) and the propagations of events defined in step (2) into a causality representation comprising a mapping between a first group of events in the system and a second group of events in the system, wherein each event in the second group is caused by one or more events in the first group; and
(5) storing the causality representation in a computer storage device.
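The two-group mapping and the storing step of claim 31 can be sketched as follows. The event names are hypothetical, and JSON on a local file is only an assumed storage format; the claim says nothing more specific than "a computer storage device". The sketch keeps the representation as a mapping from first-group events (causes) to the second-group events they cause, derives the reverse lookup that a diagnosis step would need, and writes the result to disk.

```python
import json
import os
import tempfile
from typing import Dict, List

# Hypothetical causality representation: each first-group event (a cause)
# maps to the second-group events it causes.
CAUSES_TO_EFFECTS: Dict[str, List[str]] = {
    "link1.down": ["r1.unreachable", "h1.timeout"],
    "r1.fan_failure": ["r1.unreachable"],
}


def invert(mapping: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Turn cause -> effects into effect -> sorted list of causes."""
    inverted: Dict[str, List[str]] = {}
    for cause, effects in mapping.items():
        for effect in effects:
            inverted.setdefault(effect, []).append(cause)
    return {effect: sorted(causes) for effect, causes in inverted.items()}


def store(mapping: Dict[str, List[str]], path: str) -> None:
    """Write the representation to a JSON file (assumed storage format)."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(mapping, handle, indent=2, sort_keys=True)


def load(path: str) -> Dict[str, List[str]]:
    """Read a representation previously written by `store`."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as folder:
        target = os.path.join(folder, "causality.json")
        store(invert(CAUSES_TO_EFFECTS), target)
        print(load(target))
```

Every key of the inverted mapping has at least one cause by construction, which mirrors the claim's requirement that each event in the second group is caused by one or more events in the first group.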
Specification