Methods and systems for performing root cause analysis
First Claim
1. A method of determining a cause of an occurrence in an information system made up of a plurality of monitored nodes, the method comprising:
- storing a plurality of rules, each said rule setting forth one or more conditions to be satisfied for indicating a particular cause of a particular occurrence;
receiving a first event message regarding a first event related to one of said nodes;
assigning a valid duration to the first event;
determining which of said rules have conditions corresponding to the first event;
calculating a matching ratio for any rules having conditions corresponding to the first event;
storing a matching state of conditions; and
specifying the cause of the occurrence based upon calculated matching ratios of said rules.
1 Assignment
0 Petitions
Accused Products
Abstract
A root cause analysis engine uses event durations and gradual deletion of events to improve analysis accuracy and reduce the number of required calculations. Matching ratios of relevant rules are recalculated every time notification of an event is received. The calculation results are held in a rule memory in the analysis engine. Each event has a valid duration, and when the duration has expired, that event is deleted from the rule memory. Events held in the rule memory can be deleted without affecting other events held in the rule memory. The analysis engine can then re-calculate the matching ratio of each rule by only performing the re-calculation with respect to affected rules related to the deleted event. The calculation cost can be reduced because analysis engine processes events incrementally or decrementally. Analysis engine can determine the most possible conclusion even if one or more condition elements were not true.
-
Citations
36 Claims
-
1. A method of determining a cause of an occurrence in an information system made up of a plurality of monitored nodes, the method comprising:
-
storing a plurality of rules, each said rule setting forth one or more conditions to be satisfied for indicating a particular cause of a particular occurrence; receiving a first event message regarding a first event related to one of said nodes; assigning a valid duration to the first event; determining which of said rules have conditions corresponding to the first event; calculating a matching ratio for any rules having conditions corresponding to the first event; storing a matching state of conditions; and specifying the cause of the occurrence based upon calculated matching ratios of said rules. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. An information system comprising:
-
a first computer having a first display, said first computer being in communication via a network with a plurality of monitored nodes in the information system; a plurality of rules accessible by said first computer, each said rule setting forth one or more conditions to be satisfied for indicating a cause when an occurrence takes place in one or more of said monitored nodes, wherein said first computer is configured to receive event messages regarding events related to occurrences at one or more of said monitored nodes and assign a valid duration to each event, wherein said first computer is configured to determine which of said rules have conditions corresponding to the events, calculate a matching ratio for any rules having conditions corresponding to the received events, and store a matching state of conditions, and wherein said first computer is configured to specify the cause of the occurrence based upon calculated matching ratios of said rules. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. A method of determining a cause of an occurrence in an information system made up of a plurality of monitored nodes, the method comprising:
-
storing a plurality of rules, each said rule setting forth one or more conditions and one conclusion; generating expanded rules based upon said rules and a topology of a monitored portion of the information system so that each condition of said expanded rules corresponds to an event which can occur in the monitored portion of the information system; instantiating a plurality of conditions of a plurality of said expanded rules without repeat as condition objects in a memory; instantiating a plurality of conclusions of a plurality of said expanded rules as conclusion objects in said memory; associating a plurality of said condition objects with a plurality of said conclusion objects in said memory based upon a structure of said expanded rules; and executing a process which specifies the cause of the occurrence by activating or deactivating one or more of said condition objects when an event which affects satisfaction of the one or more condition objects occurs.
-
-
36. An information system comprising:
-
a first computer in communication via a network with a plurality of monitored nodes in the information system; a plurality of rules accessible by said first computer, each said rule setting forth one or more conditions and one conclusion; wherein said first computer is configured to generate expanded rules based upon said rules and a topology of a monitored portion of the information system so that each condition of said expanded rules corresponds to an event which can occur in the monitored portion of the information system; wherein said first computer is configured to instantiate a plurality of conditions of a plurality of said expanded rules without repeating as condition objects in a memory; wherein said first computer is configured to instantiate a plurality of conclusions of a plurality of said expanded rules as conclusion objects in said memory; wherein said first computer is configured to associate a plurality of said condition objects with a plurality of said conclusion objects in said memory based upon a structure of said expanded rules; and wherein said first computer is configured to execute a process which specifies the cause of the occurrence by activating or deactivating one or more of said condition objects when an event which affects satisfaction of the one or more condition objects occurs.
-
Specification