Integration of cluster information with root cause analysis tool
First Claim
1. A method comprising:
- obtaining a selected event from a plurality of events related to a cluster of nodes;
identifying a resource in the cluster of nodes, whereinthe resource is related to the selected event;
selecting a subset of the plurality of events, whereinthe selecting comprisesusing cluster configuration data for the cluster of nodes,identifying a filter related to the resource, andapplying the filter to the plurality of events, andthe identifying the filter is based on the cluster configuration data and the selected event; and
providing the subset of the plurality of events, whereinthe subset of the plurality of events is configured to be used in determining a cause of the selected event.
8 Assignments
0 Petitions
Accused Products
Abstract
A method, system, computer system, and computer-readable medium to perform root cause analysis of failure of an application program in a clustering environment. Upon the occurrence of a problem or event of interest, cluster configuration data can be obtained to provide a common context for events occurring at different software layers supporting an application. Diagnostic information produced by the different software layers can be obtained from various log files, which are typically in different formats and on different nodes in the cluster. The diagnostic information can be viewed in the context of the cluster and filtered to identify events related to the failure. The related events can be presented in a time-ordered sequence to assist in analysis of the event of interest. Patterns of events that led to the failure can be identified and documented for use in further problem analysis and for taking preventative and/or corrective measures.
-
Citations
19 Claims
-
1. A method comprising:
-
obtaining a selected event from a plurality of events related to a cluster of nodes; identifying a resource in the cluster of nodes, wherein the resource is related to the selected event; selecting a subset of the plurality of events, wherein the selecting comprises using cluster configuration data for the cluster of nodes, identifying a filter related to the resource, and applying the filter to the plurality of events, and the identifying the filter is based on the cluster configuration data and the selected event; and providing the subset of the plurality of events, wherein the subset of the plurality of events is configured to be used in determining a cause of the selected event. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a selected node among a cluster of nodes, the selected node comprising a processor; obtaining means for obtaining a selected event from a plurality of events related to the cluster of nodes, wherein the obtaining means is coupled to the processor; identifying means for identifying a resource in the cluster of nodes, wherein the resource is related to the selected event; selecting means for selecting a subset of the plurality of events, wherein the selecting means is coupled to the processor, the selecting means comprises means for using cluster configuration data for the cluster of nodes, the selecting means further comprises means for identifying a filter related to the resource, and means for applying the filter to the plurality of events, and the means for identifying the filter comprises means for determining the filter based on the cluster configuration data and the selected event; and providing means for providing the subset of the plurality of events, wherein the providing means is coupled to the processor, and the subset of the plurality of events is configured to be used in determining a cause of the selected event. - View Dependent Claims (9)
-
-
10. A computer-readable storage medium comprising:
-
obtaining instructions configured to obtain a selected event from a plurality of events related to a cluster of nodes; identifying instructions configured to identify a resource in the cluster of nodes, wherein the resource is related to the selected event; selecting instructions configured to select a subset of the plurality of events, wherein the selecting instructions comprise instructions for using cluster configuration data for the cluster of nodes, the selecting instructions comprise; instructions for identifying a filter related to the resource, and instructions for applying the filter to the plurality of events, and the instructions for identifying the filter comprise instructions for identifying the filter based on the cluster configuration data and the selected event; and providing instructions configured to provide the subset of the plurality of events, wherein the subset of the plurality of events is configured to be used in determining a cause of the selected event. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A system comprising:
-
a selected node among a cluster of nodes, the selected node comprising a processor; an identifying module, stored in a memory coupled to the processor, configured to identify a resource in the cluster of nodes, wherein the resource is related to the selected event; an obtaining module, stored in the memory coupled to the processor, configured to obtain a selected event from a plurality of events related to the cluster of nodes; a selecting module, stored in the memory coupled to the processor, configured to select a subset of the plurality of events, wherein the selecting module comprises a cluster configuration data use module configured to use cluster configuration data for the cluster of nodes, the selecting module comprises a filtering module configured to identify a filter related to the resource, and apply the filter to the plurality of events, and the filtering module is configured to identify the filter based on the cluster configuration data and the selected event; and a providing module, stored in the memory coupled to the processor, configured to provide the subset of the plurality of events, and the subset of the plurality of events is configured to be used in determining a cause of the selected event. - View Dependent Claims (16, 17, 18, 19)
-
Specification