Modularized collaborative performance issue diagnostic system
First Claim
1. A method, comprising:
- detecting, in a computing system, a performance issue in the computing system based on a set of rules or criteria;
in response to detecting the performance issue,analyzing, in the computing system, one or more corresponding system behaviors to identify a root cause for the detected performance issue in the computing system, wherein the analyzing one or more corresponding system behaviors includes;
receiving a query identifying the detected performance issue;
in response to receiving the query identifying the detected performance issue, determining a corresponding system behavior at a time of the detected performance issue based on an event log containing records of tasks performed in the computing system at various times;
based on the determined corresponding system behavior, issuing another query to determine another corresponding system behavior that causes the determined system behavior based on the event log; and
repeating the determining and issuing operations using the another corresponding system behavior until the root cause for the detected performance issue is identified; and
performing, in the computing system, a responsive action in response to the identified root cause for the detected performance issue.
1 Assignment
0 Petitions
Accused Products
Abstract
A collaborative diagnostic system monitors events in a system and identifies a causality chain from a detected performance issue to the root cause of that performance issue. The collaborative diagnostic system includes multiple issue detectors, multiple analysis core modules, and multiple scenario modules that work together to identify the causality chain and root cause. Each issue detector is a module or component that includes logic to detect known behaviors in the system, such as performance issues in the system. Each analysis core module includes logic to analyze and correlate low level system behavior within a conceptual area. Within each analysis core module are one or more diagnostic modules that are specific to that analysis core module to help determine what is happening in the system. Each scenario module includes logic to take an appropriate responsive action in response to the root cause of a performance issue being determined.
-
Citations
20 Claims
-
1. A method, comprising:
-
detecting, in a computing system, a performance issue in the computing system based on a set of rules or criteria; in response to detecting the performance issue, analyzing, in the computing system, one or more corresponding system behaviors to identify a root cause for the detected performance issue in the computing system, wherein the analyzing one or more corresponding system behaviors includes; receiving a query identifying the detected performance issue; in response to receiving the query identifying the detected performance issue, determining a corresponding system behavior at a time of the detected performance issue based on an event log containing records of tasks performed in the computing system at various times; based on the determined corresponding system behavior, issuing another query to determine another corresponding system behavior that causes the determined system behavior based on the event log; and repeating the determining and issuing operations using the another corresponding system behavior until the root cause for the detected performance issue is identified; and performing, in the computing system, a responsive action in response to the identified root cause for the detected performance issue. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method performed in a computing system having a processor executing multiple processes and providing multiple analysis modules individually configured to analyze and correlate system behaviors in the computing system, the method comprising:
-
detecting, in a computing system, a performance issue of executing the one or more processes in the computing system based on a set of rules or criteria; in response to detecting the performance issue in the computing system, issuing a query to an analysis module for identifying a system behavior related to an operation performed by one of the processes at a time of the detected performance issue in the computing system based on an event log having records individually containing metadata describing system events occurred in the computing system at various times; receiving, from the analysis module, the identified system behavior of the operation performed by one of the processes at the time of the detected performance issue in the computing system; based on the identified system behavior of the operation performed by the one of the processes, issuing another query to the analysis module or another analysis module for determining another system behavior of another operation that causes the identified system behavior; and repeating the issuing and receiving operations using the determined another system behavior to determine one or more additional system behaviors until a root cause for the detected performance issue is identified; and performing, in the computing system, a responsive action in response to the identified root cause for the detected performance issue. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
-
14. A computing system, comprising:
-
a processor; and a memory having instructions executable by the processor to provide multiple processes and analysis modules individually configured to analyze and correlate system behaviors in the computing system and to cause the computing system to; upon detecting, in the computing system, a performance issue of executing the one or more processes in the computing system based on a set of rules or criteria; issue a query to an analysis module for determining a corresponding system behavior at a time of the detected performance issue in the computing system based on an event log having records individually containing metadata regarding system events occurred in the computing system at various times; receive, from the analysis module, the determined corresponding system behavior; based on the received corresponding system behavior, issue another query to another analysis module to determine another corresponding system behavior that causes the determined system behavior; repeat the determining and issuing operations using the determined another corresponding system behavior until a root cause for the detected performance issue is identified; and perform, in the computing system, a responsive action in response to the identified root cause for the detected performance issue. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification