System and method for determining relevant computer performance events
First Claim
1. A method comprising, by a computer system:
monitoring, in real-time, end-user transactions that pass through an end-to-end transaction path comprising a plurality of transaction-path nodes;
responsive to the monitoring, identifying at least one transaction-path node of the plurality of transaction-path nodes as a problem node based, at least in part, on an analysis of end-to-end response times for a group of the monitored end-user transactions;
determining one or more event types that are related to the at least one transaction-path node based, at least in part, on the one or more event types being at least indirectly connected to the at least one transaction-path node in at least one of an abstract model and a concrete model;
wherein the abstract model comprises a first probabilistic graphical model that represents a set of monitored-resource types, a plurality of event types, and a probabilistic relevance of the monitored-resource types to the plurality of event types, the set of monitored-resource types comprising a monitored-resource type of the at least one transaction-path node, the plurality of event types comprising the one or more event types;
wherein the concrete model comprises a second probabilistic graphical model that represents a real-time topology of monitored resources, the plurality of event types, and a probabilistic relevance of the monitored resources to the plurality of event types such that the monitored resources are instances of the monitored-resource types, the monitored resources comprising the at least one transaction path node;
for each of the one or more event types, inferring a first event-relevance weight from the abstract model;
for each of the one or more event types, inferring a second event-relevance weight from the concrete model;
for each of the one or more event types, determining an event relevance based, at least in part, on the first event-relevance weight and the second event-relevance weight;
identifying a set of currently-firing events that correspond to one or more of the one or more event types;
identifying most-relevant events among the set of currently-firing events based, at least in part, on the determined event relevance; and
publishing the identified most-relevant events to a user to facilitate troubleshooting of the problem node.
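The claim does not state how the two weights are combined or how many events count as "most relevant". As a minimal sketch, assuming each model yields a weight between 0 and 1 per event type, and assuming a plain product as the combination rule, the weighting, filtering, and ranking steps could look like the following. The names `FiringEvent`, `combine_relevance`, `rank_firing_events`, and `top_k`, and all sample values, are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FiringEvent:
    event_id: str
    event_type: str


def combine_relevance(abstract_w, concrete_w):
    """Combine per-event-type weights from the two models.

    Assumption: a plain product. The claim only requires that the relevance
    be based, at least in part, on both weights.
    """
    return {
        etype: abstract_w[etype] * concrete_w[etype]
        for etype in abstract_w.keys() & concrete_w.keys()
    }


def rank_firing_events(firing, relevance, top_k=3):
    """Keep firing events whose type has a relevance score, highest first."""
    matched = [e for e in firing if e.event_type in relevance]
    matched.sort(key=lambda e: relevance[e.event_type], reverse=True)
    return matched[:top_k]


# Hypothetical weights inferred from the abstract and concrete models.
abstract_w = {"cpu_high": 0.8, "disk_latency": 0.5, "gc_pause": 0.4}
concrete_w = {"cpu_high": 0.5, "disk_latency": 0.9, "gc_pause": 0.25}
relevance = combine_relevance(abstract_w, concrete_w)

# Hypothetical set of currently-firing events.
firing = [
    FiringEvent("e1", "cpu_high"),
    FiringEvent("e2", "disk_latency"),
    FiringEvent("e3", "login_failed"),  # type not related to the problem node
]
most_relevant = rank_firing_events(firing, relevance, top_k=2)
print([e.event_id for e in most_relevant])
```

A weighted sum or a maximum would satisfy the claim language equally well; the product is used here only because it penalizes event types that either model considers weakly relevant.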
Abstract
In one embodiment, a method includes identifying at least one transaction-path node as a problem node based, at least in part, on an analysis of end-to-end response times for a group of transactions. The method further includes determining one or more event types for the at least one transaction-path node. Also, the method includes, for each of the one or more event types, inferring a first event-relevance weight from an abstract model. The method also includes, for each of the one or more event types, inferring a second event-relevance weight from a concrete model. Furthermore, the method includes, for each of the one or more event types, determining an event relevance based, at least in part, on the first event-relevance weight and the second event-relevance weight. Additionally, the method includes identifying most-relevant events among a set of active events based, at least in part, on the determined event relevance.
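The abstract leaves open what the "analysis of end-to-end response times" consists of. One possible reading, offered only as a sketch: select the transactions whose end-to-end time exceeds a threshold, then flag the node that accounts for the most time within that slow group. The function `find_problem_node`, the per-node timing layout, and the threshold are assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean


def find_problem_node(transactions, slow_threshold_ms):
    """Return the node with the largest mean latency among slow transactions.

    Each transaction maps node name -> time spent at that node (ms); the
    end-to-end response time is assumed to be the sum over nodes.
    Returns None if no transaction exceeds the threshold.
    """
    per_node = defaultdict(list)
    for txn in transactions:
        if sum(txn.values()) > slow_threshold_ms:
            for node, ms in txn.items():
                per_node[node].append(ms)
    if not per_node:
        return None
    return max(per_node, key=lambda node: mean(per_node[node]))


# Hypothetical per-node timings for three monitored transactions.
transactions = [
    {"web": 20, "app": 40, "db": 30},    # 90 ms end to end, acceptable
    {"web": 25, "app": 45, "db": 900},   # 970 ms, slow
    {"web": 30, "app": 300, "db": 700},  # 1030 ms, slow
]
problem_node = find_problem_node(transactions, slow_threshold_ms=500)
print(problem_node)
```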
20 Claims
1. A method comprising, by a computer system, the steps recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. An information handling system comprising:
a processor and memory, wherein the processor and memory in combination are operable to implement a method comprising:
monitoring, in real-time, end-user transactions that pass through an end-to-end transaction path comprising a plurality of transaction-path nodes;
responsive to the monitoring, identifying at least one transaction-path node of the plurality of transaction-path nodes as a problem node based, at least in part, on an analysis of end-to-end response times for a group of the monitored end-user transactions;
determining one or more event types that are related to the at least one transaction-path node based, at least in part, on the one or more event types being at least indirectly connected to the at least one transaction-path node in at least one of an abstract model and a concrete model;
wherein the abstract model comprises a first probabilistic graphical model that represents a set of monitored-resource types, a plurality of event types, and a probabilistic relevance of the monitored-resource types to the plurality of event types, the set of monitored-resource types comprising a monitored-resource type of the at least one transaction-path node, the plurality of event types comprising the one or more event types;
wherein the concrete model comprises a second probabilistic graphical model that represents a real-time topology of monitored resources, a plurality of event types, and a probabilistic relevance of the monitored resources to the plurality of event types such that the monitored resources are instances of the monitored-resource types, the monitored resources comprising the at least one transaction path node;
for each of the one or more event types, inferring a first event-relevance weight from the abstract model;
for each of the one or more event types, inferring a second event-relevance weight from the concrete model;
for each of the one or more event types, determining an event relevance based, at least in part, on the first event-relevance weight and the second event-relevance weight;
identifying a set of currently-firing events that correspond to one or more of the one or more event types;
identifying most-relevant events among the set of currently-firing events based, at least in part, on the determined event relevance; and
publishing the identified most-relevant events to a user to facilitate troubleshooting of the problem node.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
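The independent claims require that the related event types be "at least indirectly connected" to the problem node in the abstract model, the concrete model, or both. As a rough sketch, each model can be treated as an undirected graph and searched breadth-first from the problem node (or from its resource type). This ignores the probabilistic edge weights the claims describe; the hop limit `max_hops`, the `evt:` naming, and both sample graphs are assumptions made for illustration.

```python
from collections import deque


def related_event_types(graph, start, event_types, max_hops=2):
    """Breadth-first search for event types reachable from `start`.

    `graph` is an adjacency dict of an undirected model; `max_hops` is an
    assumed cutoff for how indirect a connection may be.
    """
    seen = {start}
    queue = deque([(start, 0)])
    found = set()
    while queue:
        node, hops = queue.popleft()
        if node in event_types:
            found.add(node)
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, hops + 1))
    return found


EVENT_TYPES = {"evt:slow_query", "evt:cpu_high", "evt:disk_full"}

# Concrete model: monitored resource instances (hypothetical topology).
concrete = {
    "db-01": ["host-7", "evt:slow_query"],
    "host-7": ["db-01", "evt:cpu_high"],
}
# Abstract model: monitored-resource types (hypothetical).
abstract = {
    "Database": ["Host", "evt:slow_query"],
    "Host": ["Database", "evt:cpu_high", "evt:disk_full"],
}

from_concrete = related_event_types(concrete, "db-01", EVENT_TYPES)
from_abstract = related_event_types(abstract, "Database", EVENT_TYPES)
# "At least one of" the two models: take the union of both results.
related = from_concrete | from_abstract
print(sorted(related))
```

In this example `evt:disk_full` is reachable only through the abstract model, which shows why the union of the two searches can be larger than either result alone.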
19. A computer-program product comprising a non-transitory computer-usable medium having computer-readable program code embodied therein, the computer-readable program code adapted to be executed by a processor to implement a method comprising:
monitoring, in real-time, end-user transactions that pass through an end-to-end transaction path comprising a plurality of transaction-path nodes;
responsive to the monitoring, identifying at least one transaction-path node of the plurality of transaction-path nodes as a problem node based, at least in part, on an analysis of end-to-end response times for a group of the monitored end-user transactions;
determining one or more event types that are related to the at least one transaction-path node based, at least in part, on the one or more event types being at least indirectly connected to the at least one transaction-path node in at least one of an abstract model and a concrete model;
wherein the abstract model comprises a first probabilistic graphical model that represents a set of monitored-resource types, a plurality of event types, and a probabilistic relevance of the monitored-resource types to the plurality of event types, the set of monitored-resource types comprising a monitored-resource type of the at least one transaction-path node, the plurality of event types comprising the one or more event types;
wherein the concrete model comprises a second probabilistic graphical model that represents a real-time topology of monitored resources, a plurality of event types, and a probabilistic relevance of the monitored resources to the plurality of event types such that the monitored resources are instances of the monitored-resource types, the monitored resources comprising the at least one transaction path node;
for each of the one or more event types, inferring a first event-relevance weight from the abstract model;
for each of the one or more event types, inferring a second event-relevance weight from the concrete model;
for each of the one or more event types, determining an event relevance based, at least in part, on the first event-relevance weight and the second event-relevance weight;
identifying a set of currently-firing events that correspond to one or more of the one or more event types;
identifying most-relevant events among the set of currently-firing events based, at least in part, on the determined event relevance; and
publishing the identified most-relevant events to a user to facilitate troubleshooting of the problem node.
Dependent claims: 20
Specification