Identification of root cause for a transaction response time problem in a distributed environment
First Claim
1. A method for identifying a cause for a response time problem for a transaction in a distributed computing system that includes a central server and a plurality of subsystems, the method comprising:
- storing data at each subsystem relating to sub-transactions of transactions performed by the subsystems;
discovering a problem in connection with completion of a particular transaction;
identifying each subsystem of the plurality of subsystems involved in the particular transaction;
forwarding the data stored at each identified subsystem to the central server, the forwarded data including instance data relating to the sub-transaction of the particular transaction performed by the identified subsystems and aggregate data relating to sub-transactions of transactions performed by the identified subsystems; and
performing a Root-Cause Analysis using the forwarded aggregate data and the forwarded instance data to identify the subsystem that caused the problem with the particular transaction.
1 Assignment
0 Petitions
Accused Products
Abstract
Method and apparatus for identifying a cause for a response time problem for a transaction in a distributed computing system that includes a central server and a plurality of subsystems. Data is stored at each subsystem relating to sub-transactions of transactions performed by the subsystems. When a problem is discovered in connection with the completion of a particular transaction, each subsystem of the plurality of subsystems that was involved in the particular transaction is identified, and both instance data relating to all of the sub-transactions of the particular transaction stored at each identified subsystem and current hourly aggregate data stored at each identified subsystem is forwarded to the central server. Root-Cause Analysis is then performed using the forwarded instance data and aggregate data to identify the particular subsystem that caused the transaction problem.
48 Citations
20 Claims
-
1. A method for identifying a cause for a response time problem for a transaction in a distributed computing system that includes a central server and a plurality of subsystems, the method comprising:
-
storing data at each subsystem relating to sub-transactions of transactions performed by the subsystems;
discovering a problem in connection with completion of a particular transaction;
identifying each subsystem of the plurality of subsystems involved in the particular transaction;
forwarding the data stored at each identified subsystem to the central server, the forwarded data including instance data relating to the sub-transaction of the particular transaction performed by the identified subsystems and aggregate data relating to sub-transactions of transactions performed by the identified subsystems; and
performing a Root-Cause Analysis using the forwarded aggregate data and the forwarded instance data to identify the subsystem that caused the problem with the particular transaction. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product, comprising computer executable instructions embodied in a computer usable medium, for identifying a cause for a response time problem for a transaction in a distributed computing system that includes a central server and a plurality of subsystems, the computer program product comprising:
-
first instructions for storing data at each subsystem relating to sub-transactions of transactions performed by the subsystems;
second instructions for discovering a problem in connection with completion of a particular transaction;
third instructions for identifying each subsystem of the plurality of subsystems involved in the particular transaction;
fourth instructions for forwarding the data stored at each identified subsystem to the central server, the forwarded data including instance data relating to the sub-transaction of the particular transaction performed by the identified subsystems and aggregate data relating to sub-transactions of transactions performed by the identified subsystems; and
fifth instructions for performing a Root-Cause Analysis using the forwarded aggregate data and the forwarded instance data to identify the subsystem that caused the problem with the particular transaction. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. An apparatus for identifying a cause for a response time problem for a transaction in a distributed computing system that includes a central server and a plurality of subsystems, the apparatus comprising:
-
an agent at each subsystem for storing data relating to sub-transactions of transactions performed by the subsystems;
a mechanism for discovering a problem in connection with completion of a particular transaction;
an identifying mechanism for identifying each subsystem of the plurality of subsystems involved in the particular transaction;
a forwarding mechanism for forwarding the data stored at each identified subsystem to the central server, the forwarded data including instance data relating to the sub-transaction of the particular transaction performed by the identified subsystems and aggregate data relating to sub-transactions of transactions performed by the identified subsystems; and
an analyzer at the central server for performing a Root-Cause Analysis using the forwarded aggregate data and the forwarded instance data to identify the subsystem that caused the problem with the particular transaction. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification