Performance monitoring in distributed systems using synchronized clocks and distributed event logs
First Claim
1. A method of performance monitoring in a distributed system, comprising the steps of:
- determining a set of significant events that are to be monitored in a distributed application in the distributed system;
providing each of a set of nodes applications associated with the significant events with a recorder function which when called generates a record that identifies the corresponding significant event and includes a time-stamp for the corresponding significant event obtained from a corresponding synchronized clock;
running an experiment in the distributed application that generates one or more of the significant events;
obtaining the records from the node applications and analyzing the records.
2 Assignments
0 Petitions
Accused Products
Abstract
A distributed system that provides performance monitoring capability across multiple nodes of a distributed application. A distributed system according to the present techniques includes a set of nodes that communicate via a network. A distributed application is performed by a set of cooperating node applications executing in the nodes. The distributed system implements techniques for generating time-stamp records for each of a set of significant events associated with one or more of the node applications. The time-stamp records provides a synchronized time base across the nodes for the significant events thereby enabling temporal ordering of the significant events.
103 Citations
19 Claims
-
1. A method of performance monitoring in a distributed system, comprising the steps of:
-
determining a set of significant events that are to be monitored in a distributed application in the distributed system;
providing each of a set of nodes applications associated with the significant events with a recorder function which when called generates a record that identifies the corresponding significant event and includes a time-stamp for the corresponding significant event obtained from a corresponding synchronized clock;
running an experiment in the distributed application that generates one or more of the significant events;
obtaining the records from the node applications and analyzing the records. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A distributed application, comprising:
-
a set of node applications which communicate via a network, each node application having at least one function which is associated with a significant event in the distributed application and further having a recorder function which when called by the function obtains a time-stamp for the significant event from a corresponding synchronized clock;
event log for each node application, each event log for holding a record of the corresponding significant event and time-stamp such that the time-stamps provide a synchronized time base in the distributed application for evaluating the significant events. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
Specification