Methods and systems of distributed tracing
First Claim
1. A method of distributed tracing in a distributed application, the method comprising:
- observing a plurality of messages sent and received among components of the distributed application, wherein the components of the distributed application communicate with each other through messages, and the plurality of messages is based on one or more user requests;
constructing, based on observing a subset of the plurality of messages, a call flow graph that represents causal relationships between processes of the components;
obtaining a description of one or more call stacks from one or more per-process tracers associated with the distributed application;
recording and matching the per-process tracer description with the corresponding observed messages and selecting appropriate ones of the recorded description;
constructing, based on the call flow graph and the selected description, a distributed call stack indicative of an execution flow across processes of the components in the distributed application; and
using the distributed call stack for analyzing and debugging of the distributed application.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and methods are provided for distributed tracing in a distributed application. In one embodiment, a method includes observing a plurality of messages sent and received among components of the distributed application, generating a probabilistic model of a call flow from observed messages of the distributed system, and constructing a call flow graph based on the probabilistic model for the distributed application. Distributed tracing may include observing messages by performing the subscription-based observation techniques and operations to receive, message traces describing messages being communicated among components of the distributed application. In this regard, the tracing service may merge message traces from different instrumentation points with message traces obtained by observing message queues to generate a probabilistic model and call flow graph. A distributed application developer or other relevant user may benefit in efficiently and effectively tracking down a cause of an abnormal behavior or analyzing performance profiles.
-
Citations
19 Claims
-
1. A method of distributed tracing in a distributed application, the method comprising:
-
observing a plurality of messages sent and received among components of the distributed application, wherein the components of the distributed application communicate with each other through messages, and the plurality of messages is based on one or more user requests; constructing, based on observing a subset of the plurality of messages, a call flow graph that represents causal relationships between processes of the components; obtaining a description of one or more call stacks from one or more per-process tracers associated with the distributed application; recording and matching the per-process tracer description with the corresponding observed messages and selecting appropriate ones of the recorded description; constructing, based on the call flow graph and the selected description, a distributed call stack indicative of an execution flow across processes of the components in the distributed application; and using the distributed call stack for analyzing and debugging of the distributed application. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system, comprising at least one processor and memory;
-
a plurality of service components of a distributed application, wherein the plurality of service components is executable on one or more processors, and components of the distributed application communicate with each other through messages; a message service component to provide communication between the plurality of service components of the distributed application; and a tracing service component configured to; observe a plurality of messages sent and received among components of the distributed application, wherein the components of the distributed application communicate with each other through messages, wherein one or more of the plurality of messages is a request and response message pair, and the plurality of messages is based on one or more user requests; construct, based on the observed plurality of messages, a call flow graph that represents causal relationships between processes of the components; obtain a description of one or more call stacks from one or more per-process tracers associated with the distributed application; record and match the per-process tracer description with the corresponding observed messages and select appropriate ones of the recorded description; construct, based on the call flow graph and the selected description, a distributed call stack indicative of an execution flow across processes of the components in the distributed application; and use the distributed call stack for analyzing and debugging of the distributed application. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
Specification