Method and apparatus for identifying problem causes in a multi-node system
First Claim
1. A machine implemented method of determining a cause of a problem with a service level objective that is represented by a model that includes nodes that represent elements in a system that are used to fulfill the service level objective and information that represents dependencies between the elements, the method comprising:
- receiving telemetry information that describes a condition of at least one element in the system, wherein the telemetry information can be applied to a particular procedure associated with a particular node in the model to determine if there is a problem associated with the particular element represented by the particular node;
applying at least a portion of the telemetry information to ones of the procedures to determine problem cause information describing which of the elements has a problem cause relating to the service level objective; and
determining relative contributions of the elements to a problem associated with the service level objective by analyzing the problem cause information and the dependencies between the elements.
2 Assignments
0 Petitions
Accused Products
Abstract
An SLO (service level objective) is represented by a model that includes nodes that represent elements in a system that are used to fulfill the SLO and information that represents dependencies between the elements. Telemetry information is received describing a condition of an element in the system. The telemetry information can be applied to a particular procedure associated with a particular node in the model to determine if there is a problem associated with the element represented by the particular node. At least a portion of the telemetry information is applied to procedures to determine problem cause information describing which elements have problems relating to the SLO. A relative contribution of elements to a problem associated with the SLO is determined by analyzing the problem cause information and the dependencies between the elements.
112 Citations
26 Claims
-
1. A machine implemented method of determining a cause of a problem with a service level objective that is represented by a model that includes nodes that represent elements in a system that are used to fulfill the service level objective and information that represents dependencies between the elements, the method comprising:
-
receiving telemetry information that describes a condition of at least one element in the system, wherein the telemetry information can be applied to a particular procedure associated with a particular node in the model to determine if there is a problem associated with the particular element represented by the particular node;
applying at least a portion of the telemetry information to ones of the procedures to determine problem cause information describing which of the elements has a problem cause relating to the service level objective; and
determining relative contributions of the elements to a problem associated with the service level objective by analyzing the problem cause information and the dependencies between the elements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A machine readable medium having stored thereon a data structure describing a model that includes nodes that represent elements in a system that are used to fulfill a service level objective and information that represents dependencies between the elements, wherein the machine readable medium comprises a set of instructions which, when executed by one or more processors, causes the one or more processors to perform the following operations:
-
receive telemetry information that describes a condition of at least one element in the system, wherein the telemetry information can be applied to a particular procedure associated with a particular node in the model to determine if there is a problem associated with the particular element represented by the particular node;
apply at least a portion of the telemetry information to ones of the procedures to determine problem cause information describing which of the elements has a problem cause relating to the service level objective; and
determine relative contributions of the elements to a problem associated with the service level objective by analyzing the problem cause information and the dependencies between the elements. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
Specification