Critical systems inspector
First Claim
1. A method, comprising:
- monitoring a plurality of services;
collecting performance data for each service included in the plurality of services;
identifying, by operation of one or more computer processors, a cluster of related services by analyzing the performance data and determining that each service included in the cluster of related services exhibits a statistically similar pattern of performance data with respect to one or more performance metrics, wherein a first service is included in the cluster of related services, by;
calculating a similarity value for at least one service included in the plurality of services that is indicative of a statistical similarity between the performance data of the service and at least one of the performance data of the first service, the performance data of a second service, and a predetermined statistical pattern, anddetermining that the similarity values calculated for each service included in the cluster of related services exceed a predetermined threshold amount of similarity; and
determining, by operation of one or more computer processors, that the first service is a root cause of the pattern of performance data for each service included in the cluster of related services by determining that each service included in the cluster of related services, other than the first service, depends, either directly or indirectly, on the first service.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are described for identifying a root cause of a pattern of performance data in a system including a plurality of services. Embodiments provide dependency information for each of the plurality of services, where at least one of the plurality of services is dependent upon a first one of the plurality of services. Each of the plurality of services is monitored to collect performance data for the respective service. Embodiments further analyze the performance data to identify a cluster of services that each follow a pattern of performance data. The first one of the services in the cluster of services is determined to be a root cause of the pattern of performance data, based on the determined dependency information for each of the plurality of services.
12 Citations
23 Claims
-
1. A method, comprising:
-
monitoring a plurality of services; collecting performance data for each service included in the plurality of services; identifying, by operation of one or more computer processors, a cluster of related services by analyzing the performance data and determining that each service included in the cluster of related services exhibits a statistically similar pattern of performance data with respect to one or more performance metrics, wherein a first service is included in the cluster of related services, by; calculating a similarity value for at least one service included in the plurality of services that is indicative of a statistical similarity between the performance data of the service and at least one of the performance data of the first service, the performance data of a second service, and a predetermined statistical pattern, and determining that the similarity values calculated for each service included in the cluster of related services exceed a predetermined threshold amount of similarity; and determining, by operation of one or more computer processors, that the first service is a root cause of the pattern of performance data for each service included in the cluster of related services by determining that each service included in the cluster of related services, other than the first service, depends, either directly or indirectly, on the first service. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer-readable medium including instructions that, when executed by a processor, cause the processor to perform one or more operations comprising:
-
monitoring a plurality of services; collecting performance data for each service included in the plurality of services; identifying, by operation of one or more computer processors, a cluster of related services by analyzing the performance data and determining that each service included in the cluster of related services exhibits a statistically similar pattern of performance data with respect to one or more performance metrics, wherein a first service is included in the cluster of related services, by; calculating a similarity value for at least one service included in the plurality of services that is indicative of a statistical similarity between the performance data of the service and at least one of the performance data of the first service, the performance data of a second service, and a predetermined statistical pattern, and determining that the similarity values calculated for each service included in the cluster of related services exceed a predetermined threshold amount of similarity; and determining, by operation of one or more computer processors, that the first service is a root cause of the pattern of performance data for each service included in the cluster of related services by determining that each service included in the cluster of related services, other than the first service, depends, either directly or indirectly, on the first service. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A system, comprising:
-
a processor; and a memory including instructions that, when executed by the processor, cause the processor to perform one or more operations comprising; monitoring a plurality of services; collecting performance data for each service included in the plurality of services; identifying, by operation of one or more computer processors, a cluster of related services by analyzing the performance data and determining that each service included in the cluster of related services exhibits a statistically similar pattern of performance data with respect to one or more performance metrics, wherein a first service is included in the cluster of related services, by; calculating a similarity value for at least one service included in the plurality of services that is indicative of a statistical similarity between the performance data of the service and at least one of the performance data of the first service, the performance data of a second service, and a predetermined statistical pattern, and determining that the similarity values calculated for each service included in the cluster of related services exceed a predetermined threshold amount of similarity; and determining that the first service is a root cause of the pattern of performance data for each service included in the cluster of related services by determining that each service included in the cluster of related services, other than the first service, depends, either directly or indirectly, on the first service. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
-
23. A method, comprising:
-
determining dependency information for each of a plurality of services, wherein at least one of the plurality of services is dependent upon a first one of the plurality of services; collecting performance data for each respective service of the plurality of services; analyzing, by operation of one or more computer processors, the performance data to identify a cluster of services that each follow a pattern of performance data; determining the first one of the services in the cluster of services is a root cause of the pattern of performance data, based on the determined dependency information for each of the plurality of services; upon determining the first service is the root cause of the pattern of performance data, receiving a request to determine services within the plurality of services that are being affected by the first service; and analyzing the performance data to identify one or more of the plurality of services that are affected by the first service determined to be the root cause of the pattern of performance data by; calculating a similarity value for each of the plurality of services, indicative of a statistical similarity between the respective service and the first service, and determining that the similarity values calculated for the one or more services exceed a predetermined threshold amount of similarity.
-
Specification