Identifying correlated events in a distributed system according to operational metrics

  • US 10,270,668 B1
  • Filed: 03/23/2015
  • Issued: 04/23/2019
  • Est. Priority Date: 03/23/2015
  • Status: Expired due to Fees
First Claim

1. A system, comprising:

  • a plurality of computing nodes of a distributed system of a service provider that implements a plurality of network-based services of a provider network that provides the network-based services for multiple clients of the service provider, the plurality of network-based services comprising:

    a monitoring service to monitor other network-based services of the plurality of network-based services, the monitoring service configured to:

    collect data values for a plurality of operational metrics from the other network-based services, wherein the operational metrics indicate operation of the network-based services provided for the clients or operation of the distributed system as a whole;

    evaluate at least some of the data values for the operational metrics to determine one or more measures of correlation amongst the operational metrics;

    detect, based at least in part on a particular measure of correlation between two or more of the operational metrics exceeding a threshold value, a correlated event at the network services; and

    perform, based on the detected correlated event, a responsive action with respect to the correlated event.
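
The four steps recited in the claim (collect, evaluate, detect, perform) can be illustrated with a minimal sketch. The claim does not name a particular measure of correlation, so the Pearson coefficient used here is an assumption, as are the metric names, the data values, the threshold of 0.9, and the `respond` helper, which stands in for whatever responsive action a real monitoring service would take.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series (assumed measure)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no correlation signal
    return cov / sqrt(var_x * var_y)


def detect_correlated_events(metrics, threshold):
    """Return (metric_a, metric_b, r) for every pair whose correlation exceeds threshold."""
    events = []
    for a, b in combinations(sorted(metrics), 2):
        r = pearson(metrics[a], metrics[b])
        if r > threshold:
            events.append((a, b, r))
    return events


def respond(event):
    """Hypothetical responsive action: build a notification message."""
    a, b, r = event
    return f"correlated event: {a} ~ {b} (r={r:.2f})"


# Step 1: collected data values (made-up numbers from hypothetical services).
metrics = {
    "storage.latency_ms": [10, 12, 15, 30, 45, 60],
    "compute.error_rate": [0.1, 0.1, 0.2, 0.5, 0.8, 1.1],
    "queue.depth": [5, 3, 6, 4, 5, 3],
}

# Steps 2 and 3: evaluate pairwise correlation and detect threshold crossings.
events = detect_correlated_events(metrics, threshold=0.9)

# Step 4: perform a responsive action for each detected event.
messages = [respond(e) for e in events]
for message in messages:
    print(message)
```

With this sample data, only the latency and error-rate series rise together, so that pair alone is reported. A production system would likely evaluate correlation over sliding time windows rather than whole series, which the sketch omits for brevity.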
