×

Root cause detection and monitoring for storage systems

  • US 10,223,189 B1
  • Filed: 06/25/2015
  • Issued: 03/05/2019
  • Est. Priority Date: 06/25/2015
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • a host computing device configured to host one or more virtual computing device instances, the host computing device configured to transmit storage commands generated by the one or more virtual computing device instances via a communications network, the one or more virtual computing device instances executing on behalf of a client computing device;

    a storage processing service, executed on one or more storage computing devices, the storage processing service configured to;

    obtain a storage command request from a client computing device of the one or more virtual computing devices;

    process the storage command request to generate a storage command processing result associated with at least one storage volume, the storage volume associated with the one or more storage computing devices; and

    collect storage command metric information based at least in part on the storage command processing result; and

    a storage monitoring service, executed on one or more computing devices, configured to;

    obtain, from the storage processing service, the collected storage command metric information;

    process the collected storage command metric information for the at least one storage volume;

    identify a correlation relationship of a first storage volume and a second storage volume across the one or more storage computing devices, the correlation relationship further indicating a first fault of the least one storage volume and a second fault of a logical storage component, wherein the logical storage component includes a logical storage level of the first storage volume and the second storage volume including the at least one storage volume;

    identify, based at least in part on the identified correlation relationship, one or more faulty storage volumes among the one or more storage computing devices;

    obtaining suppression threshold information corresponding to at least one storage volume, the suppression threshold information indicating that a notification for a storage system issue is to be suppressed;

    determine that one of the one or more faulty storage volumes corresponds to the suppression threshold information; and

    suppress notifications regarding the one faulty storage volume.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×