×

Dynamic scheduling of distributed storage management tasks using predicted system characteristics

  • US 10,168,953 B1
  • Filed: 05/20/2016
  • Issued: 01/01/2019
  • Est. Priority Date: 05/20/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method for scheduling storage management tasks over predicted user tasks in a distributed storage system, the method comprising:

  • receiving a set of historical stimulus records, comprising system task data records that characterize one or more system tasks that have been executed on the distributed storage system that comprises at least a first node and a second node, wherein the distributed storage system comprising a plurality of storage devices of a cluster, wherein any node distributed across the cluster of nodes that has a controller virtual machine utilizes its respective controller virtual machine to read and write to content on the plurality of storage devices in a storage pool;

    receiving, a set of historical response records comprising one or more system metrics associated with execution of the system tasks on the first node of the distributed storage system of the cluster, wherein a user task executed at the first node is observable at different nodes within the cluster, the one or more system metrics comprising a first portion that corresponds to measured metrics at the first node and a second portion that corresponds to results measured at a second node that are produced by executing the user task on the first node;

    generating a prediction model for the distributed storage system of the cluster based on a learning model formed from at least two stimulus records of the set of historical stimulus records and at least two response records of the set of historical response records;

    generating a set of forecasted user tasks predicted to be executed on the distributed storage system of cluster;

    applying the set of forecasted user tasks as new stimulus records to the prediction model to determine a set of forecasted system metrics for the distributed storage system of cluster, the set of forecasted system metrics being predicted to result from running the set of forecasted user tasks on the distributed storage system of cluster the distributed storage system of cluster; and

    selecting one or more distributed storage management tasks to be scheduled for execution on certain nodes of the distributed storage system of cluster based at least in part on a comparison between management task parameters and the set of forecasted system metrics, wherein the certain nodes of the distributed storage system are identified as being relevant to the one or more distributed storage management tasks.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×