Prioritized repair of data storage failures
DCFirst Claim
1. A method for managing data storage over a network using a network computer that executes instructions that perform actions, comprising:
- if at least one repair event is associated with at least one new storage failure on a storage unit or a repair symbol unit, generating at least one new repair task that is associated with the at least one new storage failure, wherein the at least one new repair task is added to a task list that includes at least one other repair task for at least one current storage failure;
determining a priority value for each repair task in the task list based in part on a comparison of at least one characteristic value that is associated with each repair task in the task list, wherein a lower characteristic value generates a higher priority value over a higher characteristic value;
determining a resource budget based on a network bandwidth capacity for at least one different portion of the network, wherein the resource budget includes separate values for each of the different portions of the network;
promoting at least one repair task to be at least one new active repair task if the priority value for the at least one promoted repair task is higher than each other repair task and enough of the resource budget is available to execute the at least one new active repair task when each current working task is executing;
promoting at least one repair task to be the at least one new active repair task if a priority value for the at least one promoted task is higher than the priority level of at least one current active repair task, wherein each current active repair task is demoted to a repair task that has a lower priority value than the priority value for the at least one new active repair task until enough of the resource budget is released to execute the at least one new active repair task;
promoting the at least one repair task to the at least one new active repair task if a remainder of the resource budget enables execution of the at least one new active repair task when the priority value of the at least one new active repair task is lower than at least one other active repair task requiring other than the remainder to execute; and
executing each active repair task to repair at least one storage failure that is associated with the active repair task.
3 Assignments
Litigations
0 Petitions
Accused Products
Abstract
Embodiments are directed towards managing data storage that may experience a data failure. If a repair event is associated with a data storage failure, a new repair task may be generated and added to a task list. A priority value for each repair task in the task list may be determined based in part on the mean-time-to-data-loss (MTTDL) value associated with each repair task in the task list such that a lower MTTDL may indicate a higher priority value over a lower MTTDL. One or more repair tasks may be promoted to become active repair tasks based on the priority value the repair tasks such that the promoted repair tasks have a higher priority that than other repair tasks in the task list, if any. Each active repair task may be executed to repair one or more associated the storage failures.
-
Citations
26 Claims
-
1. A method for managing data storage over a network using a network computer that executes instructions that perform actions, comprising:
-
if at least one repair event is associated with at least one new storage failure on a storage unit or a repair symbol unit, generating at least one new repair task that is associated with the at least one new storage failure, wherein the at least one new repair task is added to a task list that includes at least one other repair task for at least one current storage failure; determining a priority value for each repair task in the task list based in part on a comparison of at least one characteristic value that is associated with each repair task in the task list, wherein a lower characteristic value generates a higher priority value over a higher characteristic value; determining a resource budget based on a network bandwidth capacity for at least one different portion of the network, wherein the resource budget includes separate values for each of the different portions of the network; promoting at least one repair task to be at least one new active repair task if the priority value for the at least one promoted repair task is higher than each other repair task and enough of the resource budget is available to execute the at least one new active repair task when each current working task is executing; promoting at least one repair task to be the at least one new active repair task if a priority value for the at least one promoted task is higher than the priority level of at least one current active repair task, wherein each current active repair task is demoted to a repair task that has a lower priority value than the priority value for the at least one new active repair task until enough of the resource budget is released to execute the at least one new active repair task; promoting the at least one repair task to the at least one new active repair task if a remainder of the resource budget enables execution of the at least one new active repair task when the priority value of the at least one new active repair task is lower than at least one other active repair task requiring other than the remainder to execute; and executing each active repair task to repair at least one storage failure that is associated with the active repair task. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system that is arranged for managing data storage over a network, comprising:
-
a network computer comprising; a transceiver that is operative to communicate over the network; a memory that is operative to store at least instructions; and a processor device that is operative to execute instructions that enable actions, including; if at least one repair event is associated with at least one new storage failure on a storage unit or a repair symbol unit, generating at least one new repair task that is associated with the at least one new storage failure, wherein the at least one new repair task is added to a task list that includes at least one other repair task for at least one current storage failure; determining a priority value for each repair task in the task list based in part on a comparison of at least one characteristic value that is associated with each repair task in the task list, wherein a lower characteristic value generates a higher priority value over a higher characteristic value; determining a resource budget based on a network bandwidth capacity for at least one different portion of the network, wherein the resource budget includes separate values for each of the different portions of the network; promoting at least one repair task to be at least one new active repair task if the priority value for the at least one promoted repair task is higher than each other repair task and enough of the resource budget is available to execute the at least one new active repair task when each current active repair task is executing; promoting at least one repair task to be the at least one new active repair task if a priority value for the at least one promoted task is higher than the priority level of at least one current active repair task, wherein each current active repair task is demoted to a repair task that has a lower priority value than the priority value for the at least one new active repair task until enough of the resource budget is released to execute the at least one new active repair task; promoting the at least one repair task to the at least one new active repair task if a remainder of the resource budget enables execution of the at least one new active repair task when the priority value of the at least one new active repair task is lower than at least one other active repair task requiring other than the remainder to execute; and executing each active repair task to repair at least one storage failure that is associated with the active repair task; and a client computer, comprising; a transceiver that is operative to communicate over the network; a memory that is operative to store at least instructions; and a processor device that is operative to execute instructions that enable actions, including; providing configuration information to the network computer. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A processor readable non-transitory storage media that includes instructions for managing data storage over a network, wherein execution of the instructions by a processor device enables actions, comprising:
-
if at least one repair event is associated with at least one new storage failure on a storage unit or a repair symbol unit, generating at least one new repair task that is associated with the at least one new storage failure, wherein the at least one new repair task is added to a task list that includes at least one other repair task for at least one current storage failure; determining a priority value for each repair task in the task list based in part on a comparison of at least one characteristic value that is associated with each repair task in the task list, wherein a lower characteristic value generates a higher priority value over a higher characteristic value; determining the resource budget based on a network bandwidth capacity for at least one different portion of the network, wherein the resource budget includes separate values for each of the different portions of the network; promoting at least one repair task to be at least one new active repair task if the priority value for the at least one promoted repair task is higher than each other repair task and enough of the resource budget is available to execute the at least one new active repair task when each current active repair task is executing; promoting at least one repair task to be the at least one new active repair task if a priority value for the at least one promoted task is higher than the priority level of at least one current active repair task, wherein each current active repair task is demoted to a repair task that has a lower priority value than the priority value for the at least one new active repair task until enough of the resource budget is released to execute the at least one new active repair task; promoting the at least one repair task to the at least one new active repair task if a remainder of the resource budget enables execution of the at least one new active repair task when the priority value of the at least one new active repair task is lower than at least one other active repair task requiring other than the remainder to execute; and executing each active repair task to repair at least one storage failure that is associated with the active repair task. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A network computer that is operative for managing data storage over a network, comprising:
-
a transceiver that is operative to communicate over a network; a memory that is operative to store at least instructions; and a processor device that is operative to execute instructions that enable actions, including; if at least one repair event is associated with at least one new storage failure on a storage unit or a repair symbol unit, generating at least one new repair task that is associated with the at least one new storage failure, wherein the at least one new repair task is added to a task list that includes at least one other repair task for at least one current storage failure; determining a priority value for each repair task in the task list based in part on a comparison of at least one characteristic value that is associated with each repair task in the task list, wherein a lower characteristic value generates a higher priority value over a higher characteristic value; determining a resource budget based on a network bandwidth capacity for at least one different portion of the network, wherein the resource budget includes separate values for each of the different portions of the network; promoting at least one repair task to be at least one new active repair task if the priority value for the at least one promoted repair task is higher than each other repair task and enough of the resource budget is available to execute the at least one new active repair task when each current active repair task is executing; promoting at least one repair task to be the at least one new active repair task if a priority value for the at least one promoted task is higher than the priority level of at least one current active repair task, wherein each current active repair task is demoted to a repair task that has a lower priority value than the priority value for the at least one new active repair task until enough of the resource budget is released to execute the at least one new active repair task; promoting the at least one repair task to the at least one new active repair task if a remainder of the resource budget enables execution of the at least one new active repair task when the priority value of the at least one new active repair task is lower than at least one other active repair task requiring other than the remainder to execute; and executing each active repair task to repair at least one storage failure that is associated with the active repair task. - View Dependent Claims (22, 23, 24, 25, 26)
-
Specification