Estimating relative data importance in a dispersed storage network
Abstract
Methods for use in a dispersed storage network (DSN) to determine relative importance of data objects stored in the DSN for use in making prioritization decisions relating to necessary rebuilding of data. In various examples, relative importance rankings are generated based on heuristic measures and explicit ranking factors. The heuristic measures, which may be updated periodically, may include the frequency of access requests relating to a data object, the number of unique requestors of a data object, the age of a data object, the size of a data object, the amount of other data referenced by a data object, and a predicted amount of other data that would be rendered useless or inaccessible should a data object be lost. The explicit ranking factors may include a storage vault identifier and an importance marker associated with a data object. Higher priority data is preferably rebuilt prior to lower priority data.
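The abstract names the inputs to an importance ranking but not how they are combined. The following is a minimal sketch of one possible combination, a weighted sum of log-scaled heuristic measures plus a bonus from the explicit factors. The record type `ObjectStats`, its field names, the weights, the vault name, and the use of a weighted sum are all assumptions made for illustration; none of them comes from the patent text, which also does not say whether older or larger objects should rank higher.

```python
import math
from dataclasses import dataclass


@dataclass
class ObjectStats:
    """Hypothetical per-object record; every field name is illustrative."""
    access_requests: int      # access requests seen in the current window
    unique_requestors: int    # distinct requestors in the current window
    age_days: float           # age of the data object
    size_bytes: int           # size of the data object
    referenced_bytes: int     # amount of other data this object references
    dependent_bytes: int      # predicted data made useless if this object is lost
    vault_id: str             # explicit factor: storage vault identifier
    importance_marker: int = 0  # explicit factor: 0 means "no marker set"


# Assumed weights: the source gives neither values nor signs.
HEURISTIC_WEIGHTS = {
    "access_requests": 1.0,
    "unique_requestors": 1.0,
    "age_days": 0.1,
    "size_bytes": 0.1,
    "referenced_bytes": 0.5,
    "dependent_bytes": 2.0,
}

# Assumed per-vault bonus; the vault name is made up for the example.
VAULT_BONUS = {"vault-critical": 10.0}


def importance_ranking(stats: ObjectStats) -> float:
    """Return a score where a larger value means a more important object."""
    # log1p keeps byte counts from swamping the small request counts.
    heuristic = sum(
        weight * math.log1p(getattr(stats, name))
        for name, weight in HEURISTIC_WEIGHTS.items()
    )
    explicit = VAULT_BONUS.get(stats.vault_id, 0.0) + stats.importance_marker
    return heuristic + explicit
```

Because the heuristic measures may be updated periodically, a function like this would be re-run whenever the underlying statistics are refreshed, so a ranking is a snapshot rather than a fixed property of the object.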
20 Claims
1. A method for execution by one or more processing modules of one or more computing devices of a dispersed storage network (DSN) having a plurality of storage units, the plurality of storage units storing a plurality of data objects in the form of encoded data slices, the method comprises:
- generating a first importance ranking for a first data object of the plurality of data objects;
- generating a second importance ranking for a second data object of the plurality of data objects, the first importance ranking and the second importance ranking based on one or more ranking factor;
- detecting a plurality of the encoded data slices that require rebuilding, wherein each encoded data slice of the plurality of the encoded data slices is a dispersed storage error encoded portion of a respective one of the plurality of data objects, and wherein the plurality of the encoded data slices that require rebuilding include at least one encoded data slice of the first data object and at least one encoded data slice of the second data object;
- performing a comparison of the first importance ranking and the second importance ranking; and
- based on the comparison, assigning respective rebuilding priority levels to the at least one encoded data slice of the first data object and the at least one encoded data slice of the second data object.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A method for execution by one or more processing modules of one or more computing devices of a dispersed storage network (DSN) having a plurality of storage units, the plurality of storage units storing a plurality of data objects in the form of encoded data slices, the method comprises:
- generating a first importance ranking for a first data object of the plurality of data objects;
- generating a second importance ranking for a second data object of the plurality of data objects, the first importance ranking and the second importance ranking based on at least one heuristic measure and at least one explicit ranking factor;
- detecting a plurality of the encoded data slices that require rebuilding, wherein each encoded data slice of the plurality of the encoded data slices is a dispersed storage error encoded portion of a respective one of the plurality of data objects, and wherein the plurality of the encoded data slices that require rebuilding include at least one encoded data slice of the first data object and at least one encoded data slice of the second data object;
- performing a comparison of the first importance ranking and the second importance ranking;
- based on the comparison, assigning a higher rebuilding priority level to the at least one encoded data slice of the first data object or the at least one encoded data slice of the second data object; and
- facilitating rebuilding of the at least one encoded data slice of the first data object and the at least one encoded data slice of the second data object based on the higher rebuilding priority level.
Dependent claims: 13, 14
15. A computing device of a group of computing devices of a dispersed storage network (DSN) having a plurality of storage units, the computing device comprises:
- a network interface;
- a local memory; and
- a processing module operably coupled to the network interface and the local memory, wherein the processing module operates to:
  - generate a first importance ranking for a first data object of the plurality of data objects;
  - generate a second importance ranking for a second data object of the plurality of data objects, the first importance ranking and the second importance ranking based on one or more ranking factor;
  - detect, via the network interface, a plurality of the encoded data slices that require rebuilding, wherein each encoded data slice of the plurality of the encoded data slices is a dispersed storage error encoded portion of a respective one of the plurality of data objects, and wherein the plurality of the encoded data slices that require rebuilding include at least one encoded data slice of the first data object and at least one encoded data slice of the second data object;
  - perform a comparison of the first importance ranking and the second importance ranking; and
  - based on the comparison, assign respective rebuilding priority levels to the at least one encoded data slice of the first data object and the at least one encoded data slice of the second data object.
Dependent claims: 16, 17, 18, 19, 20
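The compare-and-assign steps shared by the independent claims, together with the priority-ordered rebuilding recited in claim 12, can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the `Slice` type, the function names, the convention that level 0 is rebuilt first, and the use of a binary heap to order the work are all hypothetical choices. The rankings are taken as given, and a larger ranking is assumed to mean a more important data object.

```python
import heapq
from typing import Dict, List, NamedTuple


class Slice(NamedTuple):
    """Hypothetical handle for an encoded data slice flagged for rebuilding."""
    slice_name: str
    object_id: str  # the data object this slice is an encoded portion of


def assign_rebuild_priorities(
    slices: List[Slice], rankings: Dict[str, float]
) -> Dict[str, int]:
    """Map each slice name to a priority level (assumed: 0 is rebuilt first)."""
    # Compare the rankings of the affected objects: highest ranking first.
    affected = {s.object_id for s in slices}
    ordered = sorted({rankings[obj] for obj in affected}, reverse=True)
    level_of = {rank: level for level, rank in enumerate(ordered)}
    # Slices inherit the level of their data object; equal rankings share a level.
    return {s.slice_name: level_of[rankings[s.object_id]] for s in slices}


def rebuild_order(slices: List[Slice], rankings: Dict[str, float]) -> List[str]:
    """Return slice names in the order they would be rebuilt."""
    priorities = assign_rebuild_priorities(slices, rankings)
    # The index breaks ties so slices at the same level keep detection order.
    heap = [(priorities[s.slice_name], i, s.slice_name) for i, s in enumerate(slices)]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]
```

For example, with rankings `{"A": 9.0, "B": 2.5}` and detected slices `s1` (object B), `s2` (object A), `s3` (object B), the slice of object A is placed at level 0 and rebuilt before both slices of object B.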
Specification