Selection of erasure code parameters for no data repair
Abstract
An improved data storage technique achieves a desired level of reliability by providing sufficient redundancy in erasure coded data to maintain the data, without repair, for a prescribed period of time. The improved technique employs a newly devised, continuous-time Markov chain model. The model can be applied in computerized systems to establish erasure coding parameters for storing and reliably maintaining data for a designated period of time, without any need to repair the data to reestablish an original or previous level of erasure coding redundancy.
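This section does not reproduce the MTTDL equation produced by the Markov chain model. As a hedged illustration only, one simple no-repair model assumes that each fragment fails independently at a constant rate $\lambda$ (a symbol introduced here, not taken from the source). With $i$ fragments still intact, the next failure then arrives after a mean time of $1/(i\lambda)$, and data is lost when the count of intact fragments drops from $K$ to $K-1$. Summing the mean holding times of the states $N, N-1, \dots, K$ gives:

```latex
\mathrm{MTTDL}(N, K)
  = \sum_{i=K}^{N} \frac{1}{i\,\lambda}
  = \frac{1}{\lambda}\left(H_N - H_{K-1}\right),
\qquad H_m = \sum_{j=1}^{m} \frac{1}{j}
```

Under this assumed model, MTTDL grows only logarithmically in $N$ for a fixed $K$, so long retention periods can require substantially more fragments. The patent's own equation may differ.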
20 Claims
1. A method of storing a set of data including at least one chunk of data, comprising:
- receiving a value that designates a desired retention period over which a chunk of the set of data is to be retained;
- selecting a pair of erasure coding parameters, N and K, wherein N represents a total number of fragments for storing the chunk and K represents a minimum number of the N fragments that are needed to ensure that the chunk can be recovered without data loss;
- calculating a mean time to data loss (MTTDL) of the chunk by applying N and K in an equation for MTTDL;
- testing whether the calculated MTTDL is at least as great as the designated retention period;
- repeating the selecting, calculating and testing for at least one different value pair of N and K until values of N and K are identified for which the calculated MTTDL is at least as great as the designated retention period;
- conducting, by a computing node on a network, an erasure coding operation on the chunk using erasure coding parameters N′ and K′, wherein N′ and K′ are each at least as great as the respective identified values of N and K; and
- storing the N′ fragments in multiple storage units, including at least one storage unit accessible to the computing node over the network.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A computerized apparatus, comprising:
- a set of processors; and
- memory, coupled to the set of processors, the memory arranged to store instructions executable by the set of processors, wherein the set of processors executing instructions from the memory forms a specialized circuit arranged to:
- receive a value that designates a desired retention period over which a chunk of a set of data is to be retained;
- select a pair of erasure coding parameters, N and K, wherein N represents a total number of fragments for storing the chunk and K represents a minimum number of the N fragments that are needed to ensure that the chunk can be recovered without data loss;
- calculate a mean time to data loss (MTTDL) of the chunk by applying N and K in an equation for MTTDL;
- test whether the calculated MTTDL is at least as great as the designated retention period;
- repeatedly select, calculate and test for at least one different value pair of N and K until values of N and K are identified for which the calculated MTTDL is at least as great as the designated retention period;
- conduct an erasure coding operation on the chunk using erasure coding parameters N′ and K′, wherein N′ and K′ are each at least as great as the respective identified values of N and K; and
- store the N′ fragments in at least one storage unit.
Dependent claims: 17, 18, 19
20. A computer program product including non-transitory computer readable medium having instructions which, when executed by a set of processors of a computerized apparatus, cause the set of processors to perform a method of storing a set of data including at least one chunk of data, the method comprising:
- receiving a value that designates a desired retention period over which a chunk of the set of data is to be retained;
- selecting a pair of erasure coding parameters, N and K, wherein N represents a total number of fragments for storing the chunk and K represents a minimum number of the N fragments that are needed to ensure that the chunk can be recovered without data loss;
- calculating a mean time to data loss (MTTDL) of the chunk by applying N and K in an equation for MTTDL;
- testing whether the calculated MTTDL is at least as great as the designated retention period;
- repeating the selecting, calculating and testing for at least one different value pair of N and K until values of N and K are identified for which the calculated MTTDL is at least as great as the designated retention period;
- conducting an erasure coding operation on the chunk using erasure coding parameters N′ and K′, wherein N′ and K′ are each at least as great as the respective identified values of N and K; and
- storing the N′ fragments in at least one storage unit.
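The selecting, calculating and testing loop recited in the independent claims can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: the function names, the fixed-K search order, the `max_n` bound, and above all the MTTDL formula (independent fragments failing at a constant rate, with no repair) are assumptions made for the example, since the claims refer only to "an equation for MTTDL".

```python
from typing import Optional, Tuple


def mttdl_no_repair(n: int, k: int, failure_rate: float) -> float:
    """Mean time to data loss for an (N, K) code that is never repaired.

    ASSUMPTION (not from the source): fragments fail independently at a
    constant rate, so with i fragments intact the next failure takes
    1 / (i * failure_rate) on average. Data is lost once fewer than K
    fragments remain.
    """
    if not 1 <= k <= n:
        raise ValueError("require 1 <= K <= N")
    if failure_rate <= 0:
        raise ValueError("failure_rate must be positive")
    return sum(1.0 / (i * failure_rate) for i in range(k, n + 1))


def select_parameters(
    retention_period: float,
    failure_rate: float,
    k: int,
    max_n: int,
) -> Optional[Tuple[int, int]]:
    """Try (N, K) pairs for a fixed K, growing N, until MTTDL is sufficient.

    Returns the first (N, K) whose MTTDL is at least the retention period,
    or None if no N up to max_n qualifies. The search order and the max_n
    cap are hypothetical choices for this sketch.
    """
    for n in range(k, max_n + 1):                      # select a pair
        mttdl = mttdl_no_repair(n, k, failure_rate)    # calculate
        if mttdl >= retention_period:                  # test
            return n, k
    return None


if __name__ == "__main__":
    # Hypothetical inputs: 10-year retention, 0.05 failures per fragment-year.
    print(select_parameters(10.0, 0.05, k=10, max_n=64))  # (16, 10)
```

With these illustrative inputs, fifteen fragments give an MTTDL of about 9.8 years and sixteen give about 11.0 years, so the search stops at N = 16. Per the claims, the actual encoding may then use any N′ and K′ that are each at least as great as the identified values.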
Specification