STORAGE CODES FOR DATA RECOVERY
First Claim
Patent Images
1. A method of recovering data, the method comprising:
- maintaining data storage units and parity storage units, where the parity storage units store parity data computed from a group of data stored in the data storage units; and
when a storage unit comprising either a first data storage or a first parity storage unit becomes unavailable, reading recovery data and using the recovery data to fully recover the data that was stored on the unavailable storage unit, wherein the recovery data consists of a portion of data in a combination of the data storage units and the parity storage units excluding the unavailable storage unit, and wherein the amount of data stored by the combination, in aggregate, is less than the amount of data in the group of data.
3 Assignments
0 Petitions
Accused Products
Abstract
A random permutation code is described which provides efficient repair of data nodes. A specific implementation of a permutation code is also described, followed by description of a MISER-Permutation code. Finally, an optimal repair strategy is explained that involves an iterative process of downloading the most effective available parity data, updating costs of remaining parity data, and repeating until the data is recovered.
29 Citations
20 Claims
-
1. A method of recovering data, the method comprising:
-
maintaining data storage units and parity storage units, where the parity storage units store parity data computed from a group of data stored in the data storage units; and when a storage unit comprising either a first data storage or a first parity storage unit becomes unavailable, reading recovery data and using the recovery data to fully recover the data that was stored on the unavailable storage unit, wherein the recovery data consists of a portion of data in a combination of the data storage units and the parity storage units excluding the unavailable storage unit, and wherein the amount of data stored by the combination, in aggregate, is less than the amount of data in the group of data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. One or more computer-readable storage media storing information to enable one or more computing devices to perform a process, the process comprising:
-
maintaining a plurality of nodes comprised of 2k parity nodes and 3k data nodes, wherein each of data node consists of k̂
2 data storage units, and wherein each of parity node consists of k̂
2 parity storage units; andrecovering a first data node when the first data node fails or is disabled, the recovering comprising reading no more than k data storage units per data node and/or parity node. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A method to identify a reduced cost repair strategy for any given linear erasure code, the method comprising:
-
maintaining data storage units and parity storage units, where the parity storage units store parity data computed from a group of data stored in the data storage units through the linear erasure code; identifying a plurality of data nodes and parity nodes, each data node storing multiple data storage units, and each parity node storing multiple parity storage units; when a data node becomes unavailable, identifying multiple data storage units thereof that consequently are unavailable; and identifying a minimal repair set of data storage units on still-available data storage nodes and parity storage units that reduces a cost of repairing the unavailable data node. - View Dependent Claims (17, 18, 19, 20)
-
Specification