Scheduled or gradual redundancy encoding schemes for data storage
Abstract
Techniques for optimizing data storage are disclosed herein. In particular, methods and systems for implementing redundancy encoding schemes with data storage systems are described. The redundancy encoding schemes may be scheduled according to system and data characteristics. The schemes may span multiple tiers or layers of a storage system. The schemes may be generated, for example, in accordance with a transaction rate requirement, a data durability requirement or in the context of the age of the stored data. The schemes may be designed to rectify entropy-related effects upon data storage. The schemes may include one or more erasure codes or erasure coding schemes. Additionally, methods and systems for improving and/or accounting for failure correlation of various components of the storage system, including that of storage devices such as hard disk drives, are described.
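As an illustration of a scheme scheduled by the age of the stored data, the following is a minimal sketch only. The age thresholds, the (k, m) parameters, and the names `ErasureCode`, `SCHEDULE`, and `code_for_age` are hypothetical and do not come from the disclosure. Three-way replication is modeled here as a degenerate code with one data shard and two parity shards.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErasureCode:
    """A (k, m) erasure code: k data shards plus m parity shards."""
    data_shards: int
    parity_shards: int

    @property
    def storage_overhead(self) -> float:
        # Raw bytes stored per byte of user data.
        return (self.data_shards + self.parity_shards) / self.data_shards

    @property
    def tolerated_losses(self) -> int:
        # A maximum-distance-separable code (e.g. Reed-Solomon) survives
        # the loss of any m of its k + m shards.
        return self.parity_shards


# Hypothetical schedule: (minimum age in days, code applied from that age on).
# Entries must be sorted by ascending minimum age.
SCHEDULE = [
    (0, ErasureCode(1, 2)),     # fresh data: three full copies, 3.0x overhead
    (30, ErasureCode(6, 3)),    # after a month: 1.5x overhead
    (365, ErasureCode(10, 4)),  # after a year: 1.4x overhead
]


def code_for_age(age_days: int) -> ErasureCode:
    """Return the code of the last schedule entry whose minimum age is reached."""
    chosen = SCHEDULE[0][1]
    for min_age, code in SCHEDULE:
        if age_days >= min_age:
            chosen = code
    return chosen
```

Under these assumed values, data moves from 3.0x to 1.4x raw storage as it ages, while the number of shard losses it can survive rises from two to four.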
24 Claims
1. A computer-implemented method for optimizing data storage, comprising:
under the control of one or more computer systems configured with executable instructions, determining, based at least in part on an analysis of at least one of a profile of a customer among a plurality of customers using the one or more computer systems and storing data on a storage system, age of the data, a pattern of access to the data, or one or more operational characteristics of the storage system, a schedule for a scheduled erasure coding scheme that applies one or more erasure codes to the data at one or more scheduled intervals according to the schedule;
determining, for each subset of the stored data and based at least in part on an access frequency of the subset of stored data, the one or more erasure codes to be applied to the stored data for at least one of the scheduled intervals; and
applying the determined erasure codes to the associated subset of data at the associated scheduled intervals.
Dependent claims: 2, 3, 4, 5, 6
7. A computer-implemented method for optimizing data storage, comprising:
under the control of one or more computer systems configured with executable instructions, determining, based at least in part on an analysis of one or more storage characteristics associated with data stored on a storage system, a redundancy encoding scheme that encodes the data with one or more error-correcting codes according to a schedule; and
causing the error-correcting codes of the forward error correction scheme to be applied to the data in accordance with the schedule.
Dependent claims: 8, 9, 10, 11, 12
13. A data storage system, comprising:
one or more processors; and
memory, including instructions executable by the one or more processors to cause the computer system to at least:
determine, by analyzing one or more storage parameters corresponding to data stored on the system, a storage optimization plan that is effective to encode the data using one or more error correction codes according to a schedule; and
cause the storage optimization plan to be implemented by causing the data to be encoded according to the schedule.
Dependent claims: 14, 15, 16, 17, 18
19. One or more computer-readable storage media having collectively stored thereon executable instructions that, when executed by one or more processors of a computing resource provider's computer system, cause the computer system to at least:
generate a redundancy encoding schedule that includes one or more error correcting codes that, when applied to data stored upon the media according to the schedule, satisfies one or more storage requirements relating to the stored data; and
cause at least a subset of the data to be encoded in accordance with the generated redundancy encoding schedule.
Dependent claims: 20, 21, 22, 23, 24
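The steps recited in claim 1 (selecting a code per subset from its access frequency, then applying it at a scheduled interval) might be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: a single XOR parity shard stands in for whatever erasure codes a real system would use, and the thresholds in `choose_k` and the names `encode`, `recover`, and `run_interval` are hypothetical.

```python
from functools import reduce
from typing import Dict, List, Optional, Tuple


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def encode(data: bytes, k: int) -> List[bytes]:
    """Split data into k equal shards (zero-padded) and append one XOR parity shard."""
    shard_len = -(-len(data) // k)  # ceiling division
    # A real system would record the original length; here padding is just zeros.
    padded = data.ljust(shard_len * k, b"\0")
    shards = [padded[i * shard_len:(i + 1) * shard_len] for i in range(k)]
    return shards + [reduce(xor_bytes, shards)]


def recover(shards: List[Optional[bytes]]) -> List[bytes]:
    """Rebuild at most one missing shard (marked None) by XOR-ing the survivors."""
    missing = [i for i, s in enumerate(shards) if s is None]
    if len(missing) > 1:
        raise ValueError("single parity tolerates only one lost shard")
    repaired = list(shards)
    if missing:
        survivors = [s for s in shards if s is not None]
        repaired[missing[0]] = reduce(xor_bytes, survivors)
    return repaired


def choose_k(reads_per_day: float) -> int:
    """Hypothetical policy: hotter subsets get fewer data shards per parity shard."""
    if reads_per_day >= 100:
        return 2
    if reads_per_day >= 1:
        return 4
    return 8


def run_interval(subsets: Dict[str, Tuple[bytes, float]]) -> Dict[str, List[bytes]]:
    """One scheduled interval: pick a code per subset, then apply it."""
    return {name: encode(data, choose_k(freq)) for name, (data, freq) in subsets.items()}
```

For example, `run_interval({"hot": (b"abcd", 500.0), "cold": (b"abcdefgh", 0.1)})` encodes the frequently read subset as two data shards plus parity and the rarely read subset as eight data shards plus parity. A scheduler would call `run_interval` once per interval with refreshed access frequencies.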
Specification