Device type differentiation for redundancy coded data storage systems
Abstract
Techniques described and suggested herein include systems and methods for optimizing performance characteristics for data archives stored on data storage systems using redundancy coding techniques by placing different processed portions of the data in disparate systems. For example, redundancy coded shards, which may include identity shards that contain unencoded original data of archives, may be stored on different types of systems to optimize for various retrieval use cases and implemented environments. Implementing systems may monitor various performance characteristics so as to adaptively account for changes to some or all of the monitored parameters.
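One way to picture the adaptive placement mentioned in the abstract is the minimal sketch below. It is an illustration under stated assumptions, not the patented method: the `StorageSystem` record, its two monitored fields, and the `place_shards` helper are all hypothetical names, and the metric values are made up for the example. The sketch simply re-evaluates placement from whatever the latest monitored values are.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class StorageSystem:
    """Hypothetical record of monitored characteristics for one storage system."""
    name: str
    read_latency_ms: float      # assumed stand-in for "performance capabilities"
    annual_failure_rate: float  # assumed stand-in for "failure rate"


def place_shards(systems: List[StorageSystem]) -> Dict[str, str]:
    """Pick a target per shard type from the current monitored values.

    Identity shards go to the system with the lowest read latency;
    encoded shards go to the system with the lowest failure rate.
    """
    fastest = min(systems, key=lambda s: s.read_latency_ms)
    most_durable = min(systems, key=lambda s: s.annual_failure_rate)
    return {"identity": fastest.name, "encoded": most_durable.name}


# Illustrative values only, not measurements.
tier_a = StorageSystem("tier-a", read_latency_ms=0.5, annual_failure_rate=0.02)
tier_b = StorageSystem("tier-b", read_latency_ms=40.0, annual_failure_rate=0.005)
placement = place_shards([tier_a, tier_b])

# If monitoring later reports that tier-a has slowed down, calling
# place_shards again yields a different target for identity shards.
tier_a.read_latency_ms = 80.0
updated_placement = place_shards([tier_a, tier_b])
```

Calling `place_shards` on each monitoring cycle is the simplest possible policy; a real system would presumably also weigh the cost of migrating shards that are already stored.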
20 Claims
1. A computer-implemented method, comprising:
- storing a plurality of archives by at least:
  - generating a set of shards, a minimum quorum quantity of the shards in the set being usable, by a redundancy code, to generate original data of the plurality of archives, the set of shards including at least:
    - identity shards that contain the original data of the plurality of archives; and
    - encoded shards representing an encoded form of the original data of the plurality of archives;
  - causing the identity shards to be stored on at least a first data storage system, the first data storage system having greater performance capabilities than a second data storage system; and
  - causing the encoded shards to be stored on at least the second data storage system, the second data storage system having a lower failure rate than the first data storage system; and
- in response to receiving a request for an archive among the stored plurality of archives, at least:
  - locating, in the first data storage system, a respective identity shard corresponding to the requested archive; and
  - retrieving the located respective identity shard so as to retrieve the requested archive.
Dependent claims: 2, 3, 4
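The store-and-retrieve flow recited in claim 1 can be sketched as follows. This is a toy illustration, not the claimed implementation: it assumes a single XOR parity shard as the redundancy code (the claim does not name a specific code), and it uses in-memory dictionaries as stand-ins for the first and second data storage systems. The `ArchiveStore` class and all of its attribute names are hypothetical.

```python
from functools import reduce
from typing import Dict, Tuple


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


class ArchiveStore:
    """Toy model: identity shards on a 'fast' store, parity on a 'durable' store."""

    def __init__(self) -> None:
        self.fast_store: Dict[str, bytes] = {}     # stands in for the first system
        self.durable_store: Dict[str, bytes] = {}  # stands in for the second system
        # archive id -> (identity shard key, original length before padding)
        self.index: Dict[str, Tuple[str, int]] = {}

    def store(self, archives: Dict[str, bytes]) -> None:
        """Shard a non-empty batch of archives and place each shard type."""
        width = max(len(data) for data in archives.values())
        padded = []
        for n, (archive_id, data) in enumerate(archives.items()):
            key = f"identity-{n}"
            shard = data.ljust(width, b"\0")  # pad so all shards share one length
            self.fast_store[key] = shard      # identity shard holds unencoded data
            self.index[archive_id] = (key, len(data))
            padded.append(shard)
        # Encoded shard: XOR of all identity shards (assumed redundancy code).
        self.durable_store["parity-0"] = reduce(xor_bytes, padded)

    def retrieve(self, archive_id: str) -> bytes:
        """Serve a request directly from the identity shard, with no decoding."""
        key, length = self.index[archive_id]
        return self.fast_store[key][:length]


store = ArchiveStore()
store.store({"a": b"hello", "b": b"hi"})
archive_a = store.retrieve("a")
```

The point of the split is visible in `retrieve`: because identity shards hold the original bytes, a normal read touches only the faster system, and the encoded shard is needed only when something has to be rebuilt.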
5. A system, comprising:
- at least one computing device that implements one or more services, wherein the one or more services at least:
  - process a plurality of archives to generate a set of redundancy coded shards, a minimum quorum quantity of the redundancy coded shards in the set being usable, by a redundancy code, to generate original data of the plurality of archives, the set of redundancy coded shards including at least some identity shards that contain the original data of the plurality of archives;
  - cause storage of the identity shards on at least a first data storage system accessible to the at least one computing device;
  - cause storage of a remainder of the set of redundancy coded shards on at least a second data storage system that differs from the first data storage system; and
  - in response to receiving a request for the original data of an archive of the plurality of archives, at least:
    - cause the first data storage system to locate a corresponding identity shard; and
    - cause retrieval of the located corresponding identity shard.
Dependent claims: 6, 7, 8, 9, 10, 11, 12
13. A non-transitory computer-readable storage medium having stored thereon executable instructions that, as a result of execution by one or more processors of a computer system, cause the computer system to at least:
- generate, from received archives and using a redundancy code, a set of shards, a minimum quorum quantity of the redundancy coded shards in the set being usable to generate any unavailable shard within the set of redundancy coded shards, the set of redundancy coded shards including identity shards that contain original data of the plurality of received archives;
- cause storage of the identity shards on at least a first data storage system accessible to the computer system;
- cause storage of a remainder of the set of redundancy coded shards on at least a second data storage system geographically separate from the first data storage system; and
- respond to requests for archive retrieval by at least:
  - causing the first data storage system to locate a corresponding identity shard; and
  - causing retrieval of the located corresponding identity shard.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
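Claim 13 turns on the property that a quorum of shards can regenerate any unavailable shard. The sketch below shows that property for the simplest case, and only as an assumption-laden illustration: it uses a single XOR parity shard over equal-length identity shards, so the quorum is every shard but one. The claim itself does not specify the redundancy code, and the function names `encode` and `regenerate` are hypothetical.

```python
from functools import reduce
from typing import List, Optional


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def encode(identity_shards: List[bytes]) -> List[bytes]:
    """Return the identity shards followed by one XOR parity shard."""
    parity = reduce(xor_bytes, identity_shards)
    return list(identity_shards) + [parity]


def regenerate(shards: List[Optional[bytes]]) -> List[bytes]:
    """Rebuild at most one unavailable shard (marked None) from the rest.

    With one parity shard, the XOR of all shards is zero, so any single
    missing shard equals the XOR of the remaining ones.
    """
    missing = [i for i, shard in enumerate(shards) if shard is None]
    if len(missing) > 1:
        raise ValueError("fewer than a quorum of shards is available")
    restored = list(shards)
    if missing:
        available = [shard for shard in shards if shard is not None]
        restored[missing[0]] = reduce(xor_bytes, available)
    return restored


full_set = encode([b"abcd", b"efgh", b"ijkl"])

# Lose one identity shard: the remaining three shards rebuild it.
rebuilt_identity = regenerate([full_set[0], None, full_set[2], full_set[3]])

# Lose the parity shard instead: the identity shards rebuild it.
rebuilt_parity = regenerate([full_set[0], full_set[1], full_set[2], None])
```

A production redundancy code such as Reed-Solomon would tolerate more than one lost shard; the XOR case is used here only because it keeps the quorum idea visible in a few lines.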
Specification