Deduplication of data on disk devices using low-latency random read memory
Abstract
Deduplication of data using a low-latency random read memory (LLRRM) is described herein. Upon receiving a block, if a matching block stored on a disk device is found, the received block is deduplicated by producing an index to the address location of the matching block. In some embodiments, a matching block having a predetermined threshold number of associated indexes that reference the matching block is transferred to LLRRM, the threshold number being one or greater. Associated indexes may be modified to reflect the new address location in LLRRM. Deduplication may be performed using a mapping mechanism containing mappings of deduplicated blocks to matching blocks, the mappings being used for performing read requests. Deduplication described herein may reduce read latency because LLRRM has lower latency than disk devices in performing random read requests.
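The flow described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `DedupStore`, its methods, the in-memory dictionaries standing in for disk and LLRRM devices, and the use of a SHA-256 fingerprint to find matching blocks are all hypothetical choices made for the example.

```python
import hashlib


class DedupStore:
    """Toy model; dicts stand in for disk and LLRRM devices (assumption)."""

    def __init__(self, threshold=1):
        self.threshold = threshold  # indexes required before transfer (>= 1)
        self.devices = {"disk": {}, "llrrm": {}}  # device -> {address: data}
        self.next_addr = {"disk": 0, "llrrm": 0}
        self.location = {}   # fingerprint -> (device, address) of stored block
        self.referrers = {}  # fingerprint -> block ids mapped to that block
        self.mapping = {}    # block id -> (device, address), used by reads

    def _store(self, device, data):
        addr = self.next_addr[device]
        self.next_addr[device] += 1
        self.devices[device][addr] = data
        return (device, addr)

    def write(self, block_id, data):
        # Matching by content hash is an assumption of this sketch.
        fp = hashlib.sha256(data).digest()
        loc = self.location.get(fp)
        if loc is None:
            # No matching stored block: store the data on disk as usual.
            loc = self._store("disk", data)
            self.location[fp] = loc
            self.referrers[fp] = [block_id]
            self.mapping[block_id] = loc
            return loc
        # Matching block found: store no second copy, only an index to it.
        self.referrers[fp].append(block_id)
        self.mapping[block_id] = loc
        index_count = len(self.referrers[fp]) - 1  # exclude the original writer
        if loc[0] == "disk" and index_count >= self.threshold:
            # Transfer the matching block from disk to LLRRM.
            moved = self.devices["disk"].pop(loc[1])
            loc = self._store("llrrm", moved)
            self.location[fp] = loc
            # Modify every associated mapping to the new LLRRM address.
            for bid in self.referrers[fp]:
                self.mapping[bid] = loc
        return loc

    def read(self, block_id):
        # Reads resolve through the mapping, wherever the block now lives.
        device, addr = self.mapping[block_id]
        return self.devices[device][addr]
```

With `threshold=2`, the third write of identical content triggers the transfer, after which every mapping that pointed at the disk address resolves to the LLRRM address instead.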
23 Claims
1. A storage system for deduplicating blocks of data, the storage system comprising:
a set of one or more disk devices for storing a plurality of blocks;
a set of one or more low-latency random read memory (LLRRM) devices for storing a plurality of blocks, an LLRRM device having lower latency in performing random read requests relative to a disk device; and
a deduplication layer configured for:
receiving a set of blocks;
for each received block, determining whether the received block matches a block stored on a disk device; and
upon determining that a matching stored block is found in a disk device, deduplicating the received block by:
transferring the matching stored block from an original address location on a disk device to a new address location in an LLRRM device; and
producing an index to the matching stored block, the index comprising the new address location of the matching stored block in the LLRRM device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A storage system for deduplicating blocks of data, the storage system comprising:
a set of one or more disk devices for storing a plurality of blocks;
a set of one or more low-latency random read memory (LLRRM) devices for storing a plurality of blocks, an LLRRM device having lower latency in performing random read requests relative to a disk device; and
a deduplication layer configured for:
receiving a set of blocks;
for each received block, determining whether the received block matches a block stored on a disk device; and
upon determining that a matching stored block is found in a disk device, deduplicating the received block by:
determining whether a number of associated indexes referencing the matching stored block equals a predetermined threshold number, the threshold number being two or greater;
upon determining that the threshold number of associated indexes reference the matching stored block, transferring the matching stored block from an original address location on a disk device to a new address location in an LLRRM device; and
producing an index to the matching stored block, the index comprising the new address location of the matching stored block in the LLRRM device.
Dependent claims: 13, 14, 15, 16, 17, 18
19. A storage system for deduplicating blocks of data based on a predetermined threshold number (THN) of sequential blocks, the storage system comprising:
a set of one or more disk devices for storing a plurality of blocks, each disk device comprising a set of tracks for storing blocks;
a set of one or more low-latency random read memory (LLRRM) devices for storing a plurality of blocks, an LLRRM device having lower latency in performing random read requests relative to a disk device; and
a deduplication layer configured for:
receiving a set of blocks;
determining that a series of THN or more received blocks (THN series) matches a sequence of THN or more stored blocks (THN sequence), a series of blocks comprising a set of consecutive blocks and a sequence of blocks comprising a series of blocks stored on a same track of a disk device, THN having a value of 2 or greater;
deduplicating the blocks of the THN series using the matching THN sequence;
for at least one received block, determining that the received block matches a block stored on a disk device; and
deduplicating the received block by:
transferring the matching stored block from an original address location on a disk device to a new address location in an LLRRM device; and
producing an index to the matching stored block, the index comprising the new address location of the matching stored block in the LLRRM device.
Dependent claims: 20, 21, 22, 23
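The THN matching step of claim 19 can be illustrated with a short Python sketch. It assumes, purely for illustration, that a track is a list of stored blocks in on-disk order and that blocks match when their contents are equal; the function name `find_thn_runs`, its return format, and the greedy longest-match strategy are hypothetical and not taken from the patent.

```python
def find_thn_runs(received, track, thn):
    """Return (received_start, track_start, length) for each series of
    consecutive received blocks that matches `thn` or more consecutive
    blocks stored on the same track.

    Greedy longest-match search; one possible strategy, assumed here.
    """
    if thn < 2:
        raise ValueError("THN must have a value of 2 or greater")
    runs = []
    i = 0
    while i < len(received):
        best_len, best_start = 0, -1
        # Try every track position as the start of a matching sequence.
        for j in range(len(track)):
            k = 0
            while (i + k < len(received) and j + k < len(track)
                   and received[i + k] == track[j + k]):
                k += 1
            if k > best_len:
                best_len, best_start = k, j
        if best_len >= thn:
            # THN series found: these blocks dedupe against the track run.
            runs.append((i, best_start, best_len))
            i += best_len
        else:
            # Too short to count as a THN series; move to the next block.
            i += 1
    return runs


received = [b"A", b"B", b"C", b"X", b"B"]
track = [b"Z", b"A", b"B", b"C", b"D"]
runs = find_thn_runs(received, track, thn=2)
```

In this example the first three received blocks form a THN series against track positions 1 to 3. The final `b"B"` matches a stored block but not as part of a series, so it is the kind of block the claim handles individually, by transferring the matching stored block to LLRRM and indexing its new address.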
Specification