Systems and methods for retaining and using data block signatures in data protection operations
First Claim
1. A method of performing a copy operation, the method comprising:
- copying at least a first file comprising data blocks from a first storage device to a second storage device to create a secondary copy of the file comprising secondary copy data blocks, embedding deduplication signatures for the secondary copy data blocks in the secondary copy of the first file, wherein the embedded deduplication signatures are physically or logically contiguously embedded in the secondary copy of the file with respect to the secondary copy data blocks; and
in response to a command to copy one or more of the secondary copy data blocks to a third storage device;
comparing the embedded deduplication signatures to deduplication signatures stored at the third storage device without re-generating the values of the embedded deduplication signatures;
if the comparison indicates that a copy of the one or more of the secondary copy data blocks is already stored on the third storage device, storing a reference to the copy of the one or more secondary copy data blocks stored on the third storage device instead of transmitting the one or more secondary copy data blocks to the third storage device; and
if the comparison indicates that the copy of the one or more of the secondary copy data blocks is not stored on the third storage device, transmitting the one or more secondary copy data blocks to the third storage device.
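The decision flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `SecondaryCopy`, `ThirdDevice`, and `copy_to_third`, the use of SHA-256 as the signature function, and the in-memory dictionaries standing in for storage devices are all assumptions made for illustration. The point shown is that signatures are computed once, when the secondary copy is created, and are only read (never regenerated) during the later copy to the third device.

```python
import hashlib
from dataclasses import dataclass, field


def make_signature(block: bytes) -> str:
    """Hypothetical signature function; the claim does not name a hash."""
    return hashlib.sha256(block).hexdigest()


@dataclass
class SecondaryCopy:
    """Secondary copy of a file: each entry keeps a signature next to its block."""
    entries: list  # list of (signature, block) tuples

    @classmethod
    def from_blocks(cls, blocks):
        # Signatures are generated here, once, and embedded with the blocks.
        return cls([(make_signature(b), b) for b in blocks])


@dataclass
class ThirdDevice:
    """In-memory stand-in for the third storage device (an assumption)."""
    blocks: dict = field(default_factory=dict)      # signature -> stored block
    references: list = field(default_factory=list)  # signatures, in copy order
    bytes_received: int = 0


def copy_to_third(copy: SecondaryCopy, target: ThirdDevice):
    """Copy blocks to the target, comparing embedded signatures only."""
    sent = referenced = 0
    for sig, block in copy.entries:
        if sig in target.blocks:
            # Block already stored on the target: record a reference only.
            referenced += 1
        else:
            # Block not stored on the target: transmit it.
            target.blocks[sig] = block
            target.bytes_received += len(block)
            sent += 1
        # The target's layout always records which block belongs at this position.
        target.references.append(sig)
    return sent, referenced


secondary = SecondaryCopy.from_blocks([b"aaaa", b"bbbb", b"aaaa"])
target = ThirdDevice()
first_pass = copy_to_third(secondary, target)
second_pass = copy_to_third(secondary, target)
```

In this sketch the third `b"aaaa"` block is never transmitted on the first pass, and a second copy of the same file transmits nothing at all, because every embedded signature already matches one held by the target.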
Abstract
A system according to certain embodiments associates a signature value corresponding to a data block with one or more data blocks and a reference to the data block to form a signature/data word corresponding to the data block. The system further logically organizes the signature/data words into a plurality of files each comprising at least one signature/data word such that the signature values are embedded in the respective file. The system according to certain embodiments reads a previously stored signature value corresponding to a respective data block for sending from a backup storage system having at least one memory device to a secondary storage system. Based on an indication as to whether the data block is already stored on the secondary storage system, the system reads the data block from the at least one memory device for sending to the secondary storage system if the data block does not exist on the secondary storage system, wherein the signature value and not the data block is read from the at least one memory device if the data block exists on the secondary storage system.
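As a rough illustration of the read path described in the abstract, the following Python sketch stores each signature directly in front of its data block and reads the block only when the receiving system does not already hold it. The on-disk layout (a 32-byte SHA-256 digest followed by a 4-byte big-endian length), the function names `write_words` and `read_for_transfer`, and the use of `io.BytesIO` in place of a memory device are assumptions for illustration, not details taken from the patent.

```python
import hashlib
import io
import struct

SIG_LEN = 32  # SHA-256 digest size; the hash choice is an assumption
HEADER = struct.Struct(f">{SIG_LEN}sI")  # signature, then block length


def write_words(stream, blocks):
    """Write each block as a signature/data word: header, then block bytes."""
    for block in blocks:
        stream.write(HEADER.pack(hashlib.sha256(block).digest(), len(block)))
        stream.write(block)


def read_for_transfer(stream, known_signatures):
    """Return (signature, block_or_None) pairs and the payload bytes read.

    The stored signature is always read. The block itself is read only
    when the receiving system does not already hold that signature.
    """
    out = []
    payload_bytes_read = 0
    while True:
        header = stream.read(HEADER.size)
        if len(header) < HEADER.size:
            break  # end of file
        sig, length = HEADER.unpack(header)
        if sig in known_signatures:
            stream.seek(length, io.SEEK_CUR)  # skip the block without reading it
            out.append((sig, None))
        else:
            out.append((sig, stream.read(length)))
            payload_bytes_read += length
    return out, payload_bytes_read


backup_file = io.BytesIO()
write_words(backup_file, [b"alpha", b"beta"])
backup_file.seek(0)

already_on_secondary = {hashlib.sha256(b"alpha").digest()}
words, payload_bytes_read = read_for_transfer(backup_file, already_on_secondary)
```

Here the `b"alpha"` block is skipped with a seek, so only its signature is read from the backup file, while `b"beta"` is read in full for sending.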
20 Claims
11. A system that performs a copy operation, the system comprising:
- a first storage device that stores at least a first file comprising a plurality of data blocks;
a second storage device that stores a secondary copy of the first file comprising secondary copy data blocks that are a copy of the data blocks on the first storage device;
a plurality of embedded deduplication signatures in the secondary copy of the first file, wherein the plurality of embedded deduplication signatures are physically or logically contiguously embedded in the secondary copy of the file with respect to the secondary copy data blocks; and
a third storage device that stores a copy of one or more of the secondary copy data blocks copied from the second storage device to the third storage device;
computer hardware implementing a copy manager that directs;
comparing the embedded deduplication signatures to deduplication signatures stored at the third storage device without re-generating the values of the embedded deduplication signatures;
if the comparison indicates that a copy of the one or more of the secondary copy data blocks is already stored on the third storage device, storing a reference to the copy of the one or more secondary copy data blocks stored on the third storage device instead of transmitting the one or more secondary copy data blocks to the third storage device; and
if the comparison indicates that the copy of the one or more of the secondary copy data blocks is not stored on the third storage device, transmitting the one or more secondary copy data blocks to the third storage device.
Specification