BACKUP AND RESTORE STRATEGIES FOR DATA DEDUPLICATION
First Claim
1. A method for data stream backup, comprising:
- identifying for backup a plurality of optimized data streams stored in a chunk store, the chunk store including each optimized data stream as a plurality of chunks and corresponding optimized stream metadata, the plurality of chunks including at least one data chunk, the corresponding optimized stream metadata referencing the at least one data chunk, and the chunk store including all included data chunks in a deduplicated manner; and
storing at least a portion of the chunk store in a backup storage to backup the plurality of optimized data streams.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for backup and restore of optimized data streams are described. A chunk store includes each optimized data stream as a plurality of chunks including at least one data chunk and corresponding optimized stream metadata. The chunk store includes data chunks in a deduplicated manner. Optimized data streams stored in the chunk store are identified for backup. At least a portion of the chunk store is stored in backup storage according to an optimized backup technique, an un-optimized backup technique, an item level backup technique, or a data chunk identifier backup technique. Optimized data streams stored in the backup storage may be restored. A file reconstructor includes a callback module that generates calls to a restore application to request optimized stream metadata and any referenced data chunks from the backup storage. The file reconstructor reconstructs the data streams from the referenced data chunks.
-
Citations
20 Claims
-
1. A method for data stream backup, comprising:
-
identifying for backup a plurality of optimized data streams stored in a chunk store, the chunk store including each optimized data stream as a plurality of chunks and corresponding optimized stream metadata, the plurality of chunks including at least one data chunk, the corresponding optimized stream metadata referencing the at least one data chunk, and the chunk store including all included data chunks in a deduplicated manner; and storing at least a portion of the chunk store in a backup storage to backup the plurality of optimized data streams. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for restoring files from backup, comprising:
-
receiving a request for an optimized data stream to be retrieved from a chunk store in backup storage, the request including an identifier for the optimized stream metadata corresponding to the data stream; generating a first call to a restore application based on the optimized stream metadata, the first call specifying a file name for a first chunk container in backup storage that stores optimized stream metadata identified by the optimized stream metadata identifier, and specifying an offset for the optimized stream metadata in the first chunk container; receiving the optimized stream metadata in response to the first call; determining at least one data chunk identifier referenced in the optimized stream metadata; generating at least one additional call to the restore application corresponding to the at least one data chunk identifier to obtain at least one data chunk from at least one chunk container in backup storage; and receiving the at least one data chunk in response to the at least one additional call. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A system, comprising:
-
a data backup module that receives an identification of a plurality of optimized data streams stored in a chunk store for backup, the chunk store including each optimized data stream as a plurality of chunks and corresponding optimized stream metadata, the plurality of chunks including at least one data chunk, the corresponding optimized stream metadata referencing the at least one data chunk, the chunk store including all included data chunks in a deduplicated configuration; and the data backup module being configured to store at least a portion of the chunk store in a backup storage to backup the plurality of optimized data streams. - View Dependent Claims (18, 19, 20)
-
Specification