Optimization of a partially deduplicated file
First Claim
1. In a computing environment, a method performed at least in part on at least one processor, comprising, re-optimizing a partially deduplicated file having at least one undeduplicated portion-into a fully deduplicated file, in which the partially deduplicated file further has at least one deduplicated portion that is maintained in a chunk store and is referenced by metadata associated with the partially deduplicated file, the re-optimizing including processing each undeduplicated portion of the file to determine at least one chunk, committing the at least one chunk to a chunk store, and modifying metadata associated with the file to include a reference to each chunk that is committed to the chunk store.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject disclosure is directed towards transforming a file having at least one undeduplicated portion into a fully deduplicated file. For each of the at least one undeduplicated portion, a deduplication mechanism defines at least one chunk between file offsets associated with the at least one undeduplicated portion. Chunk boundaries associated with the at least one chunk are stored within deduplication metadata. The deduplication mechanism aligns the at least one chunk with chunk boundaries of at least one deduplicated portion of the file. Then, the at least one chunk is committed to a chunk store.
-
Citations
20 Claims
- 1. In a computing environment, a method performed at least in part on at least one processor, comprising, re-optimizing a partially deduplicated file having at least one undeduplicated portion-into a fully deduplicated file, in which the partially deduplicated file further has at least one deduplicated portion that is maintained in a chunk store and is referenced by metadata associated with the partially deduplicated file, the re-optimizing including processing each undeduplicated portion of the file to determine at least one chunk, committing the at least one chunk to a chunk store, and modifying metadata associated with the file to include a reference to each chunk that is committed to the chunk store.
- 15. In a computing environment, a system comprising, at least one processor, a memory communicatively coupled to the at least one processor and including components comprising, a deduplication mechanism configured to re-optimize a partially deduplicated file having at least one undeduplicated portion into a fully deduplicated file, in which the partially deduplicated file further has at least one deduplicated portion that is maintained in a chunk store and is referenced by metadata associated with the partially deduplicated file, the deduplication mechanism configured to chunk each undeduplicated portion into at least one re-optimized chunk, to commit at least one re-optimized chunk into the chunk store, and to modify the metadata to reference each chunk of the file that is committed to the chunk store.
-
19. One or more computer-readable storage media having computer-executable instructions, which when executed perform steps, comprising:
- defining a chunk for an undeduplicated portion of a partially deduplicated file, in which the partially deduplicated file further has at least one deduplicated portion that is maintained in a chunk store and is referenced by metadata associated with the partially deduplicated file;
aligning chunk boundaries of the chunk with file offsets associated with at least one deduplicated portion of the file;
updating the metadata associated with the file based upon the chunk boundaries; and
committing the chunk to a chunk store. - View Dependent Claims (20)
- defining a chunk for an undeduplicated portion of a partially deduplicated file, in which the partially deduplicated file further has at least one deduplicated portion that is maintained in a chunk store and is referenced by metadata associated with the partially deduplicated file;
Specification