Optimization of a Partially Deduplicated File
First Claim
1. In a computing environment, a method performed at least in part on at least one processor, comprising, re-optimizing a file having at least one undeduplicated portion into a fully deduplicated file, including processing each undeduplicated portion of the file to determine at least one chunk, committing the at least one chunk to a chunk store, and modifying metadata associated with the file to include a reference to each chunk.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject disclosure is directed towards transforming a file having at least one undeduplicated portion into a fully deduplicated file. For each of the at least one undeduplicated portion, a deduplication mechanism defines at least one chunk between file offsets associated with the at least one undeduplicated portion. Chunk boundaries associated with the at least one chunk are stored within deduplication metadata. The deduplication mechanism aligns the at least one chunk with chunk boundaries of at least one deduplicated portion of the file. Then, the at least one chunk is committed to a chunk store.
114 Citations
20 Claims
- 1. In a computing environment, a method performed at least in part on at least one processor, comprising, re-optimizing a file having at least one undeduplicated portion into a fully deduplicated file, including processing each undeduplicated portion of the file to determine at least one chunk, committing the at least one chunk to a chunk store, and modifying metadata associated with the file to include a reference to each chunk.
- 15. In a computing environment, a system, comprising, a deduplication mechanism configured to re-optimize a file having at least one undeduplicated portion into a fully deduplicated file, the deduplication mechanism configured to chunk each undeduplicated portion into at least one re-optimized chunk, to commit at least one re-optimized chunk into a chunk store prior to fully re-optimizing the file, and to modify file metadata that represents each chunk of the file that is committed.
-
19. One or more computer-readable media having computer-executable instructions, which when executed perform steps, comprising:
-
defining a chunk for an undeduplicated portion of a file; aligning chunk boundaries of the chunk with file offsets associated with at least one deduplicated portion of the file; updating metadata associated with the file based upon the chunk boundaries; and committing the chunk to a chunk store. - View Dependent Claims (20)
-
Specification