Preferential selection of candidates for delta compression
First Claim
1. A delta compression system, comprising:
- a delta processing module to delta compress a set of data chunks; and
a cache to store a first set of data chunk candidates;
a data storage system to store a second set of data chunk candidates;
a preferential selection module coupled to the cache, data storage system and delta processing module, the preferential selection module configured to;
select, using a selection mechanism, a set of data chunk candidates from at least one of the first and second set of data chunk candidates, wherein the selection mechanism ranks the set of data chunk candidates by similarity and tie-breaks based on location and status information when at least one of resource availability or throughput is higher than a predetermined high threshold value;
monitor delta compression throughput and resource utilization of the delta compression system, anddynamically adjust the selection mechanism that selects the set of data chunk candidates to be used in delta compression of the set of data chunks to be compressed, in response to detecting changes in the delta compression throughput and resource utilization, wherein the selection mechanism is not adjusted, in response to detecting that the resource availability and throughput are lower than a predetermined high threshold value and in response to detecting that the resource availability and throughput are higher than a predetermined low threshold value.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method and system for improving efficiency in a delta compression process in a data storage system selects a data chunk to delta compress and selects a set of candidate data chunks using a first selection mechanism. Throughput or resource utilization is monitored. A change is made to a second selection mechanism that increases similarity of the set of candidates with the selected data chunk to improve compression in response to determining high resource availability or high throughput level. A change is made to a third selection mechanism that increases throughput of the delta compression process in response to determining low resources availability or low throughput.
93 Citations
6 Claims
-
1. A delta compression system, comprising:
-
a delta processing module to delta compress a set of data chunks; and a cache to store a first set of data chunk candidates; a data storage system to store a second set of data chunk candidates; a preferential selection module coupled to the cache, data storage system and delta processing module, the preferential selection module configured to; select, using a selection mechanism, a set of data chunk candidates from at least one of the first and second set of data chunk candidates, wherein the selection mechanism ranks the set of data chunk candidates by similarity and tie-breaks based on location and status information when at least one of resource availability or throughput is higher than a predetermined high threshold value; monitor delta compression throughput and resource utilization of the delta compression system, and dynamically adjust the selection mechanism that selects the set of data chunk candidates to be used in delta compression of the set of data chunks to be compressed, in response to detecting changes in the delta compression throughput and resource utilization, wherein the selection mechanism is not adjusted, in response to detecting that the resource availability and throughput are lower than a predetermined high threshold value and in response to detecting that the resource availability and throughput are higher than a predetermined low threshold value. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification