TECHNIQUES FOR OPTIMIZING DATA REDUCTION BY UNDERSTANDING APPLICATION DATA
First Claim
1. A method of determining data reduction options for data sets comprising:
- receiving first data reduction information for a data set, wherein the first data reduction information includes first compression information regarding compression of chunks of the data set, wherein the first data reduction information includes first deduplication information regarding deduplication of chunks of the data set;
determining, in accordance with the first data reduction information for the data set, first settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set;
receiving, during a first time period when the first settings are effective, a plurality of writes directed to the data set, wherein the plurality of writes modify a plurality of chunks of the data set;
receiving second data reduction information for the data set, wherein the second data reduction information includes second compression information regarding compression of the plurality of chunks of the data set modified by the plurality of writes during the first time period, wherein the second data reduction information includes second deduplication information regarding deduplication of the plurality of chunks of the data set modified by the plurality of writes during the first time period; and
determining, in accordance with the second data reduction information for the data set, second settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for determining data reduction options may include: receiving first data reduction information regarding compression and deduplication of chunks of a data set; determining, in accordance with the first data reduction information for the data set, first settings denoting whether compression and deduplication are enabled or disabled for the data set; receiving, during a first time period when the first settings are effective, writes directed to the data set; receiving second data reduction information regarding compression and deduplication of chunks of the data set modified by writes during the first time period; and determining, in accordance with the second plurality of data reduction statistics for the data set, second settings denoting whether compression and deduplication are enabled or disabled for the data set. Data reduction information about the data set may include histograms regarding compressed sizes and detected duplications and may include data reduction information regarding different data set regions.
-
Citations
20 Claims
-
1. A method of determining data reduction options for data sets comprising:
-
receiving first data reduction information for a data set, wherein the first data reduction information includes first compression information regarding compression of chunks of the data set, wherein the first data reduction information includes first deduplication information regarding deduplication of chunks of the data set; determining, in accordance with the first data reduction information for the data set, first settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set; receiving, during a first time period when the first settings are effective, a plurality of writes directed to the data set, wherein the plurality of writes modify a plurality of chunks of the data set; receiving second data reduction information for the data set, wherein the second data reduction information includes second compression information regarding compression of the plurality of chunks of the data set modified by the plurality of writes during the first time period, wherein the second data reduction information includes second deduplication information regarding deduplication of the plurality of chunks of the data set modified by the plurality of writes during the first time period; and determining, in accordance with the second data reduction information for the data set, second settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a processor; and a memory comprising code stored thereon that, when executed, perform a method of determining data reduction options for data sets comprising; receiving first data reduction information for a data set, wherein the first data reduction information includes first compression information regarding compression of chunks of the data set, wherein the first data reduction information includes first deduplication information regarding deduplication of chunks of the data set; determining, in accordance with the first data reduction information for the data set, first settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set; receiving, during a first time period when the first settings are effective, a plurality of writes directed to the data set, wherein the plurality of writes modify a plurality of chunks of the data set; receiving second data reduction information for the data set, wherein the second data reduction information includes second compression information regarding compression of the plurality of chunks of the data set modified by the plurality of writes during the first time period, wherein the second data reduction information includes second deduplication information regarding deduplication of the plurality of chunks of the data set modified by the plurality of writes during the first time period; and determining, in accordance with the second data reduction information for the data set, second settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set.
-
-
16. A computer readable medium comprising code stored thereon that, when executed, performs a method of determining data reduction options for data sets comprising:
-
receiving first data reduction information for a data set, wherein the first data reduction information includes first compression information regarding compression of chunks of the data set, wherein the first data reduction information includes first deduplication information regarding deduplication of chunks of the data set; determining, in accordance with the first data reduction information for the data set, first settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set; receiving, during a first time period when the first settings are effective, a plurality of writes directed to the data set, wherein the plurality of writes modify a plurality of chunks of the data set; receiving second data reduction information for the data set, wherein the second data reduction information includes second compression information regarding compression of the plurality of chunks of the data set modified by the plurality of writes during the first time period, wherein the second data reduction information includes second deduplication information regarding deduplication of the plurality of chunks of the data set modified by the plurality of writes during the first time period; and determining, in accordance with the second data reduction information for the data set, second settings denoting whether compression is enabled or disabled for the data set and denoting whether deduplication is enabled or disabled for the data set. - View Dependent Claims (17, 18, 19, 20)
-
Specification