Managing data deduplication in storage systems based on storage space characteristics
First Claim
1. A method for use in managing data deduplication in storage systems based on storage space characteristics, the method comprising:
- receiving a request to deduplicate a data object;
identifying a candidate data object for deduplicating the data object;
evaluating characteristics of storage tiers on which the data object and the candidate data object reside, wherein the data object resides on a first storage tier and the candidate data object resides on a second storage tier, wherein a data storage system includes the first storage tier and the second storage tier configured such that performance characteristics associated with the first storage tier are different from performance characteristics associated with the second storage tier; and
based on the evaluating, selecting a master deduplicated copy from a group consisting of the data object and the candidate data object, wherein the data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the first storage tier are higher than performance characteristics associated with the second storage tier, wherein the candidate data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the second storage tier are higher than performance characteristics associated with the first storage tier; and
based on the selecting, applying a deduplicating technique to the data object and the candidate data object, wherein the data object is deduplicated to the candidate data object by updating mapping information of the data object to point to the candidate data object upon selection of the candidate data object as the master deduplicated copy, wherein the candidate data object is deduplicated to the data object by updating mapping information of the candidate data object to point to the data object upon selection of the data object as the master deduplicated copy.
9 Assignments
0 Petitions
Accused Products
Abstract
A method is used in managing data deduplication in storage systems based on storage space characteristics. Characteristics of first and second storage tiers are evaluated. A first data object resides on the first storage tier and a second data object resides on the second storage tier. The first and second data objects are selected for applying a deduplicating technique. A data storage system includes the first and second storage tiers configured such that performance characteristics associated with the first storage tier is different from the second storage tier. Based on the evaluation, the deduplicating technique is applied to the first and second data objects.
-
Citations
14 Claims
-
1. A method for use in managing data deduplication in storage systems based on storage space characteristics, the method comprising:
-
receiving a request to deduplicate a data object; identifying a candidate data object for deduplicating the data object; evaluating characteristics of storage tiers on which the data object and the candidate data object reside, wherein the data object resides on a first storage tier and the candidate data object resides on a second storage tier, wherein a data storage system includes the first storage tier and the second storage tier configured such that performance characteristics associated with the first storage tier are different from performance characteristics associated with the second storage tier; and based on the evaluating, selecting a master deduplicated copy from a group consisting of the data object and the candidate data object, wherein the data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the first storage tier are higher than performance characteristics associated with the second storage tier, wherein the candidate data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the second storage tier are higher than performance characteristics associated with the first storage tier; and based on the selecting, applying a deduplicating technique to the data object and the candidate data object, wherein the data object is deduplicated to the candidate data object by updating mapping information of the data object to point to the candidate data object upon selection of the candidate data object as the master deduplicated copy, wherein the candidate data object is deduplicated to the data object by updating mapping information of the candidate data object to point to the data object upon selection of the data object as the master deduplicated copy. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for use in managing data deduplication in storage systems based on storage space characteristics, the system comprising:
-
first logic receiving a request to deduplicate a data object; second logic identifying a candidate data object for deduplicating the data object; third logic evaluating characteristics of storage tiers on which the data object and the candidate data object reside, wherein the data object resides on a first storage tier and the candidate data object resides on a second storage tier, wherein a data storage system includes the first storage tier and the second storage tier configured such that performance characteristics associated with the first storage tier are different from performance characteristics associated with the second storage tier; fourth logic selecting, based on the evaluating, a master deduplicated copy from a group consisting of the data object and the candidate data object, wherein the data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the first storage tier are higher than performance characteristics associated with the second storage tier, wherein the candidate data object is selected as the master deduplicated copy upon determining that performance characteristics associated with the second storage tier are higher than performance characteristics associated with the first storage tier; and fifth logic applying, based on the selecting, a deduplicating technique to the data object and the candidate data object, wherein the data object is deduplicated to the candidate data object by updating mapping information of the data object to point to the candidate data object upon selection of the candidate data object as the master deduplicated copy, wherein the candidate data object is deduplicated to the data object by updating mapping information of the candidate data object to point to the data object upon selection of the data object as the master deduplicated copy. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification