Method of Enhancing De-Duplication Impact by Preferential Selection of Master Copy to be Retained
First Claim
1. A method of selecting data for de-duplication, comprising:
- taking as input identified duplicate copies of data in a storage system; and
selecting a single copy of said identified duplicate copies as a master copy on a select storage device, including interfacing with a dynamic storage management tool to obtain a cumulative demand for all of said identified duplicate copies and estimating performance utilization of each of said identified duplicate copies.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for enhancing the impact of data de-duplication by preferential selection of the master copy to be retained based on current loads and performance metrics of the storage media devices. The computer system is configured to take as input the identified duplicate copies of data and evaluating their locations in storage devices to determine the cumulative affects of retaining one of the identified duplicate copies as a master copy and optionally allocating a new location if needed. Once a master copy has been designated, the remaining identified duplicate copies are removed from storage.
30 Citations
18 Claims
-
1. A method of selecting data for de-duplication, comprising:
-
taking as input identified duplicate copies of data in a storage system; and selecting a single copy of said identified duplicate copies as a master copy on a select storage device, including interfacing with a dynamic storage management tool to obtain a cumulative demand for all of said identified duplicate copies and estimating performance utilization of each of said identified duplicate copies. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer system, comprising:
-
a processor in communication with memory; at least two storage pools in communication with said processor; a storage management tool to identify all duplicate copies of data in the storage pools; and a duplicate manager in communication with said storage management tool to select and retain a single copy of said identified duplicate copies as a master copy on a select storage device, including said storage management tool to obtain a cumulative demand for all of said identified duplicate copies and performance utilization of each of said identified duplicate copies. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An article comprising:
a computer readable carrier including computer program instructions configured to manage duplicate copies of data, comprising; instructions to identify all duplicate copies of data in a storage system; and instructions to select and retain a single copy of said identified duplicate copies as a master copy on a select storage device, including interfacing with a dynamic storage management tool to obtain a cumulative demand for all of said identified duplicate copies and performance utilization of each of said identified duplicate copies. - View Dependent Claims (14, 15, 16, 17, 18)
Specification