ENHANCED RELIABILITY IN DEDUPLICATION TECHNOLOGY OVER STORAGE CLOUDS
Abstract
Methods and systems for enhancing reliability in deduplication over storage clouds are provided. A method includes: determining a weight for each of a plurality of duplicate files based on parameters associated with a respective storage device of each of the plurality of duplicate files; and designating one of the plurality of duplicate files as a master copy based on the determined weight.
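The two steps in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the text says the weight is based on storage-device parameters (claim 23 names reliability, health, and user preference, plus weighting factors), but it gives no formula, so the weighted sum, the 0-to-1 score scale, the factor values, and the names `Duplicate`, `weight`, and `designate_master` are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical weighting factors; the source names the parameter
# categories but does not give any values.
FACTORS = {"reliability": 0.5, "health": 0.3, "user_preference": 0.2}


@dataclass
class Duplicate:
    path: str
    device_params: dict  # parameter name -> score in [0, 1] for the hosting device


def weight(dup, factors=FACTORS):
    """Weighted sum of the storage-device parameters (an assumed formula)."""
    return sum(f * dup.device_params.get(name, 0.0) for name, f in factors.items())


def designate_master(duplicates):
    """Return the duplicate with the highest weight; ties go to the first one seen."""
    return max(duplicates, key=weight)


copies = [
    Duplicate("/cloud-a/report.pdf",
              {"reliability": 0.9, "health": 0.6, "user_preference": 0.2}),
    Duplicate("/cloud-b/report.pdf",
              {"reliability": 0.7, "health": 0.9, "user_preference": 0.8}),
]
master = designate_master(copies)
```

With these made-up scores the copy on the second device has the higher weight and is designated the master copy.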
25 Claims
1. A method implemented in a computer infrastructure comprising a combination of hardware and software, the method comprising:
determining a weight for each of a plurality of duplicate files based on parameters associated with a respective storage device of each of the plurality of duplicate files; and
designating one of the plurality of duplicate files as a master copy based on the determined weight.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system implemented in hardware and comprising a computer infrastructure operable to:
identify duplicate files stored at different storage devices;
determine a weight for each one of the duplicate files based on parameters associated with the storage devices; and
designate one of the duplicate files as a master copy based on the determined weights.
Dependent claims: 14, 15, 16, 17
18. A computer program product comprising a computer usable storage medium having readable program code embodied in the storage medium, wherein the computer program product includes at least one component operable to:
determine a hash value for each of a plurality of files;
determine a set of duplicate files based on the hash values;
determine a weight for each one of the duplicate files, wherein the weight is based on parameters associated with storage devices; and
designate a master copy of the set based on the weight of each one of the duplicate files; and
nominate remaining files in the set, other than the master copy, for deletion.
Dependent claims: 19, 20
21. A method of deduplication, comprising:
providing a computer infrastructure operable to:
determine a hash value for each of a plurality of files;
define sets of the plurality of files based on the hash values; and
for each respective one of the sets:
  determine a highest weight file in the respective set, wherein the weight is based on parameters associated with storage devices;
  designate the highest weight file as a master copy for the respective set; and
  nominate remaining files in the respective set, other than the master copy, for deletion.
Dependent claims: 22
23. A computer system for file deduplication, the system comprising:
a CPU, a computer readable memory and a computer readable storage media;
first program instructions to identify a set of duplicate files;
second program instructions to determine a weight for each one of the duplicate files; and
third program instructions to designate a master copy of the set based on the weight of each one of the duplicate files; and
fourth program instructions to delete remaining files of the set, other than the master copy, and replace the remaining files with respective pointers pointing to the master copy,
wherein the first, second, third, and fourth program instructions are stored on the computer readable storage media for execution by the CPU via the computer readable memory;
the weight is based on parameters associated with storage devices and weighting factors defined for the parameters; and
the parameters are related to at least one of reliability, health, and user preference of the storage devices.
Dependent claims: 24, 25
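The flow recited across the independent claims (hash each file, group identical hashes into sets, pick the highest-weight file per set as the master copy, then delete the remaining copies and leave pointers to the master) can be sketched as below. This is an illustrative model, not the patented implementation: files are held in an in-memory dictionary, SHA-256 stands in for the unspecified hash, the per-file weights are assumed to have been derived already from storage-device parameters, and the function names and paths are hypothetical.

```python
import hashlib
from collections import defaultdict


def find_duplicate_sets(files):
    """Group paths by the SHA-256 of their content; keep groups of two or more."""
    by_hash = defaultdict(list)
    for path, content in files.items():
        by_hash[hashlib.sha256(content).hexdigest()].append(path)
    return [paths for paths in by_hash.values() if len(paths) > 1]


def deduplicate(files, weights):
    """Return (store, pointers): masters keep their content, other copies become pointers."""
    store = dict(files)
    pointers = {}
    for paths in find_duplicate_sets(files):
        # The highest-weight file in the set is designated the master copy.
        master = max(paths, key=lambda p: weights[p])
        for path in paths:
            if path != master:
                # Remaining copy: delete it and point to the master instead.
                del store[path]
                pointers[path] = master
    return store, pointers


files = {
    "/cloud-a/report.pdf": b"quarterly numbers",
    "/cloud-b/report.pdf": b"quarterly numbers",
    "/cloud-c/notes.txt": b"unrelated",
}
# Assumed per-file weights, computed elsewhere from storage-device parameters.
weights = {
    "/cloud-a/report.pdf": 0.67,
    "/cloud-b/report.pdf": 0.78,
    "/cloud-c/notes.txt": 0.5,
}
store, pointers = deduplicate(files, weights)
```

In this toy run the two identical reports form one set, the copy with the higher assumed weight stays in `store`, and `pointers` maps the deleted copy's path to the master's path. A real system would likely compare contents after a hash match and would delete only after the nomination is confirmed; both are omitted here for brevity.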
Specification