Optimizing data storage among a plurality of data storage repositories
First Claim
1. A method for optimizing data storage among a plurality of data storage repositories, the method comprising:
- processing a data request submitted by a first application for storing data on one or more data storage mediums connected to a distributed communications network;
- determining a number and location of duplicate copies of the data to be maintained on more than one storage medium so that established quality of service requirements associated with the first application are met;
- managing duplicate copies of the data on said one or more data storage mediums to meet the established quality of service requirements for a group of applications that also use duplicated copies of said data while servicing the data request submitted by the first application,
  wherein managing the duplicate copies of the data comprises deduplicating copies of the data on one or more data storage mediums, in response to determining that the deduplication does not adversely affect one or more of the following in the distributed communication network: data access latency, data recovery reliability, data storage costs, and data storage bandwidth,
  wherein said one or more data storage mediums are positioned at various data paths in the distributed communications network such that maintaining duplicate copies of the data in at least some of the one or more data storage mediums helps at least one of data access latency, data recovery reliability, data storage costs, and data storage bandwidth in the distributed communications network, and
  wherein said deduplicating comprises determining a unique ID associated with the data and a location type to remove the data item from.
4 Assignments
0 Petitions
Abstract
Machines and methods for optimizing data storage among a plurality of data storage repositories are provided. The method comprises processing a data request submitted for storing data on one or more data storage mediums; determining whether duplicate copies of the data are to be maintained on more than one storage medium so that established quality of service requirements are met; and managing duplicate copies of the data on said one or more data storage mediums to meet the established quality of service requirements while servicing the data request.
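As an illustration only, the "determining" step summarized in the abstract could be sketched as below. The abstract does not say how the decision is made, so everything here is an assumption: the `Medium` and `QoS` types, the two requirements modeled (a latency ceiling and a minimum probability that at least one copy survives), the independence of failures, and the cheapest-first greedy choice are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Medium:
    """Hypothetical description of one data storage medium."""
    name: str
    latency_ms: float    # typical access latency seen by the application
    failure_prob: float  # probability that a copy stored here is lost
    cost_per_gb: float


@dataclass(frozen=True)
class QoS:
    """Hypothetical quality of service requirements for one application."""
    max_latency_ms: float
    min_durability: float  # required probability that at least one copy survives


def plan_replicas(media: List[Medium], qos: QoS) -> List[Medium]:
    """Pick how many copies to keep and where, cheapest media first."""
    # Only media fast enough for the application are candidates.
    candidates = sorted(
        (m for m in media if m.latency_ms <= qos.max_latency_ms),
        key=lambda m: m.cost_per_gb,
    )
    chosen: List[Medium] = []
    loss_prob = 1.0  # probability that every chosen copy is lost (assumes independent failures)
    for medium in candidates:
        chosen.append(medium)
        loss_prob *= medium.failure_prob
        if 1.0 - loss_prob >= qos.min_durability:
            return chosen
    raise ValueError("no placement meets the QoS requirements")
```

The length of the returned list is the number of duplicate copies and its entries are their locations. A real placement policy would likely weigh bandwidth and the needs of other applications as well, which this sketch leaves out.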
13 Citations
29 Claims
1. A method for optimizing data storage among a plurality of data storage repositories, the method comprising:
- processing a data request submitted by a first application for storing data on one or more data storage mediums connected to a distributed communications network;
- determining a number and location of duplicate copies of the data to be maintained on more than one storage medium so that established quality of service requirements associated with the first application are met;
- managing duplicate copies of the data on said one or more data storage mediums to meet the established quality of service requirements for a group of applications that also use duplicated copies of said data while servicing the data request submitted by the first application,
  wherein managing the duplicate copies of the data comprises deduplicating copies of the data on one or more data storage mediums, in response to determining that the deduplication does not adversely affect one or more of the following in the distributed communication network: data access latency, data recovery reliability, data storage costs, and data storage bandwidth,
  wherein said one or more data storage mediums are positioned at various data paths in the distributed communications network such that maintaining duplicate copies of the data in at least some of the one or more data storage mediums helps at least one of data access latency, data recovery reliability, data storage costs, and data storage bandwidth in the distributed communications network, and
  wherein said deduplicating comprises determining a unique ID associated with the data and a location type to remove the data item from.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system for optimizing data storage among a plurality of data storage repositories, the system comprising:
- a logic unit to process a data request submitted by a first application for storing data on one or more data storage mediums connected to a distributed communications network;
- a logic unit to determine a number and location of duplicate copies of the data to be maintained on more than one storage medium so that established quality of service requirements associated with the first application are met; and
- a logic unit to manage duplicate copies of the data on said one or more data storage mediums to meet the established quality of service requirements for a group of applications that also use duplicated copies of said data while servicing the data request submitted by the first application,
  wherein managing the duplicate copies of the data comprises deduplicating copies of the data on one or more data storage mediums, in response to determining that the deduplication does not affect one or more of the following in the distributed communication network: data access latency, data recovery reliability, data storage costs, and data storage bandwidth,
  wherein said one or more data storage mediums are positioned at various data paths in the distributed communications network such that maintaining duplicate copies of the data in at least some of the one or more data storage mediums helps at least one of data access latency, data recovery reliability, data storage costs, and data storage bandwidth in the distributed communications network, and
  wherein said deduplicating comprises determining a unique ID associated with the data and a location type to remove the data item from.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
25. A computer program product comprising a non-transitory computer readable storage medium having a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
- process a data request submitted by a first application for storing data on one or more data storage mediums connected to a distributed communications network;
- determine a number and location of duplicate copies of the data to be maintained on more than one storage medium so that established quality of service requirements associated with the first application are met; and
- manage duplicate copies of the data on said one or more data storage mediums to meet the established quality of service requirements for a group of applications that also use duplicated copies of said data while servicing the data request submitted by the first application,
  wherein managing the duplicate copies of the data comprises deduplicating copies of the data on one or more data storage mediums, in response to determining that the deduplication does not adversely affect one or more of the following in the distributed communication network: data access latency, data recovery reliability, data storage costs, and data storage bandwidth,
  wherein said one or more data storage mediums are positioned at various data paths in the distributed communications network such that maintaining duplicate copies of the data in at least some of the one or more data storage mediums helps at least one of data access latency, data recovery reliability, data storage costs, and data storage bandwidth in the distributed communications network, and
  wherein said deduplicating comprises determining a unique ID associated with the data and a location type to remove the data item from.
Dependent claims: 26, 27, 28, 29
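The deduplication condition shared by the independent claims (remove a copy only when doing so does not hurt the listed factors for the group of applications) can be pictured with a minimal sketch. This is not the claimed implementation. The `Copy` and `AppQoS` types, the location type names, and the simplifications are all assumptions: recovery reliability is approximated by a minimum copy count, latency by the fastest remaining copy, cost by trying the most expensive copy first, and bandwidth is not modeled.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Copy:
    """Hypothetical record of one stored copy of a data item."""
    data_id: str        # unique ID associated with the data
    location_type: str  # hypothetical labels, e.g. "edge-cache", "core-array", "archive"
    latency_ms: float
    cost: float


@dataclass(frozen=True)
class AppQoS:
    """Hypothetical requirements of one application sharing the data."""
    app: str
    max_latency_ms: float
    min_copies: int  # stand-in for data recovery reliability


def satisfies(copies: List[Copy], reqs: List[AppQoS]) -> bool:
    """True if the given copies still meet every application's requirements."""
    if not copies:
        return False
    best_latency = min(c.latency_ms for c in copies)
    return all(
        len(copies) >= r.min_copies and best_latency <= r.max_latency_ms
        for r in reqs
    )


def pick_copy_to_remove(copies: List[Copy], reqs: List[AppQoS]) -> Optional[Tuple[str, str]]:
    """Return (unique ID, location type) of a copy that is safe to remove, or None."""
    # Try the most expensive copy first so that removal saves the most storage cost.
    for victim in sorted(copies, key=lambda c: c.cost, reverse=True):
        remaining = [c for c in copies if c is not victim]
        if satisfies(remaining, reqs):
            return victim.data_id, victim.location_type
    return None
```

Returning `None` corresponds to the case where every possible removal would adversely affect at least one application, so all duplicate copies are kept.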
Specification