System and method for scheduling data storage replication over a network
First Claim
1. A method for replicating a group of data sets, the method comprising:
generating a plurality of schedules from which one schedule is to be selected for replication, wherein each schedule contains a list of replication times for each data set in the group and each data set has a recovery point objective (RPO) requirement and a utilization estimation of computational resources for said data set's replication;
calculating a fitness value for each of the schedules, wherein the fitness value is a function of first and second values, the first value being a computed combination of an interval of time for each data set that the RPO requirement of the data set is not satisfied, and the second value being a computed combination of the estimated utilization of computational resources at points in time during a timeframe for replicating the group of data sets;
selecting from the plurality of schedules, a schedule with a best fitness value among the fitness values of the schedules determined according to said calculating; and
replicating the group in accordance with the selected schedule.
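The calculating and selecting steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim only says that the fitness value is "a function of" two computed combinations, so everything concrete here is an assumption made for illustration: the `DataSet` fields, the fixed `duration` of one replication, a replica assumed to exist at time 0, summing the RPO violation intervals, taking the peak of the sampled utilization, the weights `w_rpo` and `w_util`, and treating a lower fitness value as the best one.

```python
from dataclasses import dataclass


@dataclass
class DataSet:
    name: str
    rpo: float          # RPO requirement: max allowed age of the newest replica (minutes)
    utilization: float  # estimated resource use while this data set replicates
    duration: float     # assumed length of one replication run (minutes)


def rpo_violation(times, rpo, horizon):
    """First value, per data set: total time in [0, horizon] during which the RPO is missed.

    Assumes a fresh replica exists at time 0 and that each replication
    resets the replica's age at its scheduled start time.
    """
    points = [0.0] + sorted(times) + [horizon]
    return sum(max(0.0, later - earlier - rpo)
               for earlier, later in zip(points, points[1:]))


def peak_utilization(schedule, data_sets, horizon, step=1.0):
    """Second value: estimated utilization sampled at points in time, combined as a peak."""
    peak = 0.0
    t = 0.0
    while t < horizon:
        load = sum(ds.utilization
                   for ds in data_sets
                   for start in schedule[ds.name]
                   if start <= t < start + ds.duration)
        peak = max(peak, load)
        t += step
    return peak


def fitness(schedule, data_sets, horizon, w_rpo=1.0, w_util=1.0):
    """Weighted sum of the two values; in this sketch, lower is better."""
    first = sum(rpo_violation(schedule[ds.name], ds.rpo, horizon) for ds in data_sets)
    second = peak_utilization(schedule, data_sets, horizon)
    return w_rpo * first + w_util * second


def select_best(schedules, data_sets, horizon):
    """Select the schedule with the best (lowest) fitness value."""
    return min(schedules, key=lambda s: fitness(s, data_sets, horizon))


# Hypothetical group of two data sets and three candidate schedules.
data_sets = [DataSet("a", rpo=30.0, utilization=5.0, duration=10.0),
             DataSet("b", rpo=60.0, utilization=3.0, duration=10.0)]
staggered = {"a": [0, 30], "b": [15]}   # RPOs met, replications never overlap
overlapping = {"a": [0, 30], "b": [0]}  # RPOs met, but a and b replicate together
sparse = {"a": [0], "b": [0]}           # a misses its RPO for 30 minutes
best = select_best([staggered, overlapping, sparse], data_sets, horizon=60.0)
```

Under these assumptions the staggered schedule is selected with a fitness of 5, against 8 for the overlapping schedule and 38 for the sparse one. A different combination (for example, average rather than peak utilization) or different weights could change the ranking.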
Abstract
The replication of a group of data sets is coordinated into a single schedule based on the recovery point objective (RPO) of each data set. The resulting generated schedule satisfies each data set's RPO requirements and minimizes bandwidth use. If necessary bandwidth exceeds available bandwidth, the resulting generated schedule attempts to best satisfy each data set's RPO requirements given the available bandwidth.
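The abstract does not say how the necessary bandwidth is determined. As a rough illustration only, the sketch below assumes that each data set transfers a fixed amount of changed data once per RPO interval, so a lower bound on the sustained bandwidth is the sum of size divided by RPO. The function names, the tuple layout, and the figures are hypothetical.

```python
def min_sustained_bandwidth(data_sets):
    """Lower bound on sustained bandwidth in MB/s.

    data_sets is a list of (delta_mb, rpo_seconds) pairs; the assumption is
    that each data set must move delta_mb at least once every rpo_seconds.
    """
    return sum(delta_mb / rpo_seconds for delta_mb, rpo_seconds in data_sets)


def all_rpos_satisfiable(data_sets, available_mb_per_s):
    """True if the assumed lower bound fits within the available bandwidth."""
    return min_sustained_bandwidth(data_sets) <= available_mb_per_s


# Hypothetical group: 600 MB every 300 s and 900 MB every 900 s.
group = [(600.0, 300.0), (900.0, 900.0)]
needed = min_sustained_bandwidth(group)
fits_on_fast_link = all_rpos_satisfiable(group, available_mb_per_s=4.0)
fits_on_slow_link = all_rpos_satisfiable(group, available_mb_per_s=2.5)
```

When the check fails, as on the slower link here, no schedule can meet every RPO under this model, which corresponds to the case where the schedule can only attempt to best satisfy the RPO requirements.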
20 Claims
1. A method for replicating a group of data sets, the method comprising:
generating a plurality of schedules from which one schedule is to be selected for replication, wherein each schedule contains a list of replication times for each data set in the group and each data set has a recovery point objective (RPO) requirement and a utilization estimation of computational resources for said data set's replication;
calculating a fitness value for each of the schedules, wherein the fitness value is a function of first and second values, the first value being a computed combination of an interval of time for each data set that the RPO requirement of the data set is not satisfied, and the second value being a computed combination of the estimated utilization of computational resources at points in time during a timeframe for replicating the group of data sets;
selecting from the plurality of schedules, a schedule with a best fitness value among the fitness values of the schedules determined according to said calculating; and
replicating the group in accordance with the selected schedule.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer system for replicating a group of data sets, the computer system comprising a processor programmed to:
generate a plurality of schedules from which one schedule is to be selected for replication, wherein each schedule contains a list of replication times for each data set in the group and each data set has a recovery point objective (RPO) requirement and a utilization estimation of computational resources for said data set's replication;
calculate a fitness value for each of the schedules, wherein the fitness value is a function of first and second values, the first value being a computed combination of an interval of time for each data set that the RPO requirement of the data set is not satisfied, and the second value being a computed combination of the estimated utilization of computational resources at points in time during a timeframe for replicating the group of data sets; and
select from the plurality of schedules, a schedule with a best fitness value among the calculated fitness values of the schedules.
Dependent claims: 12, 13, 14, 15, 16
17. A non-transitory computer readable storage medium having stored therein a computer program for scheduling a group of data sets for replication, wherein a computer system executing the computer program carries out the steps of:
generating a plurality of schedules from which one schedule is to be selected for replication, wherein each schedule contains a list of replication times for each data set in the group and each data set has a recovery point objective (RPO) requirement and a utilization estimation of computational resources for said data set's replication;
calculating a fitness value for each of the schedules, wherein the fitness value is a function of first and second values, the first value being a computed combination of an interval of time for each data set that the RPO requirement of the data set is not satisfied, and the second value being a computed combination of the estimated utilization of computational resources at points in time during a timeframe for replicating the group of data sets; and
selecting from the plurality of schedules, a schedule with a best fitness value among the fitness values of the schedules determined according to said calculating.
Dependent claims: 18, 19, 20
Specification