MULTIPLE RETENTION PERIOD BASED REPRESENTATIONS OF A DATASET BACKUP
First Claim
1. A method comprising:
after receiving, by a storage system that deduplicates data, a request to back up a first dataset, creating at a first storage tier corresponding to the deduplicated storage system a first representation of the first dataset backup, wherein the first representation corresponds to a first retention period and the first representation references a plurality of data units that constitute the first dataset;
creating at the first storage tier a second representation of the first dataset backup, wherein the second representation corresponds to a second retention period that is longer than the first retention period and references the plurality of data units that constitute the first dataset;
replicating the second representation of the first dataset backup and the plurality of data units to a second storage tier from the first storage tier;
after detecting expiration of the first retention period, removing from the first storage tier those of the plurality of data units that are not referenced by a representation of another dataset backup; and
removing the first representation of the first dataset backup from the first storage tier.
Abstract
A storage tier manager creates different representations of a dataset backup for different retention periods. Each of the representations of the dataset backup is distinctly identifiable despite initially representing a same dataset backup. The representations are structured metadata corresponding to the dataset backup. One representation is a cached backup version of the dataset backup (“cached backup” or “cached representation”) provided for low latency access while residing at a storage tier of the backup appliance for a relatively short retention period according to a lifecycle management policy. The other representation is a cloud backup version of the dataset backup (“cloud backup” or “cloud representation”) provided for persisting into cloud storage for a longer retention period according to the lifecycle management policy.
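The two representations described above can be pictured as small metadata records that share the same data unit references but carry distinct identifiers and expiry times. The following is a minimal sketch under stated assumptions, not the patented implementation: the `Representation` fields, the `create_representations` helper, the policy dictionary, and the 7-day and 365-day retention values are all hypothetical names and numbers chosen for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
import uuid


@dataclass(frozen=True)
class Representation:
    """Structured metadata describing one dataset backup (hypothetical layout)."""
    rep_id: str             # distinct identifier for this representation
    kind: str               # "cached" or "cloud"
    backup_id: str          # the dataset backup that both representations describe
    data_unit_refs: tuple   # references (for example, fingerprints) to data units
    expires_at: datetime    # end of this representation's retention period


def create_representations(backup_id, data_unit_refs, policy, now):
    """Create a cached and a cloud representation of the same dataset backup."""
    refs = tuple(data_unit_refs)
    return tuple(
        Representation(
            rep_id=str(uuid.uuid4()),
            kind=kind,
            backup_id=backup_id,
            data_unit_refs=refs,
            expires_at=now + policy[kind],
        )
        for kind in ("cached", "cloud")
    )


# Assumed lifecycle management policy: short local retention, long cloud retention.
policy = {"cached": timedelta(days=7), "cloud": timedelta(days=365)}
now = datetime(2024, 1, 1, tzinfo=timezone.utc)
cached, cloud = create_representations("backup-001", ["fp-a", "fp-b"], policy, now)
```

Both records initially describe the same backup, yet each can be expired or replicated on its own because it has its own identifier and retention deadline.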
20 Claims
1. A method comprising:
after receiving, by a storage system that deduplicates data, a request to back up a first dataset, creating at a first storage tier corresponding to the deduplicated storage system a first representation of the first dataset backup, wherein the first representation corresponds to a first retention period and the first representation references a plurality of data units that constitute the first dataset;
creating at the first storage tier a second representation of the first dataset backup, wherein the second representation corresponds to a second retention period that is longer than the first retention period and references the plurality of data units that constitute the first dataset;
replicating the second representation of the first dataset backup and the plurality of data units to a second storage tier from the first storage tier;
after detecting expiration of the first retention period, removing from the first storage tier those of the plurality of data units that are not referenced by a representation of another dataset backup; and
removing the first representation of the first dataset backup from the first storage tier.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
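The steps of claim 1 can be walked through with a small in-memory model. This is only an illustrative sketch: the `Tier` class, the helper functions, and the `":first"`/`":second"` identifier scheme are hypothetical, data units are keyed by made-up fingerprints, and the retention timer is replaced by an explicit call. The claim does not say what happens to the local copy of the second representation, so the sketch simply leaves it in place.

```python
class Tier:
    """A storage tier holding representations and the data units they reference."""

    def __init__(self):
        self.representations = {}  # rep_id -> (backup_id, frozenset of fingerprints)
        self.data_units = {}       # fingerprint -> payload


def back_up(tier, backup_id, units):
    """Store data units once, then create two representations that reference them."""
    for fp, payload in units.items():
        tier.data_units.setdefault(fp, payload)  # deduplicated: keep one copy
    refs = frozenset(units)
    first, second = backup_id + ":first", backup_id + ":second"
    tier.representations[first] = (backup_id, refs)
    tier.representations[second] = (backup_id, refs)
    return first, second


def replicate(src, dst, rep_id):
    """Copy a representation and the data units it references to another tier."""
    backup_id, refs = src.representations[rep_id]
    dst.representations[rep_id] = (backup_id, refs)
    for fp in refs:
        dst.data_units.setdefault(fp, src.data_units[fp])


def expire_first(tier, first_rep_id):
    """After the first retention period: drop unshared data units, then the representation."""
    backup_id, refs = tier.representations[first_rep_id]
    still_needed = set()
    for other_backup, other_refs in tier.representations.values():
        if other_backup != backup_id:  # only other dataset backups keep units alive
            still_needed |= other_refs
    for fp in refs - still_needed:
        del tier.data_units[fp]
    del tier.representations[first_rep_id]


local, cloud = Tier(), Tier()
a_first, a_second = back_up(local, "A", {"fp1": b"x", "fp2": b"y"})
b_first, b_second = back_up(local, "B", {"fp2": b"y", "fp3": b"z"})
replicate(local, cloud, a_second)
expire_first(local, a_first)
```

In this run, backup B still references `fp2`, so only `fp1` is removed from the first tier, while the second tier retains both `fp1` and `fp2` for the longer retention period.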
11. One or more non-transitory machine readable media comprising program code for multi-tier data back up with multiple retention periods, the program code to:
create a plurality of representations of a dataset backup in a first storage tier for a plurality of retention periods associated with the dataset backup, wherein each of the plurality of representations comprises metadata about the dataset backup and references to a plurality of data units that constitute the dataset backup, wherein a first representation of the plurality of representations corresponds to the first storage tier and a first retention period of the plurality of retention periods;
replicate the plurality of data units and a second representation of the plurality of representations to a second storage tier, wherein the second storage tier corresponds to a second retention period of the plurality of retention periods; and
recover storage space occupied by those of the plurality of data units that are not referenced by a representation of another dataset backup at the first storage tier, after expiration of the first retention period.
Dependent claims: 12, 13, 14, 15, 16
17. A deduplicated storage system comprising:
a processor unit; and
a machine readable medium comprising program code executable by the processor unit to cause the deduplicated storage system to,
create a plurality of representations of a dataset backup in a first storage tier for a plurality of retention periods associated with the dataset backup, wherein each of the plurality of representations comprises metadata about the dataset backup and references to a plurality of data units that constitute the dataset backup, wherein a first representation of the plurality of representations corresponds to the first storage tier and a first retention period of the plurality of retention periods, wherein the first storage tier corresponds to the deduplicated storage system;
replicate the plurality of data units and a second representation of the plurality of representations to a second storage tier, wherein the second storage tier corresponds to a second retention period of the plurality of retention periods; and
recover storage space occupied by those of the plurality of data units that are not referenced by a representation of another dataset backup at the first storage tier, after expiration of the first retention period.
Dependent claims: 18, 19, 20
Specification