Managing storage space based on multiple dataset backup versions
First Claim
1. A method comprising:
- after expiration of a retention period associated with a first backup for a first dataset in a first storage tier;
removing, by a computing device, from a listing of constituent data units of the first dataset a plurality of identifiers corresponding to a first set of the constituent data units that are shared with a second backup for a second dataset in the first storage tier;
marking, by the computing device, in the listing another plurality of identifiers corresponding to a second set of the constituent data units that are shared by a third backup for a third dataset in a second storage tier; and
removing, by the computing device, from the first storage tier the first and second sets of the constituent data units, after migrating the second set of the constituent data units to the second storage tier based on the marking.
1 Assignment
0 Petitions
Accused Products
Abstract
A storage tier manager creates different versions of a dataset backup for different retention periods. Each of the versions is distinctly identifiable despite initially representing a same dataset backup. One version can be referred to as a cached version of the dataset backup and another version can be referred to as a cloud version of the dataset backup. When the retention period expires for the cached version of the dataset backup, the storage tier manager migrates the cloud version of the dataset backup from the caching storage tier to the cloud storage tier. The storage tier manager can then recover storage space occupied by data that has been migrated, as long as that data is not shared with other cached versions of other dataset backups due to deduplication.
34 Citations
15 Claims
-
1. A method comprising:
after expiration of a retention period associated with a first backup for a first dataset in a first storage tier; removing, by a computing device, from a listing of constituent data units of the first dataset a plurality of identifiers corresponding to a first set of the constituent data units that are shared with a second backup for a second dataset in the first storage tier; marking, by the computing device, in the listing another plurality of identifiers corresponding to a second set of the constituent data units that are shared by a third backup for a third dataset in a second storage tier; and removing, by the computing device, from the first storage tier the first and second sets of the constituent data units, after migrating the second set of the constituent data units to the second storage tier based on the marking. - View Dependent Claims (2, 3, 4, 5)
-
6. A non-transitory machine-readable medium having stored thereon instructions for managing storage space based on multiple dataset backup versions comprising machine executable code that, when executed by at least one machine, causes the machine to:
after expiration of a retention period associated with a first backup for a first dataset in a first storage tier; remove from a listing of constituent data units of the first dataset a plurality of identifiers corresponding to a first set of the constituent data units that are shared with a second backup for a second dataset in the first storage tier; mark in the listing another plurality of identifiers corresponding to a second set of the constituent data units that are shared by a third backup for a third dataset in a second storage tier; and remove from the first storage tier the first and second sets of the constituent data units, after migrating the second set of the constituent data units to the second storage tier based on the marking. - View Dependent Claims (7, 8, 9, 10)
-
11. A computing device, comprising:
-
a memory containing a machine-readable medium comprising machine executable code having stored thereon instructions for managing storage space based on multiple dataset backup versions; and a processor coupled to the memory, the processor configured to execute the machine executable code to cause the processor to; after expiration of a retention period associated with a first backup for a first dataset in a first storage tier; remove from a listing of constituent data units of the first dataset a plurality of identifiers corresponding to a first set of the constituent data units that are shared with a second backup for a second dataset in the first storage tier; mark in the listing another plurality of identifiers corresponding to a second set of the constituent data units that are shared by a third backup for a third dataset in a second storage tier; and remove from the first storage tier the first and second sets of the constituent data units, after migrating the second set of the constituent data units to the second storage tier based on the marking. - View Dependent Claims (12, 13, 14, 15)
-
Specification