Generating backup sets to a specific point in time
First Claim
1. A computer-implemented method, in a data processing system, for generating a backup set, comprising:
- receiving a request for generation of a backup set from two or more existing backups, wherein the request includes a point in time for which the backup set is to be generated, wherein the point in time is a point in time prior to a current time, and wherein using the point in time that is prior to the current time causes the backup set that is to be generated from the two or more existing backups to be a collection of data objects from the two or more existing backups that would have been the most recently backed-up version of the data processing system'"'"'s data objects at the point in time that is prior to the current time as though the backup set had been created at the point in time that is prior to the current time;
identifying versions of the data objects in the two or more existing backups in a backup storage system that would have been a most recently backed-up version of the data objects at the point in time that is prior to the current time as though the backup set had been created at that point in time that is prior to the current time, wherein identifying the versions of the data objects in the two or more existing backups in the backup storage system comprises;
identifying a first candidate set of the versions of the data objects in the two or more existing backups in the backup storage system that have backup timestamps equal to or prior to the point in time, wherein identifying the first candidate set of versions of the data objects, further comprises;
retrieving backup catalog information for the versions of the data objects from the two or more existing backups; and
searching entries in the backup catalog information to identify entries having a backup timestamp that is equal to or prior to the point in time; and
identifying a second candidate set of the versions of the data objects in the first candidate set that have deactivation times on or before the point in time, wherein identifying the second candidate set of versions of the data objects in the first candidate set, further comprises;
searching the entries in the first candidate set of the backup catalog information to identify entries having a deactivation time that is on or before the point in time;
storing the versions of data objects that are identified in the first candidate set as active and are not identified in the second candidate set as being deactivated on or before the point in time as the backup set for the point in time that is prior to the current time;
searching the versions of the data objects in the two or more existing backups in the backup storage system to identify data objects having a backup timestamp that is older than a retention time parameter;
determining, for each version of a data object, in the versions of the data objects in the two or more existing backups, having a backup timestamp older than the retention time parameter, whether the version is an active version of the data object; and
deleting the version of the data object from the two or more existing backups in response to the version of the data object is not an active version of the data object.
1 Assignment
0 Petitions
Accused Products
Abstract
Generating backup sets for a specific point in time from backup data stored in a backup storage device is provided. A backup set, in accordance with an illustrative embodiment, is a collection of the files and data that would have been the most recently backed-up version of a computer'"'"'s files and data at the particular point in time if the backup set had been created at that point in time. In order to generate the backup set, versions of data objects are identified in the backup storage system that have backup timestamps equal to or prior to the designated point in time and that have deactivation times that are after the designated point in time. In addition, a mechanism for deleting versions of data objects based on a retention time and an inactive version limit is provided.
-
Citations
10 Claims
-
1. A computer-implemented method, in a data processing system, for generating a backup set, comprising:
-
receiving a request for generation of a backup set from two or more existing backups, wherein the request includes a point in time for which the backup set is to be generated, wherein the point in time is a point in time prior to a current time, and wherein using the point in time that is prior to the current time causes the backup set that is to be generated from the two or more existing backups to be a collection of data objects from the two or more existing backups that would have been the most recently backed-up version of the data processing system'"'"'s data objects at the point in time that is prior to the current time as though the backup set had been created at the point in time that is prior to the current time; identifying versions of the data objects in the two or more existing backups in a backup storage system that would have been a most recently backed-up version of the data objects at the point in time that is prior to the current time as though the backup set had been created at that point in time that is prior to the current time, wherein identifying the versions of the data objects in the two or more existing backups in the backup storage system comprises; identifying a first candidate set of the versions of the data objects in the two or more existing backups in the backup storage system that have backup timestamps equal to or prior to the point in time, wherein identifying the first candidate set of versions of the data objects, further comprises; retrieving backup catalog information for the versions of the data objects from the two or more existing backups; and searching entries in the backup catalog information to identify entries having a backup timestamp that is equal to or prior to the point in time; and identifying a second candidate set of the versions of the data objects in the first candidate set that have deactivation times on or before the point in time, wherein identifying the second candidate set of versions of the data objects in the first candidate set, further comprises; searching the entries in the first candidate set of the backup catalog information to identify entries having a deactivation time that is on or before the point in time; storing the versions of data objects that are identified in the first candidate set as active and are not identified in the second candidate set as being deactivated on or before the point in time as the backup set for the point in time that is prior to the current time; searching the versions of the data objects in the two or more existing backups in the backup storage system to identify data objects having a backup timestamp that is older than a retention time parameter; determining, for each version of a data object, in the versions of the data objects in the two or more existing backups, having a backup timestamp older than the retention time parameter, whether the version is an active version of the data object; and deleting the version of the data object from the two or more existing backups in response to the version of the data object is not an active version of the data object. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product comprising a non-transitory computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
-
receive a request for generation of a backup set from two or more existing backups, wherein the request includes a point in time for which the backup set is to be generated, wherein the point in time is a point in time prior to a current time, and wherein using the point in time that is prior to the current time causes the backup set that is to be generated from the two or more existing backups to be a collection of data objects from the two or more existing backups that would have been the most recently backed-up version of the data processing system'"'"'s data objects at the point in time that is prior to the current time as though the backup set had been created at the point in time that is prior to the current time; identify versions of the data objects in the two or more existing backups in a backup storage system that would have been a most recently backed-up version of the data objects at the point in time that is prior to the current time as though the backup set had been created at that point in time that is prior to the current time, wherein the computer readable program causes the computing device to identify the versions of the data objects in the two or more existing backups in the backup storage system by; identifying a first candidate set of the versions of the data objects in the two or more existing backups in the backup storage system that have backup timestamps equal to or prior to the point in time, wherein identifying the first candidate set of versions of the data objects, further comprises; retrieving backup catalog information for the versions of the data objects from the two or more existing backups; and searching entries in the backup catalog information to identify entries having a backup timestamp that is equal to or prior to the point in time; and identifying a second candidate set of the versions of the data objects in the first candidate set that have deactivation times on or before the point in time, wherein identifying the second candidate set of versions of the data objects in the first candidate set, further comprises; searching the entries in the first candidate set of the backup catalog information to identify entries having a deactivation time that is on or before the point in time; store the versions of data objects that are identified in the first candidate set as active and are not identified in the second candidate set as being deactivated on or before the point in time as the backup set for the point in time that is prior to the current time; searching the versions of the data objects in the two or more existing backups in the backup storage system to identify data objects having a backup timestamp that is older than a retention time parameter; determining, for each version of a data object, in the versions of the data objects in the two or more existing backups, having a backup timestamp older than the retention time parameter, whether the version is an active version of the data object; and deleting the version of the data object from the two or more existing backups in response to the version of the data object is not an active version of the data object. - View Dependent Claims (8)
-
-
9. A system, comprising:
-
a processor; a memory coupled to the processor; and a storage device coupled to the processor, wherein the memory stores instructions which, when executed by the processor, cause the processor to; receive a request for generation of a backup set from two or more existing backups, wherein the request includes a point in time for which the backup set is to be generated, wherein the point in time is a point in time prior to a current time, and wherein using the point in time that is prior to the current time causes the backup set that is to be generated from the two or more existing backups to be a collection of data objects from the two or more existing backups that would have been the most recently backed-up version of the data processing system'"'"'s data objects at the point in time that is prior to the current time as though the backup set had been created at the point in time that is prior to the current time; identify versions of the data objects in the two or more existing backups in a backup storage system that would have been a most recently backed-up version of the data objects at the point in time that is prior to the current time as though the backup set had been created at that point in time that is prior to the current time, wherein the instructions cause the processor to identify the versions of the data objects in the two or more existing backups in the backup storage system comprises; identifying a first candidate set of the versions of the data objects in the two or more existing backups in the backup storage system that have backup timestamps equal to or prior to the point in time, wherein identifying the first candidate set of versions of the data objects, further comprises; retrieving backup catalog information for the versions of the data objects from the two or more existing backups; and searching entries in the backup catalog information to identify entries having a backup timestamp that is equal to or prior to the point in time; and identifying a second candidate set of the versions of the data objects in the first candidate set that have deactivation times on or before the point in time, wherein identifying the second candidate set of versions of the data objects in the first candidate set, further comprises; searching the entries in the first candidate set of the backup catalog information to identify entries having a deactivation time that is on or before the point in time; store the versions of data objects that are identified in the first candidate set as active and are not identified in the second candidate set as being deactivated on or before the point in time as the backup set for the point in time that is prior to the current time; searching the versions of the data objects in the two or more existing backups in the backup storage system to identify data objects having a backup timestamp that is older than a retention time parameter; determining, for each version of a data object, in the versions of the data objects in the two or more existing backups, having a backup timestamp older than the retention time parameter, whether the version is an active version of the data object; and deleting the version of the data object from the two or more existing backups in response to the version of the data object is not an active version of the data object. - View Dependent Claims (10)
-
Specification