Method and system of producing a full backup image using an incremental backup method
First Claim
1. A method for performing backups of a data set, wherein the data set comprises a plurality of data items, the method comprising:
- initiating a partial backup of the data set, wherein said partial backup includes less than all of the data set;
wherein as part of said partial backup, the method comprises;
conveying a first subset of the plurality of data items to a deduplication engine, wherein the first subset comprises data items that are new or have changed since a previous backup of the data set;
generating an extents catalog for the partial backup which fully describes all of the data set;
retrieving references associated with a second subset of the plurality of data items from the extents catalog, wherein the second subset comprises data items that are unchanged since a previous backup of the data set; and
conveying references associated with the second subset of data items to the deduplication engine, wherein said references comprise identifications of one or more previous images corresponding to previous full or partial backups of the data set;
at the deduplication engine;
generating fingerprints for the first subset of the plurality of data items;
creating a reference map and storing the fingerprints in said reference map; and
receiving references associated with the second subset of data items and storing said references in the reference map.
7 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and medium for performing incremental backups of a data set within a deduplication storage system. An incremental backup of a data set is initiated, and the changed data items of the data set are deduplicated. An extents catalog is created to fully describe the data set. References associated with the unchanged data items of the data set are retrieved from the extents catalog and then sent to a deduplication engine. The deduplication engine uses the references to retrieve fingerprints associated with the unchanged data items from a previous reference map. Then, the deduplication engine creates a new reference map for the incremental backup and stores the newly generated and retrieved fingerprints in the new reference map.
-
Citations
17 Claims
-
1. A method for performing backups of a data set, wherein the data set comprises a plurality of data items, the method comprising:
-
initiating a partial backup of the data set, wherein said partial backup includes less than all of the data set; wherein as part of said partial backup, the method comprises; conveying a first subset of the plurality of data items to a deduplication engine, wherein the first subset comprises data items that are new or have changed since a previous backup of the data set; generating an extents catalog for the partial backup which fully describes all of the data set; retrieving references associated with a second subset of the plurality of data items from the extents catalog, wherein the second subset comprises data items that are unchanged since a previous backup of the data set; and conveying references associated with the second subset of data items to the deduplication engine, wherein said references comprise identifications of one or more previous images corresponding to previous full or partial backups of the data set; at the deduplication engine; generating fingerprints for the first subset of the plurality of data items; creating a reference map and storing the fingerprints in said reference map; and receiving references associated with the second subset of data items and storing said references in the reference map. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A non-transitory computer readable storage medium comprising program instructions to perform backups of a data set, wherein the data set comprises a plurality of data items, and wherein when executed the program instructions are operable to:
-
initiate a partial backup of the data set, wherein said partial backup includes less than all of the data set; wherein as part of said partial backup, the program instructions are executable to; convey a first subset of the plurality of data items to a deduplication engine, wherein the first subset comprises data items that are new or have changed since a previous backup of the data set; generate an extents catalog for the partial backup which fully describes all of the data set; retrieve references associated with a second subset of the plurality of data items from the extents catalog, wherein the second subset comprises data items that are unchanged since a previous backup of the data set; and convey references associated with the second subset of data items to the deduplication engine, wherein said references comprise identifications of one or more previous images corresponding to previous full or partial backups of the data set; at the deduplication engine; generate fingerprints for the first subset of the plurality of data items; create a reference map and store the fingerprints in said reference map; and receive references associated with the second subset of data items and storing said references in the reference map. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A system for performing backups of a data set, wherein the data set comprises a plurality of data items, the system comprising:
-
a server; and one or more storage devices coupled to the server; wherein the server is configured to; initiate a partial backup of the data set, wherein said partial backup includes less than all of the data set; wherein as part of said partial backup, the system is configured to; convey a first subset of the plurality of data items to a deduplication engine, wherein the first subset comprises data items that are new or have changed since a previous backup of the data set; generate an extents catalog for the partial backup which fully describes all of the data set; retrieve references associated with a second subset of the plurality of data items from the extents catalog, wherein the second subset comprises data items that are unchanged since a previous backup of the data set; and convey references associated with the second subset of data items to the deduplication engine, wherein said references comprise identifications of one or more previous images corresponding to previous full or partial backups of the data set; wherein the server is further configured to; generate fingerprints for the first subset of the plurality of data items; create a reference map and store the fingerprints in said reference map; and receive references associated with the second subset of data items and storing said references in the reference map. - View Dependent Claims (14, 15, 16, 17)
-
Specification