BACKUP AND RESTORATION FOR A DEDUPLICATED FILE SYSTEM
First Claim
1. A method of backing up data from a source file system of a computer device to a backup data storage system, the method comprising:
- checking whether a source file is stored on the source file system in a deduplicated format;
when the checking indicates that the source file is stored in a deduplicated format, then—
determining a block size value used to store the source file in the deduplicated format;
accessing, without a file read/write assistance from an operating system running on the computer device, a local deduplication database to determine a location of a first data chunk of the source file stored in the deduplicated format on a local storage device; and
,backing up the source file by accessing and selectively transferring the first data chunk and successive data chunks of the source file by;
transferring a given data chunk, if the local deduplication database indicates that the given data chunk was not deduplicated; and
transferring a deduplication record, without transferring the given data chink, if the local deduplication database indicates that the given data chunk was deduplicated; and
when the check indicates that the source file is stored without deduplication on the computer device, then backing up the source file by transferring data chunks of the source file to the backup data storage system and performing deduplication on the data chunks of the source file.
4 Assignments
0 Petitions
Accused Products
Abstract
The disclosed techniques that can use deduplication information on a source computer platform to improve the process of performing data backups or restoration from/to the computer platform. In one example aspect, a data backup operation can re-use some of the work already done by a source computer'"'"'s deduplication system. For example, a storage operation could read a deduplication database on the source computer platform to determine the duplicativeness of a given data chunk being transferred to a backup storage system, without having to perform computations such as data chunk hashing and comparison with previously generated hashes. The technique may additionally or alternatively reuse hashes generated by the source computer during deduplication of the data file on the source computer'"'"'s file system during deduplication at the external backup storage system.
-
Citations
18 Claims
-
1. A method of backing up data from a source file system of a computer device to a backup data storage system, the method comprising:
-
checking whether a source file is stored on the source file system in a deduplicated format; when the checking indicates that the source file is stored in a deduplicated format, then— determining a block size value used to store the source file in the deduplicated format; accessing, without a file read/write assistance from an operating system running on the computer device, a local deduplication database to determine a location of a first data chunk of the source file stored in the deduplicated format on a local storage device; and
,backing up the source file by accessing and selectively transferring the first data chunk and successive data chunks of the source file by; transferring a given data chunk, if the local deduplication database indicates that the given data chunk was not deduplicated; and transferring a deduplication record, without transferring the given data chink, if the local deduplication database indicates that the given data chunk was deduplicated; and when the check indicates that the source file is stored without deduplication on the computer device, then backing up the source file by transferring data chunks of the source file to the backup data storage system and performing deduplication on the data chunks of the source file. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for backing up data from a source file system of a computer device to a backup data storage system, the apparatus comprising:
-
means for checking whether a source file is stored on the source file system in a deduplicated format; means for, when the checking indicates that the source file is stored in a deduplicated format, then— determining a block size value used to store the source file in the deduplicated format; accessing, without a file read/write assistance from an operating system running on the computer device, a local deduplication database to determine a location of a first data chunk of the source file stored in the deduplicated format on a local storage device; and
,backing up the source file by accessing and selectively transferring the first data chunk and successive data chunks of the source file by; transferring a given data chunk, if the local deduplication database indicates that the given data chunk was not deduplicated; and transferring a deduplication record, without transferring the given data chink, if the local deduplication database indicates that the given data chunk was deduplicated; and means for, when the check indicates that the source file is stored without deduplication on the computer device, then backing up the source file by transferring data chunks of the source file to the backup data storage system and performing deduplication on the data chunks of the source file. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A method of restoring data from a data storage system to a source file system that can store files in a deduplicated format, the method comprising:
-
determining a block size value used to restore a copied file in the deduplicated format; for each data chunk of the copied file; if the data chunk was not previously transmitted during the restoration, then; transferring the data chunk to the source file system, and updating a deduplication database at the source file system with a pointer a location where the data chunk is transferred to; and if the data chunk was previously transmitted during the restoration, then; updating the deduplication database at the source file system without transferring the data chunk with a deduplication entry. - View Dependent Claims (14, 15)
-
-
16. A computer program product comprising a computer-readable medium having instructions stored thereon, the instructions, when executed by a processor, causing the processor to implement method of restoring data from a data storage system to a source file system that can store files in a deduplicated format, the method comprising:
-
determining a block size value used to restore a copied file in the deduplicated format; for each data chunk of the copied file; if the data chunk was not previously transmitted during the restoration, then; transferring the data chunk to the source file system, and updating a deduplication database at the source file system with a pointer a location where the data chunk is transferred to; and if the data chunk was previously transmitted during the restoration, then; updating the deduplication database at the source file system without transferring the data chunk with a deduplication entry. - View Dependent Claims (17, 18)
-
Specification