FAST DEDUPLICATION DATA VERIFICATION
First Claim
1. A networked information management system configured to verify deduplication information, the networked information management system comprising:
- a storage manager comprising computer hardware configured to;
retrieve, from an electronically stored deduplication database, a deduplication chunk table, wherein the deduplication chunk table identifies a first data chunk;
retrieve, from a secondary storage subsystem, a first single instance file (SFile) associated with the first data chunk, wherein the first SFile comprises a plurality of SFile containers that each store one or more data blocks;
for each SFile container in the plurality of SFile containers, perform a verification of the one or more data blocks of the respective SFile container; and
for each SFile container in the plurality of SFile containers, store a value representing a result of the verification of the one or more data blocks of the respective SFile container in association with an entry corresponding to the first data chunk in a chunk integrity table,wherein the stored values allow the storage manager to verify an integrity of data stored in the first data chunk.
2 Assignments
0 Petitions
Accused Products
Abstract
An information management system provides a data deduplication system that uses a primary table, a deduplication chunk table, and a chunk integrity table to ensure that a referenced deduplicated data block is only verified once during the data verification of a backup or other replication operation. The data deduplication system may reduce the computational and storage overhead associated with traditional data verification processes. The primary table, the deduplication chunk table, and the chunk integrity table, all of which are stored in a deduplication database, can also ensure synchronization between the deduplication database and secondary storage devices.
-
Citations
20 Claims
-
1. A networked information management system configured to verify deduplication information, the networked information management system comprising:
a storage manager comprising computer hardware configured to; retrieve, from an electronically stored deduplication database, a deduplication chunk table, wherein the deduplication chunk table identifies a first data chunk; retrieve, from a secondary storage subsystem, a first single instance file (SFile) associated with the first data chunk, wherein the first SFile comprises a plurality of SFile containers that each store one or more data blocks; for each SFile container in the plurality of SFile containers, perform a verification of the one or more data blocks of the respective SFile container; and for each SFile container in the plurality of SFile containers, store a value representing a result of the verification of the one or more data blocks of the respective SFile container in association with an entry corresponding to the first data chunk in a chunk integrity table, wherein the stored values allow the storage manager to verify an integrity of data stored in the first data chunk. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
12. A computer-implemented method for verifying deduplication information, the computer-implemented method comprising:
-
retrieving, from an electronically stored deduplication database, a deduplication chunk table, wherein the deduplication chunk table identifies a first data chunk; retrieving, from a secondary storage subsystem, a first single instance file (SFile) associated with the first data chunk, wherein the first SFile comprises a plurality of SFile containers that each store one or more data blocks; for each SFile container in the plurality of SFile containers, performing a verification of the one or more data blocks of the respective SFile container; and for each SFile container in the plurality of SFile containers, storing a value representing a result of the verification of the one or more data blocks of the respective SFile container in association with an entry corresponding to the first data chunk in a chunk integrity table, wherein the stored values allow a storage manager to verify an integrity of data stored in the first data chunk. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification