Efficient deduplication database validation
Abstract
According to certain aspects, a method can include receiving an indication that a restoration of a deduplication database using a secondary copy of a file associated with a secondary copy job is complete; retrieving a first data fingerprint from a data storage database, wherein the first data fingerprint is associated with the secondary copy job used to restore the deduplication database; retrieving a second data fingerprint from a deduplication database media agent, wherein the second data fingerprint is associated with the secondary copy job used to restore the deduplication database; comparing the first data fingerprint with the second data fingerprint to determine whether the first data fingerprint and the second data fingerprint match; and transmitting an instruction to the deduplication database media agent to rebuild the restored deduplication database in response to a determination that the first data fingerprint and the second data fingerprint do not match.
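The comparison step in the abstract can be illustrated with a minimal sketch. It assumes, following the claims, that a fingerprint carries a count of unique data blocks and a count of references to blocks stored in other secondary copies. The names `Fingerprint`, `needs_rebuild`, `unique_blocks`, and `external_refs` are hypothetical and chosen only for illustration; the source does not specify any data layout or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fingerprint:
    """Per-job summary. Field names are illustrative, not from the source."""
    job_id: str
    unique_blocks: int   # unique data blocks stored in the job's secondary copy
    external_refs: int   # references to blocks stored in other secondary copies


def needs_rebuild(stored: Fingerprint, reported: Fingerprint) -> bool:
    """Return True when the two fingerprints for the same job do not match."""
    if stored.job_id != reported.job_id:
        raise ValueError("fingerprints describe different secondary copy jobs")
    return (stored.unique_blocks, stored.external_refs) != (
        reported.unique_blocks,
        reported.external_refs,
    )


# Hypothetical values: one fingerprint as recorded in the data storage
# database, one as reported by the deduplication database media agent.
stored = Fingerprint("job-42", unique_blocks=1200, external_refs=300)
reported = Fingerprint("job-42", unique_blocks=1200, external_refs=295)

if needs_rebuild(stored, reported):
    print("mismatch: instruct the media agent to rebuild the deduplication database")
```

Comparing two small counts is cheap relative to re-reading the secondary copy, which is presumably why the rebuild is triggered only on a mismatch.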
20 Claims
1. A system configured to validate a database including deduplication information, the system comprising:
- one or more computing devices each having one or more hardware processors and configured to:
determine that a current version of a first database is invalid, the first database including information about a set of deduplication data blocks, wherein the deduplication data blocks are used to create deduplicated secondary copies on one or more secondary storage devices;
obtain a previous version of the first database;
retrieve a first data fingerprint from a second database, the second database associated with a storage management computer, the first data fingerprint corresponding to a secondary copy job which resulted in the generation of a secondary copy of a file, the first data fingerprint comprising (i) an indication of a first count of unique data blocks stored in the secondary copy of the file and (ii) an indication of a first count of references stored in the secondary copy of the file that reference data blocks stored in other secondary copies;
generate a second data fingerprint using the obtained previous version of the first database;
compare the first data fingerprint with the second data fingerprint; and
in response to a determination that the first data fingerprint and the second data fingerprint do not match, rebuild the first database using the secondary copy of the file.
Dependent claims: 2, 3, 4, 5, 6
7. A computer-implemented method for validating a database including deduplication information, the computer-implemented method comprising:
determining that a current version of a first database is invalid, the first database including information about a set of deduplication data blocks, wherein the deduplication data blocks are used to create deduplicated secondary copies on one or more secondary storage devices;
obtaining a previous version of the first database;
retrieving a first data fingerprint from a second database, the second database associated with a storage management computer, the first data fingerprint corresponding to a secondary copy job which resulted in the generation of a secondary copy of a file, the first data fingerprint comprising (i) an indication of a first count of unique data blocks stored in the secondary copy of the file and (ii) an indication of a first count of references stored in the secondary copy of the file that reference data blocks stored in other secondary copies;
generating a second data fingerprint using the obtained previous version of the first database;
comparing the first data fingerprint with the second data fingerprint; and
in response to a determination that the first data fingerprint and the second data fingerprint do not match, rebuilding the first database using the secondary copy of the file.
Dependent claims: 8, 9, 10, 11, 12, 13, 14
15. A system configured to validate a database including deduplication information, the system comprising:
- one or more computing devices each having one or more hardware processors and configured to:
receive an indication that a restoration of a previous version of a first database is complete, the first database including deduplication information;
obtain a first data fingerprint of a plurality of data fingerprints, wherein the first data fingerprint is associated with a secondary copy job which resulted in generation of a first secondary copy of a file, wherein the first data fingerprint comprises an indication of a first count of unique data blocks stored in the first secondary copy and an indication of a first count of references stored in the first secondary copy that reference data blocks stored in other secondary copies;
obtain a second data fingerprint, wherein the second data fingerprint is derived from the restored previous version of the first database;
compare the first data fingerprint with the second data fingerprint to determine whether the first data fingerprint and the second data fingerprint match; and
in response to a determination that the first data fingerprint and the second data fingerprint do not match, transmit an instruction to rebuild the first database using the secondary copy of the file.
Dependent claims: 16, 17, 18, 19, 20
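As a rough illustration of how a second fingerprint might be derived from a restored deduplication database and checked against the expected one, the sketch below models the database as a list of block entries. This model is an assumption made for the example: `BlockEntry`, `generate_fingerprint`, `validate_or_rebuild`, and the `rebuild` callback are hypothetical names, and the claims do not prescribe how the database records block ownership or references.

```python
from dataclasses import dataclass, field


@dataclass
class BlockEntry:
    """One deduplicated block in a toy model of the deduplication database."""
    block_hash: str
    stored_by_job: str                      # job whose secondary copy holds the block
    referenced_by_jobs: set = field(default_factory=set)  # jobs that point at it


def generate_fingerprint(ddb, job_id):
    """Return (unique block count, external reference count) for one job."""
    unique = sum(1 for e in ddb if e.stored_by_job == job_id)
    refs = sum(
        1
        for e in ddb
        if e.stored_by_job != job_id and job_id in e.referenced_by_jobs
    )
    return unique, refs


def validate_or_rebuild(expected, restored_ddb, job_id, rebuild):
    """Compare fingerprints; call rebuild(job_id) on mismatch. Returns validity."""
    if generate_fingerprint(restored_ddb, job_id) != expected:
        rebuild(job_id)
        return False
    return True


# Hypothetical restored database: job B stores one block and references
# one block that was stored by job A.
restored = [
    BlockEntry("h1", "job-A", {"job-B"}),
    BlockEntry("h2", "job-A"),
    BlockEntry("h3", "job-B"),
]
rebuilt = []  # records which jobs triggered a rebuild

# Expected fingerprint says job B stored 2 blocks, so validation fails.
ok = validate_or_rebuild((2, 1), restored, "job-B", rebuilt.append)
```

In this toy model the rebuild itself is left to the caller, mirroring the claim language in which the system transmits an instruction to rebuild rather than performing the rebuild in the comparison step.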
Specification