Systems and methods for creating copies of data, such as archive copies
First Claim
Patent Images
1. A method for rebuilding at least a portion of a signature database that reflects contents of an archive copy of a data set, comprising:
- receiving a substantially unique identifier for data objects within the data set;
storing the substantially unique identifiers in a signature database,wherein a substantially unique identifier for a data object reflects contents of the data object;
storing the data set as an archive copy having one or more data chunks,wherein each chunk is stored with header information that includes at least one substantially unique identifier;
receiving an indication that the signature database is unrecoverable or unavailable;
determining at least one substantially unique identifier within the header information; and
using the determined at least one substantially unique identifier from the header information in order to rebuild at least part of the signature database.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method of creating archive copies of data sets is described. In some examples, the system creates an archive copy from an original data set. In some examples, the system creates an archive copy when creating a recovery copy for a data set. In some examples, the system creates a copy without redundant data, and then encrypts the data set.
249 Citations
14 Claims
-
1. A method for rebuilding at least a portion of a signature database that reflects contents of an archive copy of a data set, comprising:
-
receiving a substantially unique identifier for data objects within the data set; storing the substantially unique identifiers in a signature database, wherein a substantially unique identifier for a data object reflects contents of the data object; storing the data set as an archive copy having one or more data chunks, wherein each chunk is stored with header information that includes at least one substantially unique identifier; receiving an indication that the signature database is unrecoverable or unavailable; determining at least one substantially unique identifier within the header information; and using the determined at least one substantially unique identifier from the header information in order to rebuild at least part of the signature database. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of rebuilding a deduplication index that reflects contents of an archive of data objects, the method comprising:
-
storing in a data file a copy of the data objects and hash values generated from the data objects, wherein a header region of the file stores the hash values, and wherein the data file is stored on sequential media; updating an entry in a deduplication index to reflect identification of the data objects, wherein the entry is updated using the hash values; upon receiving an indication that the deduplication index is unavailable or unrecoverable, accessing the hash value from the header region of the data file stored on the sequential media; and using the accessed hash value to rebuild a portion of a new, rebuilt version of the deduplication index. - View Dependent Claims (7, 8, 9)
-
-
10. At least one tangible, computer-readable medium, which when executed by at least one data processing device, rebuilds at least a portion of a single instancing index containing hash values that represent contents of a single instanced data set, comprising:
-
obtaining substantially unique hash values that represent the data set; storing at least some of the obtained hash values that represent the data set in a single instancing index, wherein storing the obtained hash values includes storing the obtained hash values within headers of one or more data files, and wherein the one or more data files form part of an archive file; receiving an indication that at least part of the single instancing index storing hash values that represent the data set is unrecoverable or unavailable; extracting stored hash value information from a header of at least one data file that forms part of the archive file; and
,adding the extracted hash value information to a new, rebuilt version of the single instancing index. - View Dependent Claims (11, 12, 13, 14)
-
Specification