SYSTEMS AND METHODS FOR CREATING COPIES OF DATA, SUCH AS ARCHIVE COPIES
First Claim
Patent Images
1. At least one data storage system that creates an archive copy of data originating from a file system, comprising:
- a hierarchical storage system havinga storage manager server computer, andat least one data store subsystem,wherein the storage manager server computer includes at least one storage policy for directing storage operations,wherein one or more client computers are coupled among the storage manager server computer and the data store subsystem, andwherein the hierarchical storage system comprises;
an archive copy creation subsystem, wherein the archive copy creation subsystem creates a copy of at least a subset of data from the file system based on the storage policy of the storage manager server computer,wherein the archive copy creation subsystem includes;
a data selection component, wherein the data selection component selects the subset of data to be copied based at least in part on the storage policy, wherein the selected subset of data is to be stored for a longer period of time than other data;
an indexing component, wherein the indexing component creates an index of content within the selected subset of data; and
a data adjustment component, communicatively coupled to the data selection component, wherein the data adjustment component
performs data compression of the selected subset of data;
performs data deduplication for the selected subset of data, or
performs both data compression and data deduplication for the selected subset of data.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method of creating archive copies of data sets is described. In some examples, the system creates an archive copy from an original data set. In some examples, the system creates an archive copy when creating a recovery copy for a data set. In some examples, the system creates a copy without redundant data, and then encrypts the data set.
82 Citations
17 Claims
-
1. At least one data storage system that creates an archive copy of data originating from a file system, comprising:
a hierarchical storage system having a storage manager server computer, and at least one data store subsystem, wherein the storage manager server computer includes at least one storage policy for directing storage operations, wherein one or more client computers are coupled among the storage manager server computer and the data store subsystem, and wherein the hierarchical storage system comprises; an archive copy creation subsystem, wherein the archive copy creation subsystem creates a copy of at least a subset of data from the file system based on the storage policy of the storage manager server computer, wherein the archive copy creation subsystem includes; a data selection component, wherein the data selection component selects the subset of data to be copied based at least in part on the storage policy, wherein the selected subset of data is to be stored for a longer period of time than other data; an indexing component, wherein the indexing component creates an index of content within the selected subset of data; and a data adjustment component, communicatively coupled to the data selection component, wherein the data adjustment component
performs data compression of the selected subset of data;
performs data deduplication for the selected subset of data, or
performs both data compression and data deduplication for the selected subset of data.- View Dependent Claims (2, 3, 4, 5, 6)
-
7. A method of archiving an original set of data created by a file system, the method comprising:
-
at a first time, identifying one or more redundant data objects within the original set of data and creating a copy of the original set of data that does not include the identified one or more redundant data objects, wherein the identifying is based on substantially unique identifiers created from data objects in the original set of data, wherein the substantially unique identifiers are based on the content of the data objects in the original set, and, wherein identifying includes searching an index of the substantially unique identifiers; at a second time after the first time, encrypting or compressing the copy of the original set of data; and at a third time after the first time and the second time, storing the copy of the original set of data as an archive copy, wherein the archive copy is stored in a data storage medium separate from the file system, wherein the archive copy includes at least one header file having at least some of the substantially unique identifiers, wherein the archive copy is to be stored for a longer period of time than the production copy of the set of data, wherein the archive copy is in a format that differs from a format of the production copy of the set of data, and wherein data within the archive copy can not be used by applications that created the set of data without first decrypting, decompressing or converting the data within the archive copy. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. At least one computer-readable medium storing instructions that when executed by at least one data processing device causes the creation of a copy of a production volume of data for archiving the production volume, comprising:
-
receiving or accessing two or more secondary copies of the production volume of the data, wherein the two or more secondary copies include multiple instances of one or more data objects within the volume of data; creating or accessing an index of substantially unique identifiers for each data object within the two or more secondary copies of the production volume; identifying multiple instances of the one or more data objects using the index of the substantially unique identifiers; storing the data from the two or more secondary copies into an archive copy, wherein the stored data includes only one instance for at least one data object having multiple instances, wherein the stored data includes at least part of the index of substantially unique identifiers in a header file of the archive copy; and encrypting or compressing the data stored within the archive copy, wherein the archive copy is for storing data for a longer period of time than the two or more secondary copies. - View Dependent Claims (15, 16, 17)
-
Specification