Replication using deduplicated secondary copy data
First Claim
1. A system for replicating primary data from a source system to a destination system using secondary copy data in secondary storage, the system comprising:
- a source system comprising;
one or more source system client computing devices comprising hardware and one or more source system primary storage devices that store primary data, the source system primary data stored in a native format;
one or more source system secondary storage controller computers comprising hardware that create deduplicated secondary data with corresponding signature values when the number of writes of source system primary data on the source system primary storage devices exceed a predefined threshold, the deduplicated secondary data stored in one or more source system secondary storage devices in a secondary storage format that is different than the native format of the source system primary data;
at least one source system secondary storage replication agent comprising computer hardware, the source system secondary storage replication agent configured to generate one or more secondary data replication streams comprising replication command messages having timestamps associated therewith, the one or more secondary data replication streams further comprising the deduplicated secondary data, and embedded cone soon-dine denature values, the embedded corresponding signature values stored contiguously with the deduplicated secondary data in the one or more secondary data replication streams; and
a destination system comprising;
one or more destination client computing devices comprising hardware and one or more destination primary storage devices that store replicated destination system primary data in a native format;
at least one destination system secondary storage replication agent comprising computer hardware, the destination system secondary storage replication agent receives the one or more secondary data replication streams, the destination system secondary storage replication agent configured to create a replicated deduplicated secondary copy based on the deduplicated secondary data and the embedded corresponding signature values stored contiguously with the deduplicated secondary data, the replicated deduplicated secondary copy is stored in one or more destination system secondary storage devices m the secondary storage format; and
at least one destination data agent configured to apply the replication command messages received in the one or more secondary data replication streams to a restored version, of the replicated deduplicated secondary copy based on the timestamps to create the replicated destination system primary data consistent with the source system primary data based on the timestamps, the replicated destination system primary data stored in the one or more destination primary storage devices in the native format of the primary data.
2 Assignments
0 Petitions
Accused Products
Abstract
An information management system according to certain aspects uses backup copies or other secondary copies of production data for the purposes of replicating production data to another client. The secondary copies can be deduplicated copies. By utilizing available secondary copies of the data for replication, the system can reduce the impact on the production machines associated with replication. Utilizing deduplicated copies not only reduces the amount of stored data, but also reduces the amount of data that is communicated between the source and the destination, increasing the speed of the replication process.
-
Citations
20 Claims
-
1. A system for replicating primary data from a source system to a destination system using secondary copy data in secondary storage, the system comprising:
-
a source system comprising; one or more source system client computing devices comprising hardware and one or more source system primary storage devices that store primary data, the source system primary data stored in a native format; one or more source system secondary storage controller computers comprising hardware that create deduplicated secondary data with corresponding signature values when the number of writes of source system primary data on the source system primary storage devices exceed a predefined threshold, the deduplicated secondary data stored in one or more source system secondary storage devices in a secondary storage format that is different than the native format of the source system primary data; at least one source system secondary storage replication agent comprising computer hardware, the source system secondary storage replication agent configured to generate one or more secondary data replication streams comprising replication command messages having timestamps associated therewith, the one or more secondary data replication streams further comprising the deduplicated secondary data, and embedded cone soon-dine denature values, the embedded corresponding signature values stored contiguously with the deduplicated secondary data in the one or more secondary data replication streams; and a destination system comprising; one or more destination client computing devices comprising hardware and one or more destination primary storage devices that store replicated destination system primary data in a native format; at least one destination system secondary storage replication agent comprising computer hardware, the destination system secondary storage replication agent receives the one or more secondary data replication streams, the destination system secondary storage replication agent configured to create a replicated deduplicated secondary copy based on the deduplicated secondary data and the embedded corresponding signature values stored contiguously with the deduplicated secondary data, the replicated deduplicated secondary copy is stored in one or more destination system secondary storage devices m the secondary storage format; and at least one destination data agent configured to apply the replication command messages received in the one or more secondary data replication streams to a restored version, of the replicated deduplicated secondary copy based on the timestamps to create the replicated destination system primary data consistent with the source system primary data based on the timestamps, the replicated destination system primary data stored in the one or more destination primary storage devices in the native format of the primary data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of replicating primary data from a source system to a destination system using secondary copy data in secondary storage, the method comprising:
-
creating with one or more source secondary storage controllers deduplicated secondary data that is a copy of source system primary data on one or more source system primary storage devices when the number of writes of the source system primary data on the source system primary storage devices exceed a predefined threshold, the source system primary data stored in a native format and the deduplicated secondary data stored in a backup format that is different than the native format of the source system primary data; sending with a source system secondary storage replication agent comprising computer hardware, one or more secondary data replication streams comprising replication command messages having timestamps associated therewith, the one or more secondary data replication streams further comprising the deduplicated secondary data and embedded corresponding signature values, the embedded corresponding signature voices stored contiguously with the deduplicated secondary data in the one or more secondary data replication streams receiving the one or more secondary data replication streams with at least one destination system secondary storage replication agent comprising computer hardware, creating a replicated deduplicated secondary copy based on the deduplicated secondary data and the embedded corresponding signature values stored contiguously with the deduplicated secondary data, and storing the replicated deduplicated secondary COPY In one or more destination system secondary storage devices in the secondary storage format; and creating a restored version of the replicated deduplicated secondary copy and applying, with a destination data agent, the replication command messages received m the one or more secondary data replication streams to the restored version of the replicated deduplicated secondary copy based on the timestamps to create replicated destination system primary data consistent with the source system primary data based on the timestamps, the replicated destination system primary data stored in the one or more destination primary storage devices in the native formal of the source system primary data. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification