Stubbing systems and methods in a data replication environment
First Claim
1. A method of managing the storage of data in a computer network, the method comprising:
receiving data from a source system comprising computer hardware at a destination system comprising computer hardware including one or more processors, the data comprising at least one first stub file comprising information indicative of a location of additional data on a storage device;
storing the data at a destination storage device included with the destination system, wherein the destination storage device differs from the storage device;
identifying a first portion of the data for storage at a secondary storage device, wherein the secondary storage device differs from the storage device and the destination storage device;
providing a copy of the first portion of the data for storage in the secondary storage device;
generating a second stub file representative of the first portion of the data;
replacing the first portion of the data at the destination storage device with the second stub file; and
tagging the second stub file with an identifier that distinguishes stub files generated at the destination system from stub files received with the data from the source system, wherein the second stub file comprises information indicative of a location of the copy of the first portion of the data on the secondary storage device, and wherein the at least one first stub file is not tagged with the identifier.
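The steps recited above can be illustrated with a minimal sketch. It is not the patented implementation: it models the destination and secondary storage devices as in-memory dictionaries, and the names `Stub`, `DESTINATION_TAG`, `migrate`, and `should_migrate`, as well as the `secondary/` key prefix, are hypothetical. Skipping stubs that arrived from the source system is an assumption made for simplicity; the claim itself only requires that such stubs remain untagged.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Union

# Hypothetical tag value; the claim only requires "an identifier".
DESTINATION_TAG = "dest-generated"


@dataclass(frozen=True)
class Stub:
    """Placeholder recording where the stubbed data actually lives."""
    location: str               # location of the data on some other storage device
    tag: Optional[str] = None   # set only on stubs generated at the destination


Entry = Union[bytes, Stub]


def migrate(
    destination: Dict[str, Entry],
    secondary: Dict[str, bytes],
    should_migrate: Callable[[str, bytes], bool],
) -> List[str]:
    """Copy selected data to secondary storage and leave tagged stubs behind."""
    migrated = []
    for name, entry in list(destination.items()):
        if isinstance(entry, Stub):
            # Assumption: stubs replicated from the source are left as-is (untagged).
            continue
        if not should_migrate(name, entry):
            continue
        key = f"secondary/{name}"
        secondary[key] = entry  # provide a copy to secondary storage
        # Replace the data with a second stub that points at the copy and
        # carries the identifier marking it as destination-generated.
        destination[name] = Stub(location=key, tag=DESTINATION_TAG)
        migrated.append(name)
    return migrated
```

In this sketch a replicated stub such as `Stub("source-archive/c.doc")` keeps `tag=None`, so later operations can tell it apart from the stubs that `migrate` creates.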
Abstract
Stubbing systems and methods are provided for intelligent data management in a replication environment, such as by reducing the space occupied by replication data on a destination system. In certain examples, stub files or like objects replace migrated, de-duplicated or otherwise copied data that has been moved from the destination system to secondary storage. Access is further provided to the replication data in a manner that is transparent to the user and/or without substantially impacting the base replication process. In order to distinguish stub files representing migrated replication data from replicated stub files, priority tags or like identifiers can be used. Thus, when accessing a stub file on the destination system, such as to modify replication data or perform a restore process, the tagged stub files can be used to recall archived data prior to performing the requested operation so that an accurate copy of the source data is generated.
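The recall behavior described in the abstract might look like the following sketch, given under stated assumptions rather than as the patent's implementation. Storage is again modeled with dictionaries, and `Stub`, `DESTINATION_TAG`, and `prepare_restore` are hypothetical names. Only stubs carrying the tag are recalled from secondary storage. Untagged stubs are passed through unchanged, because they were already stubs on the source system and belong in an accurate copy of the source data.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Union

# Hypothetical tag value standing in for the "priority tag or like identifier".
DESTINATION_TAG = "dest-generated"


@dataclass(frozen=True)
class Stub:
    """Placeholder recording where the stubbed data actually lives."""
    location: str
    tag: Optional[str] = None   # set only on stubs generated at the destination


Entry = Union[bytes, Stub]


def prepare_restore(
    destination: Dict[str, Entry],
    secondary: Dict[str, bytes],
) -> Dict[str, Entry]:
    """Build a restore image in which tagged stubs are replaced by recalled data."""
    restored: Dict[str, Entry] = {}
    for name, entry in destination.items():
        if isinstance(entry, Stub) and entry.tag == DESTINATION_TAG:
            # Stub created at the destination: recall the archived copy first.
            restored[name] = secondary[entry.location]
        else:
            # Live data, or a stub replicated from the source: copy as-is.
            restored[name] = entry
    return restored
```

The function returns a new mapping and leaves `destination` untouched, so the space savings on the destination system are preserved after the restore image is built.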
18 Claims
8. A system for managing the storage of data in a computer network, the system comprising:
computer hardware configured to execute a data manager, the data manager configured to:
scan data located at a destination storage device to identify a first portion of the data to store at a secondary storage device, the data comprising at least a first stub file indicative of a portion of the data stored at a storage device that differs from the destination storage device and the secondary storage device;
provide a copy of the first portion of the data for storage in the secondary storage device;
generate a second stub file representative of the first portion of the data;
replace the first portion of the data at the destination storage device with the second stub file indicative of a location of the copy of the first portion of the data on the secondary storage device; and
tag the second stub file with an identifier that distinguishes stub files generated by the data manager from stub files indicative of portions of the data stored at the storage device, wherein the first stub file is not tagged with the identifier.
18. Non-transitory computer storage comprising instructions which, when executed, cause the computing system to perform steps comprising:
receiving data at a first system comprising computer hardware including one or more processors from a second system comprising computer hardware including one or more processors, the data comprising a first stub file comprising information indicative of a location of additional data on a separate storage device, wherein the separate storage device is not included with the first system;
storing the data at a first storage device included with the first system;
identifying a first portion of the data for storage at a second storage device, wherein the second storage device differs from the separate storage;
providing a copy of the first portion of the data for storage in the second storage device;
generating a second stub file representative of the first portion of the data;
replacing the first portion of the data at the first storage device with the second stub file; and
tagging the second stub file with an identifier that distinguishes stub files generated at the first system from stub files received with the data from the second system, wherein the second stub file comprises information indicative of a location of the copy of the first portion of the data on the second storage device, and wherein the first stub file is not tagged with the identifier.
Specification