Load balancing across multiple data paths
4 Assignments
0 Petitions
Abstract
Multiple data paths may be available to a data management system for transferring data between a primary storage device and a secondary storage device. The data management system may gain operational advantages by load balancing across these data paths. The system may use application-layer characteristics of the data when transferring it from primary storage to backup storage during a backup operation and, correspondingly, from a secondary or backup storage system to a primary storage system during restoration.
357 Citations
20 Claims
1. An apparatus for backing up data from a primary storage to a secondary storage, the apparatus comprising:
at least one processor;
a module identifies data to be backed up from primary storage, including;
a first subset of the data to be backed up, which is tagged as an application-level characteristic of user data, and
a second subset of the data to be backed up, which is tagged as an application-level characteristics of index data or log data;
a module allocates a job for transferring the data from the primary storage to the secondary storage, wherein the secondary storage comprises multiple storage locations;
a module divides the job into multiple data streams, wherein each data stream of the multiple data streams has a corresponding data path for carrying data, and is associated with the first subset of the data or the second subset of the data based on the identified application-level characteristics tagged to the data, and wherein each data path is associated with a corresponding and different media agent and a corresponding and different destination storage location from the multiple storage locations of the secondary storage; and
a module stores, for each unit of data transferred from the primary storage to the secondary storage, information identifying a primary storage location for the unit of data and a destination storage location at which the unit of data is backed up; and
a module transfers the data to be backed up from the primary storage to the secondary storage, wherein the first subset of the data is transferred via first data streams of the multiple data streams that are associated with the first subset of the data based on the identified application-level characteristics of the data, and wherein the second subset of the data is transferred via second, different, data streams of the multiple data streams that are associated with the second subset of the data based on the identified application-level characteristics of the data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
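As a rough illustration of the job division recited in claim 1, the following Python sketch splits a backup job into streams by tag and records where each unit lands. It is not taken from the specification. The names `DataPath`, `Stream`, `classify`, and `divide_job`, the tag strings, and the policy of reserving the last path for index and log data are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from itertools import cycle

USER = "user"
METADATA = "metadata"  # stands in for index data or log data


@dataclass(frozen=True)
class DataPath:
    media_agent: str   # hypothetical media agent name
    destination: str   # hypothetical secondary storage location


@dataclass
class Stream:
    category: str
    path: DataPath
    units: list = field(default_factory=list)


def classify(tag):
    """Map an application-level tag to one of the two subsets."""
    return USER if tag == "user" else METADATA


def divide_job(units, paths):
    """Split a job into per-subset streams and build a location index.

    units: iterable of (primary_location, tag) pairs.
    paths: list of DataPath objects, one per stream.
    """
    if len({p.media_agent for p in paths}) != len(paths) or \
       len({p.destination for p in paths}) != len(paths):
        raise ValueError("each path needs its own media agent and destination")
    if len(paths) < 2:
        raise ValueError("need at least one path per subset")

    # Assumed policy: the last path carries index/log data, the rest user data.
    streams = {
        USER: [Stream(USER, p) for p in paths[:-1]],
        METADATA: [Stream(METADATA, paths[-1])],
    }
    pickers = {cat: cycle(group) for cat, group in streams.items()}

    index = {}  # primary location -> destination storage location
    for source, tag in units:
        stream = next(pickers[classify(tag)])  # round-robin within the subset
        stream.units.append(source)
        index[source] = stream.path.destination
    return streams[USER] + streams[METADATA], index
```

Under these assumptions, user data is spread round-robin over the user-data streams while index and log data travel on a separate stream, and the returned index plays the role of the stored per-unit location information.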
9. A method of backing up data from a primary storage to a secondary storage, the method comprising:
identifying a first subset of the data to be backed up and a second subset of the data to be backed up from the primary storage based on a value of an application level characteristic that is different for the first subset of the data and the second subset of the data, wherein the first subset of the data includes data tagged as user data, and wherein the second subset of the data includes data tagged as control data, index data or log data;
allocating a job for transferring the data from the primary storage to the secondary storage, wherein the secondary storage comprises a plurality of storage locations;
dividing the job into multiple streams, each stream having a corresponding data path for carriage of data, wherein each data path is associated with a corresponding media agent and a corresponding destination storage location from the plurality of storage locations of the secondary storage;
maintaining, for each unit of data transferred from the primary storage to the secondary storage, information identifying a primary storage location for the unit of data and a destination storage location at which the unit of data is backed up; and
transferring the data to be backed up using one or more streams for the first subset of data and one or more different streams for the second subset of data.
Dependent claims: 10, 11, 12, 13, 14, 15
16. A method of restoring data from a secondary storage to a primary storage, the data having been copied from the primary storage to the secondary storage in multiple subsets based on application-level characteristics tagged to data of each subset, the method comprising:
allocating a job for transferring the data from the secondary storage to the primary storage, wherein the secondary storage comprises a plurality of storage locations, and wherein each storage location stores at least one of the multiple subsets of the data;
determining, for each unit of data to be transferred from the secondary storage to the primary storage, information identifying a primary storage location for the unit of data to which the unit of data is to be restored;
dividing the job into multiple streams based on the application-level characteristics tagged to the data of each subset, wherein the multiple streams include at least one stream associated with data tagged with an application-level characteristic of user data and at least one stream associated with data tagged with an application-level characteristic of control data, index data, or log data, wherein each stream has a corresponding data path for carrying data, and wherein each data path is associated with a corresponding and different media agent and a corresponding and different storage location from the plurality of storage locations of the secondary storage;
assigning the data path for restoring one of the subset of the data stored at the corresponding storage location; and
restoring the multiple subsets of data, wherein at least two subsets of data are restored using two different data paths.
Dependent claims: 17, 18, 19, 20
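The restore flow of claim 16 can be pictured with a short Python sketch that plans one restore stream per secondary storage location. It is an illustration only, not the patented implementation. The function name `plan_restore`, the catalog layout, the `agent_for` mapping, and the simplification that each storage location holds exactly one subset are assumptions introduced here.

```python
def plan_restore(catalog, agent_for):
    """Plan one restore stream per secondary storage location.

    catalog: iterable of (primary_location, tag, storage_location) records,
             i.e. the per-unit location information kept at backup time.
    agent_for: dict mapping each storage location to its media agent
               (hypothetical names).
    """
    streams = {}
    for primary, tag, location in catalog:
        # Two subsets: user data versus control, index, or log data.
        category = "user" if tag == "user" else "metadata"
        stream = streams.setdefault(location, {
            "category": category,
            "storage_location": location,
            "media_agent": agent_for[location],
            "restore_to": [],
        })
        # Simplifying assumption: one subset per storage location.
        if stream["category"] != category:
            raise ValueError(f"{location} holds more than one subset")
        stream["restore_to"].append(primary)

    # Each data path must use a different media agent.
    agents = [s["media_agent"] for s in streams.values()]
    if len(set(agents)) != len(agents):
        raise ValueError("each data path needs its own media agent")
    return list(streams.values())
```

Each returned entry describes one data path (a media agent paired with a storage location) together with the primary locations its units should be restored to, so at least two subsets end up on two different paths whenever the catalog contains both kinds of data.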
Specification