BACKUP USING A CLIENT-SIDE SIGNATURE REPOSITORY IN A NETWORKED STORAGE SYSTEM
First Claim
1. A method of generating a backup data set for a client computing device by using a signature repository residing in a primary storage subsystem, the method comprising:
- for each respective client computing device of one or more client computing devices in a primary storage subsystem;
monitoring the storage of a plurality of files formed by data blocks generated by one or more software applications running on the respective client computing device,wherein the plurality of files are stored in a data store associated with the respective client computing device;
maintaining, by a repository agent executing on one or more processors in the primary storage subsystem, a repository indicating at least which data blocks of the monitored files are stored in the primary storage subsystem;
in response to instructions to create a secondary copy in a secondary storage subsystem of at least a subset of the plurality of files stored in a data store associated with a first client computing device of the one or more client computing devices,querying the repository to identify at least a first group of data blocks that form at least a portion of the subset of files and for which matching data blocks are not stored in the secondary storage subsystem;
identifying the location of the first group of data blocks within the primary storage subsystem; and
retrieving the first group of data blocks from one or more of the data stores associated with the one or more client computing devices.
4 Assignments
0 Petitions
Accused Products
Abstract
A storage system according to certain embodiments includes a client-side signature repository that includes information representative of a set of data blocks stored in primary storage. During copy or backup operations, the system can use the client-side signature repository to identify data blocks located in primary storage that are new or that have changed. The system can also use the client-side signature repository to identify multiple locations within primary storage where different instances of the data blocks are located.
53 Citations
20 Claims
-
1. A method of generating a backup data set for a client computing device by using a signature repository residing in a primary storage subsystem, the method comprising:
-
for each respective client computing device of one or more client computing devices in a primary storage subsystem; monitoring the storage of a plurality of files formed by data blocks generated by one or more software applications running on the respective client computing device, wherein the plurality of files are stored in a data store associated with the respective client computing device; maintaining, by a repository agent executing on one or more processors in the primary storage subsystem, a repository indicating at least which data blocks of the monitored files are stored in the primary storage subsystem; in response to instructions to create a secondary copy in a secondary storage subsystem of at least a subset of the plurality of files stored in a data store associated with a first client computing device of the one or more client computing devices, querying the repository to identify at least a first group of data blocks that form at least a portion of the subset of files and for which matching data blocks are not stored in the secondary storage subsystem; identifying the location of the first group of data blocks within the primary storage subsystem; and retrieving the first group of data blocks from one or more of the data stores associated with the one or more client computing devices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A storage system for generating a backup data set for a client computing device using a signature repository, the system comprising:
-
a repository indicating which data blocks are stored in a primary storage subsystem, the primary storage subsystem including one or more client computing devices each having an associated data store, wherein the data blocks are generated by one or more software applications running on the one or more client computing devices, a repository agent executing on one or more processors and configured to; maintain the repository; in response to instructions to create a secondary copy in a secondary storage subsystem of at least a subset of the plurality of files stored in the data store associated with a first client computing device of the one or more client computing devices, query the repository to identify at least a first group of data blocks that form at least a portion of the subset of files and for which matching data blocks are not stored in the secondary storage; and provide retrieval information usable to locate the first group of data blocks within the primary storage subsystem. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A computer-readable, non-transitory storage medium having one or more computer-executable modules for generating a backup data set for a client computing device, the one or more computer-executable modules comprising:
a first module in communication with one or more client computing devices and configured to; maintain a repository indicating data blocks that are stored in a primary storage subsystem, the primary storage subsystem comprising one or more client computing devices each having an associated data store, wherein the data blocks are generated by one or more software applications running on the one or more client computing devices; in response to instructions to create a secondary copy in a secondary storage subsystem of at least a subset of the plurality of files stored in the data store associated with a first client computing device of the plurality of client computing devices, query the repository to identify at least a first group of data blocks that form at least a portion of the subset of files and for which matching data blocks are not stored in the secondary storage subsystem; provide retrieval information usable to locate the first group of data blocks within the primary storage subsystem.
-
20. The storage medium of claim 25, wherein the first module is further configured to generate a copy operation index that identifies one or more data blocks that form the subset of files including the retrieved first group of data blocks.
Specification