Multi-threaded scanning of distributed file systems
Abstract
Embodiments disclosed herein address the need to more efficiently back up a network-based storage environment that may be remote from a primary storage environment. For example, embodiments herein can provide a more efficient backup of storage managed by a third-party entity. To improve the backup process, embodiments herein may optimize the number of scanning threads used to identify files that are to be backed up. For example, the characteristics of the network and/or the network storage system may be analyzed to determine a number of scanning threads that enables faster scanning of the network storage system without overburdening, or being slowed down by, the network between the network-based storage environment and the primary storage environment.
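The trade-off described in the abstract can be illustrated with a small heuristic. This is only a sketch under stated assumptions, not the method disclosed in the patent: the names `NetworkProfile` and `select_scan_threads`, the per-request result size, the bandwidth share, and the thread cap are all hypothetical. It assumes each scanning thread is latency-bound (one request per round trip) and that scan results may consume only a fixed share of the link.

```python
from dataclasses import dataclass


@dataclass
class NetworkProfile:
    """Hypothetical measurements of the link to the network storage system."""
    bandwidth_mbps: float  # usable bandwidth, megabits per second
    round_trip_ms: float   # average round-trip latency, milliseconds


def select_scan_threads(net, result_kbits_per_request=64.0,
                        bandwidth_share=0.5, max_threads=64):
    """Pick a thread count that shortens the scan without saturating the link.

    Assumption: more threads always shorten the scan, so the count is raised
    until the results they return would exceed the allowed bandwidth share.
    """
    if net.bandwidth_mbps <= 0 or net.round_trip_ms <= 0:
        return 1  # no usable measurement: fall back to a single thread

    # A latency-bound thread completes about 1000 / RTT requests per second.
    requests_per_sec = 1000.0 / net.round_trip_ms
    # Result traffic one thread generates, in kilobits per second.
    per_thread_kbps = requests_per_sec * result_kbits_per_request
    # Share of the link that scan results are allowed to use.
    budget_kbps = net.bandwidth_mbps * 1000.0 * bandwidth_share

    threads = int(budget_kbps // per_thread_kbps)
    return max(1, min(max_threads, threads))
```

With these assumed defaults, a 100 Mbps link at 20 ms round-trip time yields 15 threads, a very slow link falls back to a single thread, and a very fast link is capped at `max_threads`.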
19 Claims
1. A computer-implemented method of performing multithreaded scanning of a network storage system, the computer-implemented method comprising:
   as implemented by a data agent executing within a client computing device, wherein the client computing device is configured in an information management system;
   - determining network characteristics of a network used by the client computing device to communicate with a network storage system that is remote from the information management system and stores data accessible by the client computing device, wherein the data is associated with the data agent,
   - selecting an amount of scanning threads to initiate based at least in part on the network characteristics of the network, wherein the scanning threads are configured to scan a network storage repository of the network storage system to identify files to backup that are stored at the network storage repository,
   - triggering initiation of the amount of the scanning threads at the network storage system, and
   - initiating scanning of the network storage repository using the scanning threads to identify the files to backup;
   wherein to select the amount of scanning threads, the data agent balances an amount of time to complete the scanning of the network storage repository using the scanning threads with an amount of network resources to communicate results of the scanning.
   Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system for performing multi-threaded scanning of a network storage system, the system comprising:
   an information management system implemented in computer hardware, wherein a client computing device in the information management system communicates over a network with a network storage system that comprises data files accessible by the client computing device, and wherein the client computing device executes a data agent that is associated with the data files;
   wherein the data agent is configured to:
   - determine network characteristics of the network used by the client computing device to communicate with the network storage system;
   - select an amount of scanning threads to initiate based at least in part on the network characteristics of the network, wherein the scanning threads are configured to scan a network storage repository of the network storage system to identify data files to backup that are stored at the network storage repository,
   - trigger initiation of the amount of the scanning threads at the network storage system, and
   - initiate scanning of the network storage repository using the scanning threads to identify the files to backup; and
   wherein to select the amount of scanning threads, the data agent is configured to balance an amount of time to complete the scanning of the network storage repository using the scanning threads with an amount of network resources to communicate results of the scanning.
   Dependent claims: 14, 15, 16, 17, 18, 19
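The scanning step recited in claims 1 and 13 can be sketched as a multi-threaded directory walk. This is an illustrative assumption rather than the claimed implementation: a local directory tree stands in for the network storage repository, "files to backup" is taken to mean files modified after a given timestamp, and the function names `scan_directory` and `scan_repository` are hypothetical.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def scan_directory(path, since):
    """Scan one directory; return (subdirectories, files modified after `since`)."""
    subdirs, changed = [], []
    with os.scandir(path) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                subdirs.append(entry.path)
            elif entry.is_file(follow_symlinks=False):
                # Assumed backup criterion: modification time newer than `since`.
                if entry.stat().st_mtime > since:
                    changed.append(entry.path)
    return subdirs, changed


def scan_repository(root, since, num_threads):
    """Walk `root` level by level, scanning each level's directories in parallel."""
    to_backup = []
    frontier = [root]
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        while frontier:
            # Every directory at the current depth is scanned by the pool.
            results = list(pool.map(lambda p: scan_directory(p, since), frontier))
            frontier = []
            for subdirs, changed in results:
                frontier.extend(subdirs)   # directories to scan in the next pass
                to_backup.extend(changed)  # files identified for backup
    return sorted(to_backup)
```

Here `num_threads` is whatever amount of scanning threads was selected beforehand. Against a real remote repository each `scan_directory` call would be a network round trip, which is why the thread count affects both scan time and network load.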
Specification