Data management and backup of distributed storage environment
First Claim
1. A distributed system for monitoring data stored in a distributed computing environment, the distributed system comprising one or more processors:
a control agent implemented on a control computing system; and
a plurality of data access nodes implemented, respectively, on a plurality of computing systems, wherein the control agent is programmed to:
access configuration information for a distributed storage environment from a configuration node of the distributed storage environment, wherein the distributed storage environment comprises a plurality of storage nodes;
determine a distributed file system type for the distributed storage environment based at least in part on the configuration information;
determine a total number of storage nodes within the distributed storage environment based at least in part on the configuration information;
make a determination on how to assign the plurality of data access nodes to back up the plurality of storage nodes by distributing the plurality of data access nodes among the plurality of storage nodes based at least in part on the distributed file system type and the total number of storage nodes;
assign the plurality of data access nodes to the plurality of storage nodes based on the determination; and
back up the plurality of storage nodes to the plurality of data access nodes.
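The determination step recited above can be sketched in code. The following is only an illustrative sketch under stated assumptions, not the claimed implementation: the `ClusterConfig` type, the `STORAGE_NODES_PER_ACCESS_NODE` policy table, the file system labels, and the round-robin distribution are all hypothetical choices made for the example. The claim requires only that the assignment depend at least in part on the distributed file system type and the total number of storage nodes.

```python
import math
from dataclasses import dataclass

# Hypothetical policy: how many storage nodes one data access node may cover,
# keyed by file system type. The labels and ratios are assumptions.
STORAGE_NODES_PER_ACCESS_NODE = {"hdfs": 2, "gpfs": 4}
DEFAULT_RATIO = 1


@dataclass(frozen=True)
class ClusterConfig:
    """Hypothetical view of what the configuration node reports."""
    fs_type: str
    storage_nodes: tuple


def plan_assignments(config, access_nodes):
    """Return a mapping: data access node -> storage nodes it backs up."""
    if not access_nodes:
        raise ValueError("at least one data access node is required")

    # Inputs named in the claim: file system type and total storage node count.
    total = len(config.storage_nodes)
    ratio = STORAGE_NODES_PER_ACCESS_NODE.get(config.fs_type, DEFAULT_RATIO)

    # Use only as many access nodes as the assumed policy calls for,
    # capped by how many are actually available.
    needed = max(1, math.ceil(total / ratio))
    active = access_nodes[:needed]

    # Distribute storage nodes round-robin among the active access nodes.
    plan = {node: [] for node in active}
    for index, storage_node in enumerate(config.storage_nodes):
        plan[active[index % len(active)]].append(storage_node)
    return plan
```

For example, with five storage nodes of the assumed type `"hdfs"` and four available data access nodes, this sketch activates three access nodes and spreads the five storage nodes across them.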
Abstract
In certain systems disclosed herein, a distributed data monitoring and management system is provided that can replicate a distributed storage environment. The distributed data monitoring and management system can intelligently and automatically configure data access nodes to form a structure that matches the distributed storage environment. By matching the structure of the distributed storage environment, the distributed structure of the data may be maintained, enabling the data to be backed up from and/or restored to the distributed storage environment and/or migrated to another distributed storage environment without altering the distribution of the data. Further, embodiments herein enable the transfer of data from a non-distributed environment to a distributed storage environment. Thus, in some cases, an entity can migrate data from a local storage structure to a network-based distributed storage structure.
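One way to picture how matching the structure keeps the data distribution intact is a backup that records which storage node each block came from. The sketch below is an assumption-laden illustration rather than the disclosed design: the dictionary layout and the `back_up`, `restore`, and `node_map` names are hypothetical.

```python
def back_up(cluster, plan):
    """Copy each storage node's blocks to its assigned data access node.

    cluster: storage node -> list of blocks (hypothetical layout)
    plan:    data access node -> storage nodes it covers
    """
    backup = {}
    for access_node, storage_nodes in plan.items():
        # Keep blocks grouped by their source storage node.
        backup[access_node] = {s: list(cluster[s]) for s in storage_nodes}
    return backup


def restore(backup, node_map=None):
    """Rebuild the per-node layout; node_map optionally renames storage
    nodes, which models migration to another distributed environment."""
    node_map = node_map or {}
    restored = {}
    for per_node in backup.values():
        for storage_node, blocks in per_node.items():
            target = node_map.get(storage_node, storage_node)
            restored[target] = list(blocks)
    return restored
```

Because the grouping by storage node survives the round trip, restoring without a `node_map` reproduces the original layout, and supplying one moves each node's blocks to a corresponding node in the target environment.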
130 Citations
20 Claims
1. A distributed system for monitoring data stored in a distributed computing environment, the distributed system comprising one or more processors:
a control agent implemented on a control computing system; and
a plurality of data access nodes implemented, respectively, on a plurality of computing systems, wherein the control agent is programmed to:
access configuration information for a distributed storage environment from a configuration node of the distributed storage environment, wherein the distributed storage environment comprises a plurality of storage nodes;
determine a distributed file system type for the distributed storage environment based at least in part on the configuration information;
determine a total number of storage nodes within the distributed storage environment based at least in part on the configuration information;
make a determination on how to assign the plurality of data access nodes to back up the plurality of storage nodes by distributing the plurality of data access nodes among the plurality of storage nodes based at least in part on the distributed file system type and the total number of storage nodes;
assign the plurality of data access nodes to the plurality of storage nodes based on the determination; and
back up the plurality of storage nodes to the plurality of data access nodes.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer-implemented method of monitoring data stored in a distributed computing environment, the computer-implemented method comprising:
as implemented by a control agent comprising one or more hardware processors and configured with specific computer-executable instructions,
accessing configuration information for a distributed storage environment from a configuration node of the distributed storage environment, wherein the distributed storage environment comprises a plurality of storage nodes;
determining a distributed file system type for the distributed storage environment based at least in part on the configuration information;
determining a total number of storage nodes within the distributed storage environment based at least in part on the configuration information;
determining how to assign a plurality of data access nodes to back up the plurality of storage nodes by distributing the plurality of data access nodes among the plurality of storage nodes based at least in part on the distributed file system type and the total number of storage nodes, wherein the plurality of data access nodes are implemented, respectively, on a plurality of computing systems;
assigning the plurality of data access nodes to the plurality of storage nodes based on the determining how to assign the plurality of data access nodes; and
backing up the plurality of storage nodes to the plurality of data access nodes.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification