DECENTRALIZING A HADOOP CLUSTER
First Claim
1. A distributed file system comprising:
data nodes that each include one or more data blocks;
a director unit including master nodes configured in a peer-to-peer distributed architecture and operably coupled to the data nodes, each of the master nodes configured to:
receive a task related to managing data with respect to the distributed file system;
conduct a master node load balancing operation among at least some of the master nodes;
based on the load balancing operation, manage a distribution of the task among one or more of the master nodes and one or more of the data nodes; and
communicate a task status and a status of at least a part of the distributed file system to each of the master nodes not distributed the task.
Abstract
In some examples, a distributed file system is described. The distributed file system may include multiple data nodes and a director unit. The multiple data nodes may each include one or more data blocks. The director unit may include multiple master nodes configured in a peer-to-peer distributed architecture and operably coupled to the multiple data nodes. Each of the master nodes may be configured to receive a task related to managing data with respect to the distributed file system; to manage a distribution of the task among one or more of the plurality of master nodes; and to communicate a task status and a status of at least a part of the distributed file system to each of the other master nodes.
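The status sharing described in the abstract can be illustrated with a minimal sketch. It assumes an in-memory model in which each master node holds direct references to its peers; the names `MasterNode`, `receive_task`, `known_status`, and `make_cluster` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class MasterNode:
    """Hypothetical master node in a peer-to-peer director unit."""
    name: str
    peers: list = field(default_factory=list)
    known_status: dict = field(default_factory=dict)  # task id -> status text

    def receive_task(self, task_id: str) -> None:
        # Record the task locally, then communicate its status to every
        # other master so that no single node is the only one that knows.
        status = f"running on {self.name}"
        self.known_status[task_id] = status
        for peer in self.peers:
            peer.known_status[task_id] = status


def make_cluster(names):
    """Build masters that all know each other (no central coordinator)."""
    nodes = [MasterNode(n) for n in names]
    for node in nodes:
        node.peers = [p for p in nodes if p is not node]
    return nodes


masters = make_cluster(["m1", "m2", "m3"])
masters[0].receive_task("t-1")
for m in masters:
    print(m.name, m.known_status)
```

Because every master ends up with the same status view in this sketch, any of them could answer a status query if the receiving master became unavailable.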
40 Claims
1. A distributed file system comprising:
data nodes that each include one or more data blocks;
a director unit including master nodes configured in a peer-to-peer distributed architecture and operably coupled to the data nodes, each of the master nodes configured to:
receive a task related to managing data with respect to the distributed file system;
conduct a master node load balancing operation among at least some of the master nodes;
based on the load balancing operation, manage a distribution of the task among one or more of the master nodes and one or more of the data nodes; and
communicate a task status and a status of at least a part of the distributed file system to each of the master nodes not distributed the task.
Dependent claims: 2, 5, 6, 8, 9, 10, 11, 13, 15, 16, 19
3-4. (canceled)
7. (canceled)
12. (canceled)
14. (canceled)
17-18. (canceled)
20. (canceled)
21. A method of managing a distributed file system comprising:
receiving a task related to managing data with respect to the distributed file system;
selecting one or more master nodes to manage the task, the master nodes configured in a peer-to-peer distributed architecture and operably coupled to data nodes;
conducting a master node load balancing operation among at least some of the master nodes;
based on the load balancing operation, managing the distribution of the task among the data nodes and master nodes, the managing performed by the selected master node; and
communicating among at least some of the master nodes a task status and a status of at least a part of the distributed file system.
Dependent claims: 22, 23, 25, 26, 27, 28, 30, 31, 32, 33, 35
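The claim does not say how the load balancing operation chooses a master node. As one possible reading, the sketch below assumes a least-loaded policy over a count of active tasks per master; the functions `select_master` and `assign` and the load metric are illustrative assumptions, not the claimed method.

```python
def select_master(loads: dict) -> str:
    """Return the master with the fewest active tasks (ties broken by name).

    Assumption: load is a simple task count; the claim leaves the metric open.
    """
    return min(sorted(loads), key=lambda name: loads[name])


def assign(task_id: str, loads: dict, assignments: dict) -> str:
    """Pick a master for the task and record the choice."""
    chosen = select_master(loads)
    loads[chosen] += 1
    assignments[task_id] = chosen
    return chosen


loads = {"m1": 2, "m2": 0, "m3": 1}
assignments = {}
for task in ["t-1", "t-2", "t-3"]:
    assign(task, loads, assignments)

print(assignments)
print(loads)
```

Starting from uneven loads, the three assignments in this example bring every master to the same task count.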
24. (canceled)
29. (canceled)
34. (canceled)
36. (canceled)
37. A method of managing a distributed file system comprising:
receiving a task related to managing data with respect to the distributed file system;
selecting one or more master nodes to manage the task, the master nodes configured in a peer-to-peer distributed architecture and operably coupled to data nodes;
conducting a master node load balancing operation among at least some of the master nodes;
based on the load balancing operation, managing the distribution of the task among the data nodes and the master nodes,
when the task includes accessing a file from the distributed file system, the managing including:
receiving the task including file metadata that identifies the file;
attempting to find the file based on the file metadata;
distributing the task to another of the master nodes when the file is not found; and
transmitting system metadata indicating a location of the file and transmitting a success message when the file is found, and
when the task includes a computing task, the managing includes:
receiving the computing task from the selected master node;
obtaining master node-specific results by performing the computing task locally;
determining whether a portion of the master nodes has received the computing task;
transmitting the master node-specific results and the computing task to a second master node of the master nodes when the portion of the master nodes has not received the computing task; and
transmitting the master node-specific results to the first master node when the portion of the master nodes has received the computing task.
Dependent claims: 38, 39, 40
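The two branches of this claim (file access and computing task) can be sketched as follows. The sketch assumes the master nodes are arranged in a ring, that "a portion of the master nodes" means all of them, and that handing results back to the first master is modeled by a return value; the class `Master` and its methods are hypothetical names chosen for illustration.

```python
class Master:
    """Hypothetical master node holding part of the file-location metadata."""

    def __init__(self, name, files):
        self.name = name
        self.files = files  # file name -> data node location
        self.next = None    # next peer in an assumed ring of masters

    def find_file(self, file_name, hops=0, ring_size=1):
        # File found: return its location together with a success flag.
        if file_name in self.files:
            return {"ok": True, "master": self.name,
                    "location": self.files[file_name]}
        # File not found: pass the task on, unless every master was tried.
        if hops + 1 >= ring_size:
            return {"ok": False}
        return self.next.find_file(file_name, hops + 1, ring_size)

    def compute(self, func, results=None, visited=0, ring_size=1):
        # Run the computing task locally to get node-specific results.
        results = (results or []) + [(self.name, func(self.files))]
        visited += 1
        # Forward results and task until the assumed portion (all masters)
        # has received the task, then hand everything back to the caller.
        if visited < ring_size:
            return self.next.compute(func, results, visited, ring_size)
        return results


def make_ring(specs):
    ring = [Master(name, files) for name, files in specs]
    for i, master in enumerate(ring):
        master.next = ring[(i + 1) % len(ring)]
    return ring


ring = make_ring([("m1", {"a.txt": "dn1"}), ("m2", {"b.txt": "dn2"}), ("m3", {})])
found = ring[0].find_file("b.txt", ring_size=len(ring))
missing = ring[0].find_file("zzz", ring_size=len(ring))
counts = ring[0].compute(len, ring_size=len(ring))
print(found, missing, counts)
```

Here `len` stands in for an arbitrary computing task: each master counts the files it knows about, and the accumulated per-master results come back to the master that started the chain.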
Specification