Systems, methods, and devices for dynamic resource monitoring and allocation in a cluster system
First Claim
Patent Images
1. A hadoop computer cluster comprising:
- one or more processors of a master node, wherein the master node comprises a supervisor controller; and
one or more processors of a plurality of computing system nodes, the one or more processors of the plurality of computing system nodes configured to perform computing processes on received tasks, wherein each computing system node comprises an agent controller,wherein each agent controller is configured to monitor system resources utilization by one or more tasks of the respective computing system node and to transmit information describing the monitored system resources utilization to the supervisor controller,wherein the supervisor controller is configured to assign an additional task to a first computing system node based on determining that the utilization of electronic random access memory of the first computing system node is below a first threshold level, the determination based at least in part on the information describing the monitored system resources utilization, transmitted from the agent controller of the first computing system node,wherein the supervisor controller is further configured to reduce the use of the electronic random access memory of the second computing system node by a task assigned to the second computing system node based on determining that utilization of the electronic random access memory of the second computing system node is above a second threshold level, the determination based at least in part on the information describing the monitored system resources utilization, transmitted from the agent controller of the second computing system node, andwherein the master node and the plurality of computing system nodes include a computer processor and an electronic storage medium.
1 Assignment
0 Petitions
Accused Products
Abstract
In an embodiment, the systems, methods, and devices disclosed herein comprise a computer resource monitoring and allocation system. In an embodiment, the resource monitoring and allocation system can be configured to allocate computer resources that are available on various nodes of a cluster to specific jobs and/or sub-jobs and/or tasks and/or processes.
41 Citations
30 Claims
-
1. A hadoop computer cluster comprising:
-
one or more processors of a master node, wherein the master node comprises a supervisor controller; and one or more processors of a plurality of computing system nodes, the one or more processors of the plurality of computing system nodes configured to perform computing processes on received tasks, wherein each computing system node comprises an agent controller, wherein each agent controller is configured to monitor system resources utilization by one or more tasks of the respective computing system node and to transmit information describing the monitored system resources utilization to the supervisor controller, wherein the supervisor controller is configured to assign an additional task to a first computing system node based on determining that the utilization of electronic random access memory of the first computing system node is below a first threshold level, the determination based at least in part on the information describing the monitored system resources utilization, transmitted from the agent controller of the first computing system node, wherein the supervisor controller is further configured to reduce the use of the electronic random access memory of the second computing system node by a task assigned to the second computing system node based on determining that utilization of the electronic random access memory of the second computing system node is above a second threshold level, the determination based at least in part on the information describing the monitored system resources utilization, transmitted from the agent controller of the second computing system node, and wherein the master node and the plurality of computing system nodes include a computer processor and an electronic storage medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A supervisor controller configured to dynamically manage assignment of tasks in a hadoop computer cluster, the supervisor controller comprising:
-
a management controller interface configured to communicate with a management controller to access data representing an assignment of a plurality of tasks across a plurality of computer system nodes in the computer cluster; an agent controller interface configured to communicate with an agent controller operating on a first computing system node to receive information describing utilization of system resources on the first computing system node, wherein the agent controller interface is further configured to communicate with an agent controller operating on a second computing system node to receive information describing utilization of system resources on the second computing system node; and a system resource allocation engine configured to assign an additional task to the first computing system node based on determining that the utilization of electronic random access memory of the first computing system node is below a first threshold level, the determination based at least in part on the information describing utilization of system resources on the first computing system node, wherein one or more computer processors and one or more electronic storage medium are configured to operate the supervisor controller, and wherein the system resource allocation engine is further configured to reduce the use of electronic random access memory of the second computing system node by a task assigned to the second computing system node based on determining that the utilization of the electronic random access memory of the second computing system node is above a second threshold level, the determination based at least in part on the information describing utilization of system resources on the second computing system node. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A supervisor controller configured to dynamically manage assignment of tasks in a hadoop computer cluster, the supervisor controller comprising:
-
a management controller interface configured to communicate with a management controller to access data representing an assignment of a plurality of tasks across a plurality of computer system nodes in the computer cluster; an agent controller interface configured to communicate with an agent controller operating on a first computing system node to receive information describing utilization of system resources on the first computing system node, wherein the agent controller interface is further configured to communicate with an agent controller operating on a second computing system node to receive information describing utilization of system resources on the second computing system node; and a system resource allocation engine configured to assign an additional task to the first computing system node based on determining that the utilization of a CPU of the first computing system node is below a first threshold level, the determination based at least in part on the information describing utilization of system resources on the first computing system node, wherein one or more computer processors and one or more electronic storage medium are configured to operate the supervisor controller, and wherein the system resource allocation engine is further configured to reduce the use of a CPU of the second computing system node by a task assigned to the second computing system node based on determining that the utilization of the CPU of the second computing system node is above a second threshold level, the determination based at least in part on the information describing utilization of system resources on the second computing system node. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification