Centralized configuration and monitoring of a distributed computing cluster
First Claim
1. A system to manage and configure a computing cluster including a catalog of hosts, the system comprising:
- a plurality of agents deployed to the catalog of hosts,wherein the agents are configured to start an in-memory process on each of the catalog of hosts to aggregate statistics associated with each of the catalog of hosts,wherein, to aggregate the statistics, the agents are configured to perform a plurality of tests suitable for one or more of;
(1) a distributed file storage system jointly operated among the catalog of hosts, (2) a distributed data processing system jointly operated among the catalog of hosts, or (3) a distributed database management system jointly operated among the catalog of hosts,wherein the plurality of tests, including a canary test, are configured with one or more configurable thresholds, andwherein the agents are further configured to transmit the aggregated statistics and a plurality of heartbeat signals to a server; and
the server, having a memory and a processor, coupled over a network to the computing cluster, wherein the server, when in operation,tracks or updates a data model associated with the catalog of hosts in the computing cluster, wherein the data model specifies one or more of;
services assigned to each of the hosts, configurations assigned to each of the hosts, or configurations regarding the deployed agents;
provides a user interface for inputting configuration settings that are to be deployed among the catalog of hosts in the computing cluster;
configures the agents deployed to the hosts in the computing cluster based on the configuration settings, wherein the configuration settings include the one or more configurable thresholds;
tracks the aggregated statistics received from the agents deployed to the hosts over time; and
depicts on the user interface results from said tracking, wherein the user interface includes a chart that illustrates how the aggregated statistics for a select host changes over time.
5 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for centralized configuration and monitoring of a distributed computing cluster are disclosed. One embodiment of the disclose technology enables deployment and central operation a complete Hadoop stack. The application automates the installation process and reduces deployment time from weeks to minutes. One embodiment further provides a cluster-wide, real time view of the services running and the status of the host machines in a cluster via a single, central place to enact configuration changes across the computing cluster which further incorporates reporting and diagnostic tools to optimize cluster performance and utilization.
161 Citations
17 Claims
-
1. A system to manage and configure a computing cluster including a catalog of hosts, the system comprising:
-
a plurality of agents deployed to the catalog of hosts, wherein the agents are configured to start an in-memory process on each of the catalog of hosts to aggregate statistics associated with each of the catalog of hosts, wherein, to aggregate the statistics, the agents are configured to perform a plurality of tests suitable for one or more of;
(1) a distributed file storage system jointly operated among the catalog of hosts, (2) a distributed data processing system jointly operated among the catalog of hosts, or (3) a distributed database management system jointly operated among the catalog of hosts,wherein the plurality of tests, including a canary test, are configured with one or more configurable thresholds, and wherein the agents are further configured to transmit the aggregated statistics and a plurality of heartbeat signals to a server; and the server, having a memory and a processor, coupled over a network to the computing cluster, wherein the server, when in operation, tracks or updates a data model associated with the catalog of hosts in the computing cluster, wherein the data model specifies one or more of;
services assigned to each of the hosts, configurations assigned to each of the hosts, or configurations regarding the deployed agents;provides a user interface for inputting configuration settings that are to be deployed among the catalog of hosts in the computing cluster; configures the agents deployed to the hosts in the computing cluster based on the configuration settings, wherein the configuration settings include the one or more configurable thresholds; tracks the aggregated statistics received from the agents deployed to the hosts over time; and depicts on the user interface results from said tracking, wherein the user interface includes a chart that illustrates how the aggregated statistics for a select host changes over time. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification