Automatic scaling of resource instance groups within compute clusters
First Claim
1. A method, comprising:
- performing, by one or more computers;
detecting that a trigger condition has been met during execution of a distributed application on a cluster of computing resource instances, wherein the cluster comprises two or more non-overlapping instance groups and each instance group comprises a respective one or more computing resource instances; and
in response to said detecting, performing an automatic scaling operation on a particular instance group of the non-overlapping instance groups, wherein the particular instance group on which to perform the automatic scaling operation is determined prior to detecting the trigger condition, wherein determination of the particular instance group is based at least in part on input received from a client of a distributed computing system that includes the cluster, wherein the automatic scaling operation changes the number of computing resource instances on the particular instance group without changing the number of computing resource instances on at least another one of the two or more instance groups.
0 Assignments
0 Petitions
Accused Products
Abstract
A service provider may apply customer-selected or customer-defined auto-scaling policies to a cluster of resources (e.g., virtualized computing resource instances or storage resource instances in a MapReduce cluster). Different policies may be applied to different subsets of cluster resources (e.g., different instance groups containing nodes of different types or having different roles). Each policy may define an expression to be evaluated during execution of a distributed application, a scaling action to take if the expression evaluates true, and an amount by which capacity should be increased or decreased. The expression may be dependent on metrics emitted by the application, cluster, or resource instances by default, metrics defined by the client and emitted by the application, or metrics created through aggregation. Metric collection, aggregation and rules evaluation may be performed by a separate service or by cluster components. An API may support auto-scaling policy definition.
35 Citations
20 Claims
-
1. A method, comprising:
performing, by one or more computers; detecting that a trigger condition has been met during execution of a distributed application on a cluster of computing resource instances, wherein the cluster comprises two or more non-overlapping instance groups and each instance group comprises a respective one or more computing resource instances; and in response to said detecting, performing an automatic scaling operation on a particular instance group of the non-overlapping instance groups, wherein the particular instance group on which to perform the automatic scaling operation is determined prior to detecting the trigger condition, wherein determination of the particular instance group is based at least in part on input received from a client of a distributed computing system that includes the cluster, wherein the automatic scaling operation changes the number of computing resource instances on the particular instance group without changing the number of computing resource instances on at least another one of the two or more instance groups. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A distributed computation system, comprising:
-
one or more computers that comprise at least a processor and a memory and that implement a cluster that comprises two or more non-overlapping instance groups of one or more computing resource instances, wherein the distributed computation system is to; detect that a trigger condition has been met during execution of a distributed application on the cluster of computing resource instances; and in response to detection that the trigger condition has been met, perform an automatic scaling operation on a particular instance group of the non-overlapping instance groups, wherein the particular instance group on which to perform the automatic scaling operation is determined prior to detecting the trigger condition, wherein determination of the particular instance group is based at least in part on input received from a client of a distributed computing system that includes the cluster, wherein the automatic scaling operation changes the number of computing resource instances on the particular instance group without changing the number of computing resource instances on at least another one of the two or more instance groups. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-accessible storage medium storing program instructions that when executed on one or more computers cause the one or more computers to:
-
detect that a trigger condition has been met during execution of a distributed application on a cluster of computing resource instances, wherein the cluster comprises two or more non-overlapping instance groups and each instance group comprises a respective one or more computing resource instances; and in response to said detection, perform an automatic scaling operation on a particular instance group of the instance groups, wherein the particular instance group on which to perform the automatic scaling operation is determined prior to detecting the trigger condition, wherein determination of the particular instance group is based at least in part on input received from a client of a distributed computing system that includes the cluster, wherein the automatic scaling operation changes the number of computing resource instances on the particular instance group without changing the number of computing resource instances on at least another one of the two or more instance groups. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification