Prioritization-based scaling of computing resources
First Claim
Patent Images
1. A computer-implemented method, comprising:
- receiving, by one or more configured computing systems of a computing resource service, a prioritization policy from a customer of the computing resource service for use in scaling a group of computing nodes provided to the customer by the computing resource service, wherein each of a plurality of scaling policies specifies one or more associated changes to be performed to the group if criteria are satisfied for the scaling policy that include at least one resource utilization metric exceeding a threshold, and wherein the prioritization policy specifies priorities between the plurality of scaling policies;
receiving, by the one or more configured computing systems, information indicating multiple alarms that have been triggered corresponding to multiple scaling policies of the plurality, based on the criteria for the multiple scaling policies being satisfied by monitored values for one or more resource utilization metrics;
selecting, by the one or more configured computing systems and based at least in part on the specified priorities of the prioritization policy, a scaling policy from the multiple scaling policies corresponding to the triggered multiple alarms, wherein the selected scaling policy has a higher priority in the prioritization policy than one or more other unselected scaling policies of the multiple scaling policies;
determining, by the one or more configured computing systems, a change to the group from the one or more associated changes for the selected scaling policy; and
initiating, by the one or more configured computing systems and based on the higher priority of the selected scaling policy in the prioritization policy, the determined change in the group without initiating additional changes to the group that are associated with the one or more other unselected scaling policies of the multiple scaling policies.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are described for scaling a group of computing resources. A computing resource service receives a scaling policy for use in scaling the group of computing resources. The scaling policy specifies a target level for a resource utilization metric and magnitude-based changes to the group. The computing resource service receives information about a magnitude of a measurement for the resource utilization metric. The computing resource service determines, based at least in part on the scaling policy, one or more changes for the group and initiates the one or more changes in the group.
57 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
receiving, by one or more configured computing systems of a computing resource service, a prioritization policy from a customer of the computing resource service for use in scaling a group of computing nodes provided to the customer by the computing resource service, wherein each of a plurality of scaling policies specifies one or more associated changes to be performed to the group if criteria are satisfied for the scaling policy that include at least one resource utilization metric exceeding a threshold, and wherein the prioritization policy specifies priorities between the plurality of scaling policies; receiving, by the one or more configured computing systems, information indicating multiple alarms that have been triggered corresponding to multiple scaling policies of the plurality, based on the criteria for the multiple scaling policies being satisfied by monitored values for one or more resource utilization metrics; selecting, by the one or more configured computing systems and based at least in part on the specified priorities of the prioritization policy, a scaling policy from the multiple scaling policies corresponding to the triggered multiple alarms, wherein the selected scaling policy has a higher priority in the prioritization policy than one or more other unselected scaling policies of the multiple scaling policies; determining, by the one or more configured computing systems, a change to the group from the one or more associated changes for the selected scaling policy; and initiating, by the one or more configured computing systems and based on the higher priority of the selected scaling policy in the prioritization policy, the determined change in the group without initiating additional changes to the group that are associated with the one or more other unselected scaling policies of the multiple scaling policies. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer-readable medium having stored contents that, when executed, cause one or more computing systems of a computing resource service to at least:
-
receive, by the one or more computing systems, a prioritization policy from a customer of the computing resource service for use in scaling a group of computing nodes provided to the customer by the computing resource service, wherein each of a plurality of scaling policies specifies one or more associated changes to be performed to the group of computing nodes if criteria are satisfied for the scaling policy that include at least one resource utilization metric exceeding a threshold, and wherein the prioritization policy specifies priorities between the plurality of scaling policies; receive, by the one or more computing systems, information indicating multiple scaling policies of the plurality that have been triggered, based on the criteria for the multiple scaling policies being satisfied by monitored values for one or more resource utilization metrics; select, by the one or more computing systems and based at least in part on the specified priorities of the prioritization policy, a scaling policy from the multiple scaling policies, wherein the selected scaling policy has a higher priority in the prioritization policy than one or more other unselected scaling policies of the multiple scaling policies; determine, by the one or more configured computing systems, a change to the group from the one or more associated changes for the selected scaling policy; and initiate, by the one or more computing systems and based on the higher priority of the selected scaling policy in the prioritization policy, the determined change to the group without initiating additional changes to the group that are associated with the one or more other unselected scaling policies of the multiple scaling policies. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more processors of one or more computing systems; and memory with stored computer executable instructions that, when executed by the one or more processors, cause the system to implement functionality of a computing resource service, including; receiving a prioritization policy from a customer of the computing resource service for use in scaling a group of computing nodes provided to the customer by the computing resource service, wherein each of a plurality of scaling policies specifies one or more associated changes to be performed to the group of computing nodes if criteria are satisfied for the scaling policy that include at least one resource utilization metric exceeding a threshold, and wherein the prioritization policy specifies priorities between the plurality of scaling policies; receiving information indicating multiple scaling policies of the plurality that have been triggered, based on the criteria for the multiple scaling policies being satisfied by monitored values for one or more resource utilization metrics; selecting, based at least in part on the specified priorities of the prioritization policy, a scaling policy from the multiple scaling policies, wherein the selected scaling policy has a higher priority from the prioritization policy than one or more other unselected scaling policies of the multiple scaling policies; determining a change to the group from the one or more associated changes for the selected scaling policy; and initiating, based on the higher priority of the selected scaling policy in the prioritization policy, the determined change in the group without initiating additional changes to the group that are associated with the one or more other unselected scaling policies of the multiple scaling policies. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification