SCALING A VIRTUAL MACHINE INSTANCE
First Claim
Patent Images
1. A computer implemented method for scaling a virtual machine, said method comprising:
- under the control of one or more computer systems configured with executable instructions,receiving, at the one or more computer systems from at least one customer via an application programming interface (API), a request to launch an instance of a virtual machine image;
provisioning, at the one or more computer systems, a virtual machine instance for the at least one customer based at least in part upon the virtual machine image, the virtual machine instance provisioned on a host computing device;
receiving, at the one or more computer systems, from the customer via the API, the customer-defined threshold associated with the virtual machine instance;
monitoring, at the one or more computer systems, one or more metrics associated with the virtual machine instance during execution of the virtual machine instance; and
adjusting, at the one or more computer systems, allocation of one or more computing resources to the virtual machine instance based at least in part on the one or more metrics and the customer-defined threshold, the computing resources including at least one of;
processing resources, networking resources, or memory resources.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are described for scaling of computing resources. A scaling service is utilized that allocates additional computing resources (e.g., processors, memory, etc.) to a virtual machine instance (or other compute instance) and/or de-allocates computing resources from a virtual machine instance according requests and/or thresholds. In addition to the foregoing, other aspects are described in the description, figures, and claims.
-
Citations
26 Claims
-
1. A computer implemented method for scaling a virtual machine, said method comprising:
under the control of one or more computer systems configured with executable instructions, receiving, at the one or more computer systems from at least one customer via an application programming interface (API), a request to launch an instance of a virtual machine image; provisioning, at the one or more computer systems, a virtual machine instance for the at least one customer based at least in part upon the virtual machine image, the virtual machine instance provisioned on a host computing device; receiving, at the one or more computer systems, from the customer via the API, the customer-defined threshold associated with the virtual machine instance; monitoring, at the one or more computer systems, one or more metrics associated with the virtual machine instance during execution of the virtual machine instance; and adjusting, at the one or more computer systems, allocation of one or more computing resources to the virtual machine instance based at least in part on the one or more metrics and the customer-defined threshold, the computing resources including at least one of;
processing resources, networking resources, or memory resources.- View Dependent Claims (2, 3)
-
4. A computer implemented method comprising:
under the control of one or more computer systems configured with executable instructions, receiving, at the one or more computer systems from at least one customer via an application programming interface (API), a request for a virtual machine based on a virtual machine image; causing, by a host computing device running in a service provider environment, a virtual machine to be provisioned on the host computing device based at least in part upon the virtual machine image; receiving, from a scaling service running in the service provider environment and operating in accordance with a scaling policy set by the customer, a request to adjust resources allocated to the virtual machine; and in response to receiving the request, adjusting allocation of one or more computing resources to the virtual machine. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A computing system, comprising:
-
at least one processor; and memory including instructions that, when executed by the processor, cause the computing system to; receive a web service request related to adjusting resources allocated to a virtual machine running in a service provider environment provision a virtual machine for a user; and in response to receiving the request, cause a server hosting the virtual machine to allocate one or more computing resources to the virtual machine. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A non-transitory computer readable storage medium storing one or more sequences of instructions executable by one or more processors to perform a set of operations comprising:
-
causing a virtual machine to be provisioned for a user on a host computing device, the virtual machine capable of executing a workload; receiving an instruction to scale the virtual machine, the instruction received from a scaling service to the host computing device, the scaling service residing externally with respect to the host computing device; and in response to receiving the instruction, adjusting allocation of one or more computing resources to the virtual machine, the one or more computing resources being allocatable by a hypervisor of the host computing device. - View Dependent Claims (24, 25, 26)
-
Specification