Acquisition and maintenance of compute capacity
Abstract
A system for providing low-latency computational capacity from a virtual compute fleet is provided. The system may be configured to maintain a plurality of virtual machine instances on one or more physical computing devices, wherein the plurality of virtual machine instances comprises a first pool comprising a first sub-pool of virtual machine instances and a second sub-pool of virtual machine instances, and a second pool comprising virtual machine instances used for executing one or more program codes thereon. The first sub-pool and/or the second sub-pool may be associated with one or more users of the system. The system may be further configured to process code execution requests and execute program codes on the virtual machine instances of the first or second sub-pool.
364 Citations
20 Claims
-
1. A system for providing low-latency computational capacity, the system comprising:
a virtual compute system comprising one or more hardware computing devices executing specific computer-executable instructions and configured to at least:
maintain a plurality of virtual machine instances on one or more physical computing devices, wherein the plurality of virtual machine instances comprises:
a warming pool comprising a first sub-pool of virtual machine instances and a second sub-pool of virtual machine instances, wherein the first sub-pool of virtual machine instances are associated with one or more fixed resource constraints and the second sub-pool of virtual machine instances are associated with one or more variable resource constraints; and
an active pool comprising virtual machine instances executing one or more program codes thereon;
receive a first request to execute a first program code associated with a first user, the first request including an identity of a user account associated with the first user and one or more parameters for executing the first program code;
determine, based on the identity of the user account, that the first program code is to be executed using a virtual machine instance associated with the second sub-pool that satisfies a first variable resource constraint;
in response to a determination that the second sub-pool lacks a virtual machine instance satisfying the first variable resource constraint, wait until a first virtual machine instance satisfying the first variable resource constraint becomes available in the second sub-pool;
cause the first virtual machine instance in the second sub-pool to be added to the active pool; and
cause the first program code to be executed, using the one or more parameters, in a container created on the first virtual machine instance.
Dependent claims: 2, 3, 4
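The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`Instance`, `VirtualComputeSystem`, `add_to_variable_sub_pool`, `execute`) is hypothetical, the "variable resource constraint" is assumed to be a memory size that an instance satisfies when it has at least that much memory, the "container" is a plain dictionary, and the `timeout` parameter is an addition for practicality that the claim does not recite.

```python
import threading
from dataclasses import dataclass, field


@dataclass
class Instance:
    instance_id: str
    memory_mb: int  # assumed stand-in for a variable resource constraint
    containers: list = field(default_factory=list)


class VirtualComputeSystem:
    def __init__(self, account_memory_mb):
        # Hypothetical lookup: user account identity -> required memory in MB.
        self._account_memory_mb = account_memory_mb
        self.fixed_sub_pool = []     # first sub-pool (fixed constraints), unused here
        self.variable_sub_pool = []  # second sub-pool (variable constraints)
        self.active_pool = []        # instances executing program code
        self._cond = threading.Condition()

    def add_to_variable_sub_pool(self, instance):
        """Make a pre-warmed instance available and wake any waiting requests."""
        with self._cond:
            self.variable_sub_pool.append(instance)
            self._cond.notify_all()

    def _find(self, required_mb):
        for inst in self.variable_sub_pool:
            if inst.memory_mb >= required_mb:
                return inst
        return None

    def execute(self, account_id, code, params, timeout=None):
        # Determine the constraint from the identity of the user account.
        required_mb = self._account_memory_mb[account_id]
        with self._cond:
            # Wait until a satisfying instance is in the second sub-pool.
            found = self._cond.wait_for(
                lambda: self._find(required_mb) is not None, timeout
            )
            if not found:
                raise TimeoutError("no matching instance became available")
            inst = self._find(required_mb)
            # Move the instance from the warming pool to the active pool.
            self.variable_sub_pool.remove(inst)
            self.active_pool.append(inst)
        # Create a (simulated) container on the instance and run the code in it.
        container = {"code": code, "params": params}
        inst.containers.append(container)
        return inst.instance_id, code(**params)
```

In this sketch a request that arrives before a matching instance exists blocks inside `execute` and resumes as soon as another thread calls `add_to_variable_sub_pool` with a suitable instance. A condition variable is used so that waiting requests do not poll the pool.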
-
5. A system for providing low-latency computational capacity, the system comprising:
a virtual compute system comprising one or more hardware computing devices executing specific computer-executable instructions and configured to at least:
maintain a plurality of virtual machine instances on one or more physical computing devices, wherein the plurality of virtual machine instances comprises:
a first pool comprising virtual machine instances associated with one or more variable resource constraints; and
a second pool comprising virtual machine instances executing one or more program codes thereon;
receive a first request to execute a first program code associated with a first user, the first request including user account information and one or more parameters for executing the first program code;
determine, based on the user account information, that the first program code is to be executed using a virtual machine instance associated with the first pool that satisfies a first variable resource constraint;
subsequent to a first virtual machine instance satisfying the first variable resource constraint becoming available in the first pool, cause the first virtual machine instance in the first pool to be added to the second pool; and
cause the first program code to be executed, using the one or more parameters, in a container created on the first virtual machine instance.
Dependent claims: 6, 7, 8, 9, 10, 11, 12
-
13. A computer-implemented method comprising:
as implemented by one or more computing devices configured with specific executable instructions,
maintaining a plurality of virtual machine instances on one or more physical computing devices, wherein the plurality of virtual machine instances comprises:
a first pool comprising virtual machine instances associated with one or more variable resource constraints; and
a second pool comprising virtual machine instances executing one or more program codes thereon;
receiving a first request to execute a first program code associated with a first user, the first request including user account information and one or more parameters for executing the first program code;
determining, based on the user account information, that the first program code is to be executed using a virtual machine instance associated with the first pool that satisfies a first variable resource constraint;
subsequent to a first virtual machine instance satisfying the first variable resource constraint becoming available in the first pool, causing the first virtual machine instance in the first pool to be added to the second pool; and
causing the first program code to be executed, using the one or more parameters, in a container created on the first virtual machine instance.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
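The ordering in the method of claim 13 ("subsequent to a first virtual machine instance ... becoming available") can also be read as event-driven rather than blocking. The sketch below shows one possible reading under stated assumptions, not the method as actually implemented: `Request`, `Scheduler`, `submit`, and `instance_available` are hypothetical names, the variable resource constraint is assumed to be a vCPU count that must match exactly, and the container is reduced to a direct function call.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable


@dataclass
class Request:
    account: str                 # user account information
    code: Callable[..., object]  # the program code to execute
    params: dict                 # parameters for executing the code


class Scheduler:
    def __init__(self, constraint_for_account):
        # Hypothetical lookup: account -> required vCPU count.
        self.constraint_for_account = constraint_for_account
        self.first_pool = []    # (instance_id, vcpus) waiting to be used
        self.second_pool = []   # instances executing program code
        self.pending = deque()  # requests with no matching instance yet
        self.results = []       # (instance_id, return value) per execution

    def submit(self, request):
        """Receive a request; it runs now or waits for a matching instance."""
        self.pending.append(request)
        self._drain()

    def instance_available(self, instance_id, vcpus):
        """Called when a new instance becomes available in the first pool."""
        self.first_pool.append((instance_id, vcpus))
        self._drain()

    def _drain(self):
        still_waiting = deque()
        while self.pending:
            req = self.pending.popleft()
            need = self.constraint_for_account[req.account]
            match = next((i for i in self.first_pool if i[1] == need), None)
            if match is None:
                still_waiting.append(req)
                continue
            # Move the instance from the first pool to the second pool.
            self.first_pool.remove(match)
            self.second_pool.append(match)
            # Stand-in for running the code in a container on that instance.
            self.results.append((match[0], req.code(**req.params)))
        self.pending = still_waiting
```

A request submitted while the first pool has no matching instance stays in `pending`; a later call to `instance_available` with a matching vCPU count moves that instance to the second pool and runs the queued code.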
Specification