COST OPTIMIZATION OF CLOUD COMPUTING RESOURCES
Abstract
Systems and methods are provided for managing and/or executing cloud compute instances that may be pre-empted by their providers, known in at least one provider as “spot instances”. Various aspects manage early termination (i.e., before customer deallocation or compute task completion) of spot instances, enabling improved utilization, and increasing reliability of executing tasks on spot instances. In some embodiments, systems and/or methods are provided for prediction of when spot instance kills are likely to occur. These systems and/or methods can gracefully handle spot kills issued by Providers (e.g., trigger preservation operations), trigger transitions to spot instances at another Provider, trigger increased bidding to preserve current spot instances, and/or trigger a premium payment to allow for execution of preservation operations, among other options. Other embodiments enable customers to directly manage instance kill operations (e.g., a customer or a customer's program selects instances to terminate via a UI or API, etc.).
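The responses to a predicted spot kill listed in the abstract (continue, increase the bid, transition to another provider, run preservation operations) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `SpotStatus` fields, the `choose_action` function, the 0.5 threshold, and the cheapest-option-first ordering are hypothetical and are not taken from the patent. The premium-payment option is not modelled.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Action(Enum):
    CONTINUE = "continue"
    RAISE_BID = "raise_bid"
    MIGRATE = "migrate_to_other_provider"
    PRESERVE = "run_preservation_operations"


@dataclass
class SpotStatus:
    kill_probability: float              # predicted chance of a spot kill in the next interval (0..1)
    current_bid: float                   # current bid, price per hour
    predicted_price: float               # predicted spot price for the next interval
    max_bid: float                       # the most the customer is willing to pay
    alt_provider_price: Optional[float]  # predicted price at another provider, if known


def choose_action(s: SpotStatus, kill_threshold: float = 0.5) -> Action:
    """Pick a response to a predicted spot kill (hypothetical policy)."""
    if s.kill_probability < kill_threshold:
        return Action.CONTINUE
    # A kill is likely: prefer the cheaper way of keeping the task running.
    can_raise = s.predicted_price <= s.max_bid
    can_migrate = (
        s.alt_provider_price is not None and s.alt_provider_price <= s.max_bid
    )
    if can_raise and (not can_migrate or s.predicted_price <= s.alt_provider_price):
        return Action.RAISE_BID
    if can_migrate:
        return Action.MIGRATE
    # Neither option fits the budget: save state before the instance is killed.
    return Action.PRESERVE


status = SpotStatus(kill_probability=0.9, current_bid=0.10,
                    predicted_price=0.30, max_bid=0.20, alt_provider_price=0.12)
print(choose_action(status).value)
```

In this sketch the preservation path is the fallback when neither a higher bid nor a move to another provider fits within `max_bid`; a real policy would also weigh migration overhead and the time needed to checkpoint.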
20 Claims
1. A system for managing cloud compute resources, the system comprising:
- at least one processor operatively connected to a memory, the at least one processor when executing configured to:
obtain, store, and analyze historical time series of cloud compute resource (“resource”) characteristics, including at least cost and availability, for one or more resource types from one or more cloud compute providers (“providers”);
predict one or more resource characteristics over a future time duration for executing a submitted compute task at the one or more cloud compute providers;
- at least one application program interface (API) configured to:
monitor resource characteristics for the one or more resource types from the one or more cloud compute providers;
update predictions for one or more resource characteristics for remaining portions of the future time duration; and
provide the historical time series of resource characteristics and analysis of the time series, the analysis including the predictions of the one or more resource characteristics for determining allocation and execution of the compute task at one or more of the cloud compute providers.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method for managing cloud compute resources, the method comprising:
- obtaining, storing, and analyzing historical time series of cloud compute resource (“resource”) characteristics, including at least cost and availability, for one or more resource types from one or more cloud compute providers by at least one application program interface (API);
- predicting, by a computer system, one or more resource characteristics over a future time duration for executing a submitted compute task at the one or more cloud compute providers;
- monitoring, by at least one API, resource characteristics for the one or more resource types from the one or more cloud compute providers;
- updating, by at least one API, predictions for one or more resource characteristics for remaining portions of the future time duration; and
- providing, by at least one API, the historical time series of resource characteristics and analysis of these time series, the analysis including the predictions of the one or more resource characteristics for determining allocation and execution of the compute task at one or more of the cloud compute providers.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
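The sequence recited in the independent claims (store history, predict over a future duration, monitor, then update the prediction for the remaining portion) can be sketched roughly as follows. The claims do not name a prediction model, so simple exponential smoothing is swapped in here purely as a stand-in. The class `ResourceForecaster`, the function `run_task`, the smoothing factor, and the sample prices are all hypothetical.

```python
from typing import Iterable, List, Optional, Tuple


class ResourceForecaster:
    """Tracks cost and availability for one resource type at one provider."""

    def __init__(self, alpha: float = 0.5) -> None:
        self.alpha = alpha                            # smoothing factor (assumed value)
        self.history: List[Tuple[float, bool]] = []   # stored time series
        self.cost_level: Optional[float] = None
        self.avail_level: Optional[float] = None

    def observe(self, cost: float, available: bool) -> None:
        """Store one observation and fold it into the smoothed levels."""
        self.history.append((cost, available))
        avail = 1.0 if available else 0.0
        if self.cost_level is None:
            self.cost_level, self.avail_level = cost, avail
        else:
            a = self.alpha
            self.cost_level = a * cost + (1 - a) * self.cost_level
            self.avail_level = a * avail + (1 - a) * self.avail_level

    def predict(self, steps: int) -> List[Tuple[float, float]]:
        """Flat forecast of (cost, availability) for the next `steps` intervals."""
        if self.cost_level is None:
            raise ValueError("no observations yet")
        return [(self.cost_level, self.avail_level)] * steps


def run_task(forecaster: ResourceForecaster,
             live_feed: Iterable[Tuple[float, bool]],
             duration: int) -> List[List[Tuple[float, float]]]:
    """Monitor during execution and re-predict the remaining portion each step."""
    forecasts = [forecaster.predict(duration)]        # initial prediction
    for elapsed, (cost, available) in enumerate(live_feed, start=1):
        forecaster.observe(cost, available)           # monitoring
        remaining = duration - elapsed
        if remaining <= 0:
            break
        forecasts.append(forecaster.predict(remaining))  # updated prediction
    return forecasts


forecaster = ResourceForecaster(alpha=0.5)
for past_cost in (0.10, 0.10):                        # historical time series
    forecaster.observe(past_cost, True)
updates = run_task(forecaster, [(0.20, True), (0.20, False)], duration=3)
print([len(f) for f in updates])
```

Each entry in `updates` covers only the intervals still ahead of the task, so the forecast horizon shrinks as the task progresses. An allocation step could compare such forecasts across providers before placing or moving the task.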
Specification