Application-based elastic resource provisioning in disaggregated computing systems
Abstract
Various embodiments for elastic resource provisioning in a disaggregated cloud computing environment, by a processor device, are provided. Respective members of pools of hardware resources within the disaggregated cloud computing environment are provisioned to a tenant according to an application-level service level agreement (SLA). Upon detecting a potential violation of the application-level SLA, additional respective members of the pools of hardware resources are provisioned on a component level to the tenant to avoid a violation of the SLA by one of a scale-up process and a scale-out process.
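The flow summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The pool names, the latency-based SLA metric, the 90% warning threshold, and every class and function name below are hypothetical assumptions introduced only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ResourcePool:
    """A pool of one hardware component type (hypothetical model)."""
    kind: str          # e.g. "cpu" or "memory"
    free_units: int    # members of the pool not yet assigned to any tenant

    def take(self, count: int) -> int:
        """Hand out up to `count` members; return how many were granted."""
        granted = min(count, self.free_units)
        self.free_units -= granted
        return granted


@dataclass
class Tenant:
    name: str
    sla_max_latency_ms: float                       # assumed application-level SLA metric
    allocation: dict = field(default_factory=dict)  # component kind -> units held


def provision(tenant, pools, request):
    """Provision pool members to a tenant, one component type at a time."""
    for kind, count in request.items():
        granted = pools[kind].take(count)
        tenant.allocation[kind] = tenant.allocation.get(kind, 0) + granted


def potential_violation(tenant, observed_latency_ms, warn_fraction=0.9):
    """Flag a potential violation before the SLA limit is actually crossed."""
    return observed_latency_ms >= warn_fraction * tenant.sla_max_latency_ms


pools = {"cpu": ResourcePool("cpu", 64), "memory": ResourcePool("memory", 128)}
tenant = Tenant("tenant-a", sla_max_latency_ms=100.0)

# Initial allocation driven by the application-level SLA.
provision(tenant, pools, {"cpu": 4, "memory": 8})

# Observed latency approaches the SLA limit, so add only the component that is short.
if potential_violation(tenant, observed_latency_ms=95.0):
    provision(tenant, pools, {"cpu": 2})
```

The point of the sketch is the granularity: the additional allocation touches a single component pool rather than a whole server.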
18 Claims
1. A method for elastic resource provisioning in a disaggregated cloud computing environment, by a processor device, comprising:
provisioning respective members of pools of hardware resources within the disaggregated cloud computing environment to a tenant according to an application-level service level agreement (SLA); and
upon detecting a potential violation of the application-level SLA, provisioning additional respective members of the pools of hardware resources on a component level to the tenant to avoid a violation of the SLA by each of a scale-up process and a scale-out process based upon workload characteristics to ensure a tenant receives a service-level as specified within the SLA while retaining an overall cost and performance efficiency model utilized in the disaggregated cloud computing environment;
wherein:
the scale-up process is used for scale-up friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a quantity of the respective members of the pools of hardware resources; and
the scale-out process is used for scale-out friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a CPU interconnect bandwidth and off-chip memory bandwidth between the respective members of the pool of hardware resources.
Dependent claims: 2, 3, 4, 5, 6
7. A system for elastic resource provisioning in a disaggregated cloud computing environment, the system comprising:
at least one processor device, wherein the at least one processor device:
provisions respective members of pools of hardware resources within the disaggregated cloud computing environment to a tenant according to an application-level service level agreement (SLA); and
upon detecting a potential violation of the application-level SLA, provisions additional respective members of the pools of hardware resources on a component level to the tenant to avoid a violation of the SLA by each of a scale-up process and a scale-out process based upon workload characteristics to ensure a tenant receives a service-level as specified within the SLA while retaining an overall cost and performance efficiency model utilized in the disaggregated cloud computing environment;
wherein:
the scale-up process is used for scale-up friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a quantity of the respective members of the pools of hardware resources; and
the scale-out process is used for scale-out friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a CPU interconnect bandwidth and off-chip memory bandwidth between the respective members of the pool of hardware resources.
Dependent claims: 8, 9, 10, 11, 12
13. A computer program product for elastic resource provisioning in a disaggregated cloud computing environment by at least one processor device, the computer program product embodied on a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
an executable portion that provisions respective members of pools of hardware resources within the disaggregated cloud computing environment to a tenant according to an application-level service level agreement (SLA); and
an executable portion that, upon detecting a potential violation of the application-level SLA, provisions additional respective members of the pools of hardware resources on a component level to the tenant to avoid a violation of the SLA by each of a scale-up process and a scale-out process based upon workload characteristics to ensure a tenant receives a service-level as specified within the SLA while retaining an overall cost and performance efficiency model utilized in the disaggregated cloud computing environment;
wherein:
the scale-up process is used for scale-up friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a quantity of the respective members of the pools of hardware resources; and
the scale-out process is used for scale-out friendly workloads, and comprises provisioning the additional respective members of the pools of hardware resources according to a CPU interconnect bandwidth and off-chip memory bandwidth between the respective members of the pool of hardware resources.
Dependent claims: 14, 15, 16, 17, 18
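The distinction the independent claims draw between the scale-up and scale-out processes can be sketched as a selection rule. This is a hedged illustration only: the claims state which criteria each process uses (quantity of members versus CPU interconnect and off-chip memory bandwidth), but not how candidates are ranked. Ranking by the largest quantity, ranking by the lower of the two bandwidths, and all names below (`Workload`, `Candidate`, `choose_candidate`) are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    name: str
    scale_up_friendly: bool   # assumed to come from prior workload profiling


@dataclass
class Candidate:
    """A group of pool members that could be added to a tenant (hypothetical)."""
    label: str
    units: int                 # quantity of members offered
    interconnect_gbps: float   # CPU interconnect bandwidth between the members
    memory_gbps: float         # off-chip memory bandwidth between the members


def choose_candidate(workload, candidates):
    """Pick additional resources using the criterion that matches the workload."""
    if workload.scale_up_friendly:
        # Scale-up: select according to the quantity of pool members.
        # Assumption: prefer the candidate offering the most units.
        return max(candidates, key=lambda c: c.units)
    # Scale-out: select according to interconnect and memory bandwidth.
    # Assumption: the lower of the two bandwidths is treated as the bottleneck.
    return max(candidates, key=lambda c: min(c.interconnect_gbps, c.memory_gbps))


candidates = [
    Candidate("dense", units=8, interconnect_gbps=20.0, memory_gbps=25.0),
    Candidate("fast-link", units=4, interconnect_gbps=40.0, memory_gbps=50.0),
]

scale_up_pick = choose_candidate(Workload("in-memory-db", True), candidates)
scale_out_pick = choose_candidate(Workload("web-tier", False), candidates)
print(scale_up_pick.label, scale_out_pick.label)
```

With these sample values, the scale-up friendly workload is given the candidate with more units, while the scale-out friendly workload is given the candidate with the better-connected members.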
Specification