Predicting datacenter performance to improve provisioning
First Claim
1. A method comprising:
- receiving, by a datacenter provider, a user request comprising details of an application to be executed by a multi-resource, multi-tenant datacenter and a high level constraint associated with the application;
mapping the user request to a set of different candidate resource combinations within the datacenter, each candidate resource combination specifying at least a number of virtual machines and a network bandwidth between virtual machines for executing the application to satisfy the high level constraint;
presenting data relating to at least a subset of the candidate resource combinations to the datacenter provider to enable the datacenter provider to select one of the at least a subset of the candidate resource combinations to execute the application; and
,selecting one of the at least a subset of the candidate resource combinations based on a value of a metric computed for each of the different candidate resource combinations, the metric describing an impact on the ability of the datacenter to accommodate subsequent requests after allocating the candidate resource combination based at least in part on an imbalance in utilization across all resources, wherein the imbalance is calculated as the summation of the square of the quotient of dividing the number of unallocated VM slots by the total number of VM slots and the square of the quotient of dividing unallocated outbound link capacity by the total outbound link capacity.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods of predicting datacenter performance to improve provisioning are described. In an embodiment, a resource manager element receives a request from a tenant which describes an application that the tenant wants executed by a multi-resource, multi-tenant datacenter. The request that has been received is mapped to a set of different candidate resource combinations within the datacenter, where each candidate resource combination can be used to execute the application in a manner which satisfies a high level constraint specified within the request. This mapping may, for example, be performed using a combination of benchmarking and an analytical model. In some examples, each resource combination may comprise a number of virtual machines and a bandwidth between those machines. Data relating to at least a subset (and in some examples, two or more) of the candidate resource combinations is then presented to the tenant.
15 Citations
19 Claims
-
1. A method comprising:
-
receiving, by a datacenter provider, a user request comprising details of an application to be executed by a multi-resource, multi-tenant datacenter and a high level constraint associated with the application; mapping the user request to a set of different candidate resource combinations within the datacenter, each candidate resource combination specifying at least a number of virtual machines and a network bandwidth between virtual machines for executing the application to satisfy the high level constraint; presenting data relating to at least a subset of the candidate resource combinations to the datacenter provider to enable the datacenter provider to select one of the at least a subset of the candidate resource combinations to execute the application; and
,selecting one of the at least a subset of the candidate resource combinations based on a value of a metric computed for each of the different candidate resource combinations, the metric describing an impact on the ability of the datacenter to accommodate subsequent requests after allocating the candidate resource combination based at least in part on an imbalance in utilization across all resources, wherein the imbalance is calculated as the summation of the square of the quotient of dividing the number of unallocated VM slots by the total number of VM slots and the square of the quotient of dividing unallocated outbound link capacity by the total outbound link capacity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 18)
-
-
11. A method comprising:
-
receiving, by a datacenter provider, a user request comprising details of an application to be executed by a multi-resource, multi-tenant datacenter and high level constraint associated with the application; mapping the user request to a set of different candidate resource combinations within the datacenter, each candidate resource combination specifying a combination of datacenter resources for executing the application to satisfy the high level constraint; presenting data relating to two or more candidate resource combinations to the datacenter provider, the data comprising a metric describing an imbalance of resources across the datacenter, the metric being calculated based at least in part on a summation of the square of the quotient of dividing the number of unallocated VM slots by the total number of VM slots and the square of the quotient of dividing unallocated outbound link capacity by the total outbound link capacity, presenting data relating to two or more candidate combinations enabling the datacenter provider to associate a price with each of the presented two or more candidate combinations; and
,presenting the data relating to two or more candidate combinations and the associated prices to the user, presenting the data relating to two or more candidate combinations and associated prices enabling the user to select one of the two or more candidate resource combinations to execute the application. - View Dependent Claims (12, 13, 14, 15, 19)
-
-
16. One or more storage media storing device-executable instructions that, when executed by a computing system, direct the computing system to perform steps comprising:
-
receiving a user request comprising details of a data analytics job to be performed by a datacenter and a high level constraint associated with the job; mapping the user request to a set of different candidate resource combinations within the datacenter, each candidate resource combination specifying at least a number of virtual machines and a network bandwidth between virtual machines for executing the data analytics job to satisfy the high level constraint; presenting data relating to two or more candidate resource combinations to the datacenter provider, the data comprising a metric describing an imbalance of resources across the datacenter, the metric being calculated based at least in part on a summation of the square of the quotient of dividing the number of unallocated VM slots by the total number of VM slots and the square of the quotient of dividing unallocated outbound link capacity by the total outbound link capacity, presenting including enabling the datacenter provider to associate a price with each of the presented two or more candidate combinations; and
,presenting the data relating to two or more candidate combinations and the associated prices to the user, presenting the data relating to two or more candidate combinations and associated prices enabling the user to select one of the two or more candidate resource combinations to execute the application. - View Dependent Claims (17)
-
Specification