Method for allocating cloud service to servers of data center
First Claim
1. A dynamic planning method for server resources of a data center, adapted to a data center for allocating a service, wherein the service requests to configure d servers, the data center comprising a plurality of pods, each of the pods comprising a plurality of racks respectively connected to a plurality of edge switches, each of the racks being disposed with a plurality of servers, and the servers are sequentially coupled to a plurality of aggregation switches by the connected edge switches, wherein d is a positive integer, the dynamic planning method comprising:
- searching, by the data center, for a rack having a remaining resource space larger than or equal to d from the pods, for allocating the d servers to the rack;
if the rack cannot be found, executing, by the data center, a single pod reallocation on one of the pods to empty one of the racks of the pod, in order to facilitate the rack so the remaining resource space of the rack is larger than or equal to d, and allocating, by the data center, the d servers to the rack, wherein the step of executing the single pod reallocation comprises;
building a placement list comprising a plurality of reallocation placements for the pod, the reallocation placements comprising the exchanges of two server positions in the pod, and the exchanges of the corresponding server positions between the pods;
for each of the racks in the pod, representing all of the reallocation placements of the server positions in the rack by using a bipartite graph, and selecting a plurality of non-overlapping reallocation placement sets by using a maximum cardinality bipartite matching algorithm; and
selecting the rack from the reallocation placement sets in the pod having a placement quantity larger than or equal to d, and executing d reallocation placements on the rack, so as to empty the rack; and
if no rack in the pod can be emptied, executing, by the data center, a cross-pod reallocation on all of the pods to empty the corresponding server positions of the corresponding racks in the pods, in order for the remaining resource space of the corresponding server positions to be larger than or equal to d, and allocating, by the data center, the d servers to the corresponding server positions.
1 Assignment
0 Petitions
Accused Products
Abstract
A dynamic planning method for server resources of a data center is provided. The data center includes multiple pods and each pod includes multiple racks configurable with multiple servers. In the method, a rack having a remaining space larger than or equal to d is searched from the pods and used for configuring d servers. If the rack cannot be configured with the d servers, a single pod reallocation is executed on one of the pods to empty a rack so that the remaining space of the rack is larger than or equal to d and configurable with the d servers. If there is no rack that can be emptied, a cross-pod reallocation is executed to empty corresponding server positions of corresponding racks in the pods so that the remaining space of the corresponding server positions is larger than or equal to d and configurable with the d servers.
12 Citations
20 Claims
-
1. A dynamic planning method for server resources of a data center, adapted to a data center for allocating a service, wherein the service requests to configure d servers, the data center comprising a plurality of pods, each of the pods comprising a plurality of racks respectively connected to a plurality of edge switches, each of the racks being disposed with a plurality of servers, and the servers are sequentially coupled to a plurality of aggregation switches by the connected edge switches, wherein d is a positive integer, the dynamic planning method comprising:
-
searching, by the data center, for a rack having a remaining resource space larger than or equal to d from the pods, for allocating the d servers to the rack; if the rack cannot be found, executing, by the data center, a single pod reallocation on one of the pods to empty one of the racks of the pod, in order to facilitate the rack so the remaining resource space of the rack is larger than or equal to d, and allocating, by the data center, the d servers to the rack, wherein the step of executing the single pod reallocation comprises; building a placement list comprising a plurality of reallocation placements for the pod, the reallocation placements comprising the exchanges of two server positions in the pod, and the exchanges of the corresponding server positions between the pods; for each of the racks in the pod, representing all of the reallocation placements of the server positions in the rack by using a bipartite graph, and selecting a plurality of non-overlapping reallocation placement sets by using a maximum cardinality bipartite matching algorithm; and selecting the rack from the reallocation placement sets in the pod having a placement quantity larger than or equal to d, and executing d reallocation placements on the rack, so as to empty the rack; and if no rack in the pod can be emptied, executing, by the data center, a cross-pod reallocation on all of the pods to empty the corresponding server positions of the corresponding racks in the pods, in order for the remaining resource space of the corresponding server positions to be larger than or equal to d, and allocating, by the data center, the d servers to the corresponding server positions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A dynamic planning method for server resources of a data center, adapted to a data center for allocating a service, wherein the service requests to configure d servers, the data center comprising a plurality of pods, each of the pods comprising a plurality of racks respectively connected to a plurality of edge switches, each of the racks being disposed with a plurality of servers, and the servers are sequentially coupled to a plurality of aggregation switches by the connected edge switches, wherein d is a positive integer, the dynamic planning method comprising:
-
searching, by the data center, for a rack having a remaining resource space larger than or equal to d from the pods, for allocating the d servers to the rack; if the rack cannot be found, executing, by the data center, a cross-rack reallocation on one of the pods to empty one of the corresponding server positions of the racks in the pod, in order for the remaining resource space of the server position to be larger than or equal to d, and allocating, by the data center, the d servers to the server position of each of the racks, wherein the step of executing the cross-rack reallocation comprises; building a placement list comprising a plurality of reallocation placements for the pod, wherein the reallocation placements comprises the exchanges of two server positions in the pod, and the exchanges of the corresponding server positions between the pods; for the corresponding server position in the rack of the pod, representing all of the reallocation placements of the server position by using a bipartite graph, and selecting a plurality of non-overlapping reallocation placement sets by using a maximum cardinality bipartite matching algorithm; and selecting the server position having a placement quantity of the reallocation placement sets larger than or equal to d from the corresponding server positions of the rack, and executing d reallocation placements on the server position, so as to empty the server position; and if no server position in the pod can be emptied, executing, by the data center, a cross-pod reallocation on all of the pods to empty the corresponding server positions of the corresponding racks in the pods, in order for the remaining resource space of the corresponding server positions to be larger than or equal to d, and allocating, by the data center, the d servers to the corresponding server positions. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification