Server allocation control method
First Claim
1. A method for adjusting the number of servers belonging to a server group in a network system which includes a plurality of user terminals connected to a network and server groups each containing a plurality of servers connected to said network to process requests from said plurality of user terminals, the method comprising:
- storing the number of requests from said plurality of user terminals for prescribed time intervals;
finding a function to describe a characteristic between time and the number of requests on the basis of a previous stored number of requests;
predicting the number of future requests by substituting a future time for said function;
obtaining a first average response time per server of said plurality of servers by substituting said predicted number of requests in a relational expression of the number of requests and an average response time per server of the plurality of servers where it is hypothesized that the number of requests from said plurality of user terminals follows a prescribed probability distribution;
determining whether said first average response time is a positive value and within a range no more than a threshold value set in advance, and increasing or decreasing the number of servers contained in said server group in accordance with the result of said determination.
1 Assignment
0 Petitions
Accused Products
Abstract
A method to automatically allocate servers to network services in a data center in real time and without need for operators for a load sharing device is provided. In the method, fluctuations in quantity of requests arriving at the network services can be monitored, the value of the quantity of requests for a subsequent fixed time interval can be predicted and, in accordance with the magnitude of the predicted value of the quantity of requests, the quantity of servers allocated to the network services can be controlled. Here, where traffic of the quantity indicated by the predicted value of the quantity of requests arrives at the network services, the number of servers allocated to the network services can be set in such a way that the average response time to the user terminals is equivalent to a response time threshold value or less set in advance by the operations manager.
189 Citations
10 Claims
-
1. A method for adjusting the number of servers belonging to a server group in a network system which includes a plurality of user terminals connected to a network and server groups each containing a plurality of servers connected to said network to process requests from said plurality of user terminals, the method comprising:
-
storing the number of requests from said plurality of user terminals for prescribed time intervals;
finding a function to describe a characteristic between time and the number of requests on the basis of a previous stored number of requests;
predicting the number of future requests by substituting a future time for said function;
obtaining a first average response time per server of said plurality of servers by substituting said predicted number of requests in a relational expression of the number of requests and an average response time per server of the plurality of servers where it is hypothesized that the number of requests from said plurality of user terminals follows a prescribed probability distribution;
determining whether said first average response time is a positive value and within a range no more than a threshold value set in advance, and increasing or decreasing the number of servers contained in said server group in accordance with the result of said determination. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A program for a resource allocation controller connected to a network in a network system which includes a plurality of user terminals connected to said network, server groups each containing a plurality of servers connected to said network to process requests from said user terminals and a load sharing device connected to said network involving a storage storing the number of requests from said user terminal for prescribed time intervals, a distribution rate of said number of requests and a configuration information of said server groups,
the program causing the resource allocation controller to execute the method comprising: -
finding a function to describe a characteristic between time and the number of requests on the basis of a previous stored number of requests stored in the load sharing device;
predicting the number of future requests by substituting a future time for said function;
obtaining a first average response time per server of said plurality of servers by substituting said predicted number of requests in a relational expression of the number of requests and an average response time per server of the plurality of servers where it is hypothesized that the number of requests from said plurality of user terminals follows a prescribed probability distribution;
determining whether said first average response time is a positive value and within a range no more than a threshold value set in advance, and increasing or decreasing the number of servers contained in said server group in accordance with the result of said determination. - View Dependent Claims (7, 8, 9, 10)
-
Specification