System and method for allocating server resources
First Claim
Patent Images
1. A server system comprising:
- at least two scaleable tiers of server machines;
a server pool including plural spare server machines;
means for computing an average response time for the server system to respond to transaction requests at the two scaleable tiers of server machines; and
means for increasing a number of server machines processing transactions for each of the two scaleable tiers of server machines by allocating the spare server machines to process a portion of the transactions, wherein the spare server machines are allocated to process a portion of the transactions when the average response time for the server system to respond to the transaction requests is greater than or equal to a specified average response time, wherein said means for computing further comprises a non-iterative queuing model for predicting the average response time for the server system in response to measured arrival rates of transaction requests into each of the two scaleable tiers of server machines, an average service demand at each of the two scaleable tiers of server machines, and a number of servers allocated to each of the two scaleable tiers of server machines.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method employing an allocation process for determining the number of server machines at each tier of a multiple tiered server system. The allocation process evaluates the number of server machines at each tier sufficient to achieve an average response time of a transaction request to be processed by the server system in response to changes in the average number of transaction requests. The allocation process also identifies shadow pricing enabling analysis of the cost associated with incremental changes in the average response time or other critical system resources.
-
Citations
14 Claims
-
1. A server system comprising:
-
at least two scaleable tiers of server machines; a server pool including plural spare server machines; means for computing an average response time for the server system to respond to transaction requests at the two scaleable tiers of server machines; and means for increasing a number of server machines processing transactions for each of the two scaleable tiers of server machines by allocating the spare server machines to process a portion of the transactions, wherein the spare server machines are allocated to process a portion of the transactions when the average response time for the server system to respond to the transaction requests is greater than or equal to a specified average response time, wherein said means for computing further comprises a non-iterative queuing model for predicting the average response time for the server system in response to measured arrival rates of transaction requests into each of the two scaleable tiers of server machines, an average service demand at each of the two scaleable tiers of server machines, and a number of servers allocated to each of the two scaleable tiers of server machines. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for allocating a server machine to at least two tiers of a server system, said method comprising:
-
computing, by a computer, an expected average response time as a function of transaction requests and an amount of resources allocated to each of the two tiers of the server system; determining whether an optimization problem is feasible; computing a lower bound and an upper bound on a number of server machines at each of the two tiers of said server system required to meet the average response time; computing a solution specifying a number of server machines allocated to each of the two tiers of said server system; computing an average time that transaction requests are pending at each of the two tiers; automatically increasing the number of server machines, from a pool of server machines, allocated to one of the two tiers at a point in time when the average time the transaction requests are pending at the one of the two tiers is greater than or equal to a pre-determined limit; and predicting, with a non-iterative queuing model, an average server system response time in response to measured arrival rates of transaction requests into said two tiers of server machines, an average service demand at said two tiers of server machines; and
a number of servers allocated to said two tiers of server machines. - View Dependent Claims (8)
-
-
9. An assembly for allocating server machines in a server system comprising:
-
at least two tiers of server machines; a pool of spare server machines that process transactions for the two tiers of server machines; means for computing an average response time for said two tiers of server machines to respond to a plurality of transaction requests; and means for increasing and decreasing a number of server machines from said pool that process transactions for said two tiers of server machines when average response times for processing transactions at the two tiers of server machines exceed a specified average response time, wherein said means for computing further comprises a non-iterative queuing model for predicting an average server system response time in response to measured arrival rates of transaction requests into said two tiers of server machines, an average service demand at said two tiers of server machines, and a number of servers allocated to said two tiers of server machines. - View Dependent Claims (10, 11)
-
-
12. A server system comprising:
-
an open queuing network of multiple server machines with each server machine having a processor-sharing queue with a single critical resource; at least two tiers of server machines; and a computer-readable medium comprising instructions for; (i) predicting an average system response time of said multiple server machines based on an arrival rate of transaction requests into each of the two tiers of server machines averaged over all transaction request types and a number of server machines allocated at each of the two tiers of server machines; (ii) solving a mathematical representation of an optimization objective and constraints of said server system; (iii) determining a number of server machines for each of the two tiers of server machines in response to said predicted the average system response time; (iv) automatically increasing the number of server machines, from a pool of server machines, processing transactions for each of the two tiers of server machines at a point in time when an average time that transactions requests are pending at the two tiers of server machines exceeds a threshold; and (v) predicting, with a non-iterative queuing model, an average server system response time in response to measured arrival rates of transaction requests into said two tiers of server machines, an average service demand at said two tiers of server machines, and a number of servers allocated to said two tiers of server machines. - View Dependent Claims (13, 14)
-
Specification