System and method for adaptive admission control and resource management for service time guarantees
First Claim
1. A system for providing computer server servicing at a plurality of server resources of client requests for a plurality of request classes each having different response time guarantees comprising:
a computer processing unit (CPU) for executing a resource management component, a monitoring component, an adaptation engine, and a server resource scheduler;
the resource management component comprising a component for accepting and dynamically scheduling client requests and allocating system resources for request handling based on request classes, a predetermined scheduling policy and received adjustment input, said resource management component comprising a classifier for classifying incoming client requests into each of said plurality of request classes, a plurality of class-based accept queues for queuing said incoming client requests, and an accept queue scheduler for determining the order in which queued client requests are provided for servicing based on said classes, the predetermined policy and the adjustment input;
the monitoring component comprising a component for monitoring the handling of requests in different request classes, for monitoring resource usage and for providing request handling and resource usage feedback results to an adaptation engine;
the adaptation engine comprising an engine for evaluating the request handling and resource usage feedback results of said monitoring and for generating adjustment input to said resource management component to adjust the allocating of system resources for each class based on the evaluating of request handling and resource usage feedback results and response time guarantees; and
the server resource scheduler comprising a scheduler for allocating resources to service queued client requests provided from said accept queue scheduler, wherein the server resource scheduler is a hierarchical start-time fair queuing scheduler.
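The classifier, class-based accept queues, and accept queue scheduler recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the class name `AcceptQueues`, the `"class"` field on a request, the fallback class, and the weighted round-robin policy are all assumptions chosen for illustration, since the claim only requires "a predetermined scheduling policy" that responds to adjustment input.

```python
from collections import deque


class AcceptQueues:
    """Class-based accept queues drained by a weighted round-robin scheduler.

    Hypothetical sketch: weights map each request class to the number of
    requests released per scheduling round.
    """

    def __init__(self, weights, default_class):
        self.weights = dict(weights)
        self.default_class = default_class
        # One accept queue per request class.
        self.queues = {cls: deque() for cls in weights}

    def classify(self, request):
        # Assumed rule: the request carries its class; unknown or missing
        # classes fall back to the default class.
        cls = request.get("class")
        return cls if cls in self.queues else self.default_class

    def accept(self, request):
        # Queue the request in the accept queue of its class.
        self.queues[self.classify(request)].append(request)

    def adjust(self, new_weights):
        # Adjustment input, e.g. from an adaptation engine.
        for cls, weight in new_weights.items():
            if cls in self.queues:
                self.weights[cls] = weight

    def next_round(self):
        # Release up to weights[cls] requests from each class queue, in the
        # order the classes were declared.
        order = []
        for cls, queue in self.queues.items():
            for _ in range(self.weights[cls]):
                if not queue:
                    break
                order.append(queue.popleft())
        return order
```

A caller would construct the queues once, call `accept` for each arriving request, and call `next_round` whenever the server can take more work; `adjust` changes the per-class share without disturbing requests already queued.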
Abstract
A method and system architecture manage system resources so as to provide service time guarantees to the various classes of work using the system. When the system resources are not sufficient to provide the specified service time guarantees, the method and system architecture limit the work admitted into the system so that the guarantees are satisfied and the revenue derived from the system is maximized.
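One way to picture the admission step described in the abstract is the sketch below. The abstract does not say how the admitted work is chosen, so everything here is an assumption: the function name `admit_rates`, the per-class fields, the idea that `capacity` is the amount of server time per interval that can be consumed while still meeting the guarantees, and the greedy ordering by revenue per unit of server time (optimal only when admitted rates can be fractional).

```python
def admit_rates(classes, capacity):
    """Choose an admitted request rate per class under a capacity limit.

    classes:  list of dicts with hypothetical keys
              "name", "offered_rate" (requests per interval),
              "cost" (server time per request), "revenue" (per request).
    capacity: server time available per interval while guarantees hold
              (assumed to be known, e.g. from monitoring).
    """
    admitted = {}
    remaining = capacity
    # Serve the classes that earn the most revenue per unit of server time first.
    by_density = sorted(classes, key=lambda c: c["revenue"] / c["cost"], reverse=True)
    for c in by_density:
        affordable = remaining / c["cost"]          # rate the leftover capacity can carry
        rate = min(c["offered_rate"], affordable)   # never admit more than is offered
        admitted[c["name"]] = rate
        remaining -= rate * c["cost"]
    return admitted
```

With a capacity of 1000 ms of server time per second, classes costing 50 ms, 20 ms, and 10 ms per request would be admitted in order of revenue density until the capacity is used up, and the lowest-density class absorbs whatever shortfall remains.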
9 Claims
5. A method for providing computer server servicing at a plurality of server resources of client requests for a plurality of request classes each having different response time guarantees comprising the steps of:
accepting and dynamically scheduling client requests and allocating system resources for request handling based on request classes, a predetermined scheduling policy and received adjustment input by the steps of classifying incoming client requests into each of said plurality of request classes, queuing classified incoming client requests into a plurality of class-based accept queues, and determining the order in which queued client requests are provided for servicing based on said classes, the predetermined policy and adjustment input;
monitoring the handling of requests in different request classes and server resource usage and providing request handling and resource usage feedback results for evaluation;
evaluating the request handling and resource usage feedback results of said monitoring; and
generating adjustment input for the dynamically scheduling to adjust resource allocations for each class based on said evaluating of request handling and resource usage feedback results and the response time guarantees, wherein the allocating is performed by a hierarchical start-time fair queuing scheduler.
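The monitoring, evaluating, and generating steps of claim 5 form a feedback loop, which the following sketch illustrates. The claim does not specify an adaptation rule, so the proportional rule, the `step` parameter, the clamping, and the names `Monitor` and `adapt_weights` are assumptions made for illustration only.

```python
from collections import defaultdict


class Monitor:
    """Collects per-class response times and reports their means as feedback."""

    def __init__(self):
        self.samples = defaultdict(list)

    def record(self, cls, response_time):
        self.samples[cls].append(response_time)

    def feedback(self):
        # Mean response time per class since the last call, then reset.
        means = {cls: sum(v) / len(v) for cls, v in self.samples.items()}
        self.samples.clear()
        return means


def adapt_weights(weights, observed, guarantees, step=0.1):
    """Return new resource shares (summing to 1) as adjustment input.

    Assumed rule: a class that misses its response time guarantee gets a
    larger share, a class comfortably inside it gives some share back.
    """
    raw = {}
    for cls, share in weights.items():
        # A class with no samples is treated as exactly meeting its guarantee.
        measured = observed.get(cls, guarantees[cls])
        error = (measured - guarantees[cls]) / guarantees[cls]
        # Clamp so a single bad interval cannot swing a share too far.
        error = max(-1.0, min(1.0, error))
        raw[cls] = share * (1.0 + step * error)
    total = sum(raw.values())
    return {cls: value / total for cls, value in raw.items()}
```

In use, the monitor's `feedback()` result would be passed to `adapt_weights` once per control interval, and the returned shares handed to whatever scheduler performs the allocation.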
9. A program storage device readable by machine storing a program of instructions executable by the machine to perform method steps for providing computer server servicing at a plurality of server resources of client requests for a plurality of request classes each having different response time guarantees, said method comprising the steps of:
accepting and dynamically scheduling client requests and allocating system resources for request handling based on request classes, a predetermined scheduling policy and received adjustment input by the steps of classifying incoming client requests into each of said plurality of request classes, queuing classified incoming client requests into a plurality of class-based accept queues, and determining the order in which queued client requests are provided for servicing based on said classes, the predetermined policy and adjustment input;
monitoring the handling of requests in different request classes and server resource usage and providing request handling and resource usage feedback results for evaluation;
evaluating the request handling and resource usage feedback results of said monitoring; and
generating adjustment input for said dynamically scheduling to adjust the allocating of system resources for each class based on said evaluating of request handling and resource usage feedback results and the response time guarantees, wherein the allocating is performed by a hierarchical start-time fair queuing scheduler.
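Each independent claim ends by requiring a hierarchical start-time fair queuing scheduler. The sketch below shows only a single, flat level of start-time fair queuing, as an aid to reading the claims: a request receives a start tag equal to the larger of the current virtual time and its class's previous finish tag, a finish tag equal to the start tag plus cost divided by weight, and requests are dispatched in increasing start-tag order. The class name, the `cost` argument, and the omission of both the hierarchy and the end-of-busy-period virtual time update are simplifying assumptions.

```python
import heapq
import itertools


class StartTimeFairQueue:
    """Single-level start-time fair queuing over weighted request classes."""

    def __init__(self, weights):
        self.weights = dict(weights)
        self.virtual_time = 0.0
        # Finish tag of the most recently enqueued request of each class.
        self.last_finish = {cls: 0.0 for cls in weights}
        self.heap = []
        # Monotonic counter breaks ties between equal start tags (FIFO).
        self.counter = itertools.count()

    def enqueue(self, cls, request, cost):
        # Start tag: max of virtual time and the class's previous finish tag.
        start = max(self.virtual_time, self.last_finish[cls])
        # Finish tag: larger weights make a class's tags advance more slowly.
        finish = start + cost / self.weights[cls]
        self.last_finish[cls] = finish
        heapq.heappush(self.heap, (start, next(self.counter), cls, request))

    def dequeue(self):
        # Dispatch the request with the smallest start tag.
        if not self.heap:
            return None
        start, _, cls, request = heapq.heappop(self.heap)
        # Virtual time is the start tag of the request now in service.
        self.virtual_time = start
        return cls, request
```

With weights of 2 and 1 and equal-cost requests, a backlogged pair of classes is served in roughly a 2:1 ratio. A hierarchical variant would arrange such schedulers in a tree, with each node dividing its share among its children.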
Specification