Method and apparatus for dynamically scheduling requests
Abstract
Techniques are provided for dynamically scheduling requests in data processing systems in accordance with differentiated service levels. In a first aspect of the invention, a technique for processing a request to at least one server comprises the following steps. A request is received. Then, submission of the request to the at least one server is scheduled based on: (i) a quality-of-service (QoS) class assigned to a client from which the request originated; (ii) a response target associated with the QoS class; and (iii) an estimated response time associated with the at least one server. The technique may further comprise the step of withholding the request from submission to the at least one server when the request originated from a client assigned to a first QoS class to allow a request that originated from a client assigned to a second QoS class to meet a response target associated therewith.
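The three scheduling inputs named in the abstract can be illustrated with a minimal sketch. It assumes one simple rule that the text above does not itself state: a request may be held for as long as its class's response target exceeds the server's estimated response time. The names `QoSClass`, `CLIENT_CLASS`, and `submit_deadline`, as well as the class names and target values, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QoSClass:
    name: str
    response_target_s: float  # response target for the class, in seconds


# Hypothetical client-to-class assignment (input (i) in the abstract).
CLIENT_CLASS = {
    "client-a": QoSClass("gold", 0.5),
    "client-b": QoSClass("bronze", 4.0),
}


def submit_deadline(client_id: str, arrival_s: float,
                    estimated_response_s: float) -> float:
    """Latest time at which the request can be submitted and still be
    expected to meet its class's response target.

    Assumed rule (not taken from the source): the slack is the response
    target (ii) minus the estimated server response time (iii); a request
    with no slack is due immediately on arrival.
    """
    qos = CLIENT_CLASS[client_id]
    slack_s = max(qos.response_target_s - estimated_response_s, 0.0)
    return arrival_s + slack_s
```

Under this assumed rule, a request from a client in a class with a loose target can be held longer than one from a class with a tight target, which is what leaves room for the latter to be served first.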
25 Claims
1. A method of processing a request to at least one server, comprising the steps of:
a processor receiving the request; and
the processor determining when to submit the request to the at least one server based on:
(i) a quality-of-service (QoS) class assigned to a client from which the request originated;
(ii) a response target associated with the QoS class; and
(iii) an estimated response time associated with the at least one server.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. Apparatus for processing a request to at least one server, comprising:
a memory; and
at least one processor coupled to the memory and operative to receive a request, and determine when to submit the request to the at least one server based on:
(i) a quality-of-service (QoS) class assigned to a client from which the request originated;
(ii) a response target associated with the QoS class; and
(iii) an estimated response time associated with the at least one server;
wherein scheduling submission of the request to the at least one server comprises determining when to submit the request to the at least one server.
Dependent claims: 15, 16
17. An article of manufacture for processing a request to at least one server, comprising a computer readable medium containing one or more programs which when executed implement the steps of:
receiving the request; and
determining when to submit the request to the at least one server based on:
(i) a quality-of-service (QoS) class assigned to a client from which the request originated;
(ii) a response target associated with the QoS class; and
(iii) an estimated response time associated with the at least one server.
18. A method of processing requests to at least one server, comprising the steps of:
assigning at least one client to a quality-of-service (QoS) class from among at least two QoS classes;
assigning a response target to at least one QoS class;
estimating at least one response time of the at least one server based on one or more requests sent to the server within a given time period; and
a processor withholding submission of requests associated with a first one of the at least two QoS classes to allow requests associated with a second one of the at least two QoS classes to meet its response target based on the at least one estimated response time.
Dependent claims: 19, 20, 21, 22, 23, 24
25. A method of providing a scheduling service for requests to at least one server, comprising the step of:
a service provider providing a scheduler comprising a processor operative to:
(i) assign at least one client to a quality-of-service (QoS) class from among at least two QoS classes;
(ii) assign a response target to at least one QoS class;
(iii) estimate at least one response time of the at least one server based on one or more requests sent to the server within a given time period; and
(iv) withhold submission of requests associated with a first one of the at least two QoS classes to allow requests associated with a second one of the at least two QoS classes to meet its response target based on the at least one estimated response time.
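The estimate-and-withhold steps recited in claims 18 and 25 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the estimate is taken to be the mean response time of requests completed within a sliding window, and a request from a non-protected class is withheld whenever that estimate exceeds the protected class's response target. `WindowedEstimator`, `may_submit`, and the class names in `targets` are hypothetical.

```python
from collections import deque
from typing import Dict, Optional


class WindowedEstimator:
    """Mean response time over requests completed in the last window_s seconds
    (an assumed estimator; the claims only require 'a given time period')."""

    def __init__(self, window_s: float) -> None:
        self.window_s = window_s
        self._samples = deque()  # (completion_time_s, response_time_s)

    def record(self, completed_at_s: float, response_time_s: float) -> None:
        # Samples are expected to arrive in completion-time order.
        self._samples.append((completed_at_s, response_time_s))

    def estimate(self, now_s: float) -> Optional[float]:
        # Discard samples that have fallen out of the window.
        while self._samples and self._samples[0][0] < now_s - self.window_s:
            self._samples.popleft()
        if not self._samples:
            return None  # no recent data, so no estimate
        return sum(r for _, r in self._samples) / len(self._samples)


def may_submit(qos_class: str, estimate_s: Optional[float],
               targets: Dict[str, float], protected_class: str) -> bool:
    """Return False when a request should be withheld.

    Assumed policy: requests of the protected class are never withheld;
    requests of any other class are withheld while the estimated response
    time exceeds the protected class's response target.
    """
    if qos_class == protected_class or estimate_s is None:
        return True
    return estimate_s <= targets[protected_class]
```

A caller would record each completed request with `record`, then consult `may_submit` with the current `estimate` before forwarding a queued request; withheld requests stay queued until the estimate falls back under the protected target.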
Specification