System of assigning work requests based on classifying into an eligible class where the criteria is goal oriented and capacity information is available
First Claim
1. In a multi-processing environment having a plurality of computer systems, each of said systems having an operating system instance resident thereon and providing at least one application server, a method for assigning work requests among individual ones of the servers in order to meet business goals inherent in a policy governing the environment, the method comprising the steps of:
- in response to an incoming request to route work,classifying each active one of the systems either as an eligible system that then exhibits at least a minimum pre-defined capacity utilization, at a lowest one of a number of pre-defined business importance levels among all the systems, over a first pre-defined time interval, or as a candidate system so as to form first and second sets of eligible and candidate systems, respectively;
each eligible system being one of the systems that is goal-oriented and runs under the policy for which current capacity information is available, and each candidate system being any remaining one of said plurality of systems other than all the eligible systems; and
forming, in conjunction with said classifying step, third and fourth sets of eligible and candidate servers, respectively, such that said third and fourth sets contain identifications of all active ones of the servers residing on each of the systems in said first and second sets, respectively;
assigning, in response to said classifying step, a system weight to each one of the systems in the first and second sets, the system weight representing an amount of total available capacity utilized at the lowest one of the business importance levels, by said each one system in the first and second sets;
determining, in conjunction with said system weight assigning step and for each different one of the systems in said first and second sets, a corresponding server weight for each different server residing on said each different one system;
forming, in response to said third and fourth sets, an output server set populated first by identifications of eligible servers, and their associated server weights, taken, successively and in order of descending server weights, from said third set followed, by identification of servers, and their associated server weights, taken successively from said fourth set; and
routing work requests to each one of the servers identified in the output server set wherein, of a total number of work requests to be routed to all of the identified servers, an amount of the total work requests is routed to each one identified server in the output server set in proportion to the server weight associated therewith.
1 Assignment
0 Petitions
Accused Products
Abstract
Apparatus and accompanying methods for use preferably in a multi-system shared data (sysplex) environment (100), wherein each system (110) provides one or more servers (115), for dynamically and adaptively assigning and balancing new work and for new session requests, among the servers in the sysplex, in view of attendant user-defined business importance of these requests and available sysplex resource capacity so as to meet overall business goals. Specifically, systems and servers are categorized into two classes: eligible, i.e., goal-oriented servers running under a policy and for which capacity information is currently available, and candidate, i.e., servers which lack capacity information. Work requests for a client application are assigned first to various eligible systems and eligible servers thereon based on their current capacity to accept new work and in a manner that meets business goals inherent in a sysplex policy; followed, if additional servers are requested by that application, to candidate systems and candidate servers thereon. As to session placement, first those system(s) are selected that have lowest utilization, at a target importance level, but with sufficient available capacity at that level. Competing servers on the selected system(s) are then evaluated based on their corresponding session count data to yield a single resulting server. Thereafter, identification of multiple servers and their corresponding weights are returned to, e.g., a client application, for eventual routing of work requests to those servers, or the identification of a single server is returned to that application for establishing a new session therewith.
267 Citations
44 Claims
-
1. In a multi-processing environment having a plurality of computer systems, each of said systems having an operating system instance resident thereon and providing at least one application server, a method for assigning work requests among individual ones of the servers in order to meet business goals inherent in a policy governing the environment, the method comprising the steps of:
-
in response to an incoming request to route work, classifying each active one of the systems either as an eligible system that then exhibits at least a minimum pre-defined capacity utilization, at a lowest one of a number of pre-defined business importance levels among all the systems, over a first pre-defined time interval, or as a candidate system so as to form first and second sets of eligible and candidate systems, respectively;
each eligible system being one of the systems that is goal-oriented and runs under the policy for which current capacity information is available, and each candidate system being any remaining one of said plurality of systems other than all the eligible systems; andforming, in conjunction with said classifying step, third and fourth sets of eligible and candidate servers, respectively, such that said third and fourth sets contain identifications of all active ones of the servers residing on each of the systems in said first and second sets, respectively; assigning, in response to said classifying step, a system weight to each one of the systems in the first and second sets, the system weight representing an amount of total available capacity utilized at the lowest one of the business importance levels, by said each one system in the first and second sets; determining, in conjunction with said system weight assigning step and for each different one of the systems in said first and second sets, a corresponding server weight for each different server residing on said each different one system; forming, in response to said third and fourth sets, an output server set populated first by identifications of eligible servers, and their associated server weights, taken, successively and in order of descending server weights, from said third set followed, by identification of servers, and their associated server weights, taken successively from said fourth set; and routing work requests to each one of the servers identified in the output server set wherein, of a total number of work requests to be routed to all of the identified servers, an amount of the total work requests is routed to each one identified server in the output server set in proportion to the server weight associated therewith. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. In a multi-processing environment having a plurality of computer systems, each of said systems having an operating system instance resident thereon and providing at least one application server, apparatus for assigning work requests among individual ones of the servers in order to meet business goals inherent in a policy governing the environment, the apparatus comprising:
-
means, responsive to an incoming request to route work, for classifying each active one of the systems either as an eligible system that then exhibits at least a minimum pre-defined capacity utilization, at a lowest one of a number of pre-defined business importance levels among all the systems, over a first pre-defined time interval, or as a candidate system so as to form first and second sets of eligible and candidate systems, respectively;
each eligible system being one of the systems that is goal-oriented and runs under the policy for which current capacity information is available, and each candidate system being any remaining one of said plurality of systems other than all the eligible systems; andmeans, operative in conjunction with said classifying means, for forming third and fourth sets of eligible and candidate servers, respectively, such that said third and fourth sets contain identifications of all active ones of the servers residing on each of the systems in said first and second sets, respectively; means for assigning a system weight to each one of the systems in the first and second sets, the system weight representing an amount of total available capacity utilized at the lowest one of the business importance levels, by said each one system in the first and second sets; means for determining, in conjunction with said system weight assigning means and for each different one of the systems in said first and second sets, a corresponding server weight for each different server residing on said each different one system; means for forming, in response to said third and fourth sets, an output server set populated first by identifications of eligible servers, and their associated server weights, taken, successively and in order of descending server weights, from said third set followed, by identification of servers, and their associated server weights, taken successively from said fourth set; and means for routing work requests to each one of the servers identified in the output server set wherein, of a total number of work requests to be routed to all of the identified servers, an amount of the total work requests is routed to each one identified server in the output server set in proportion to the server weight associated therewith. - View Dependent Claims (36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification