WORKLOAD MANAGEMENT WITH DATA ACCESS AWARENESS IN A COMPUTING CLUSTER
First Claim
1. A method for workload management with data access awareness in a computing cluster, by a processor, comprising:
- configuring a workload manager within the computing cluster to include a data requirements evaluator module and a scheduler module; and
- in response to receiving an input workload for scheduling by the workload manager:
  - retrieving, by the data requirements evaluator module, a set of inputs from a storage system, wherein the inputs each include at least one of:
    - an indication of whether the input workload is intensive in Input/Output (I/O) of new data or intensive in I/O of existing data,
    - data locality proportions for a set of files associated with the input workload, and
    - data access costs specified for each pair of hosts in the computing cluster;
  - generating, by the data requirements evaluator module, a list of cluster hosts ranked for performing the input workload according to data access considerations;
  - providing the ranked list of cluster hosts to the scheduler module; and
  - generating, by the scheduler module, a scheduling of the input workload to certain hosts within the computing cluster where the generated scheduling is optimized with the data access considerations.
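For illustration only, the ranking step recited in this claim might be sketched in Python as follows. The names `WorkloadInputs` and `rank_hosts` are hypothetical, and the cost model is an assumption of this sketch: the expected cost of a host is the sum, over the hosts that store the workload's files, of the locality proportion times the pairwise access cost, with local access defaulting to 0 and unlisted remote pairs defaulting to 1. The claim itself does not specify a formula.

```python
from dataclasses import dataclass


@dataclass
class WorkloadInputs:
    """Hypothetical container for the three inputs named in the claim."""
    # True: workload is intensive in I/O of new data; False: of existing data.
    new_data_intensive: bool
    # Storing host -> proportion of the workload's file data held on that host.
    locality: dict[str, float]
    # (reading host, storing host) -> relative cost of that access.
    access_cost: dict[tuple[str, str], float]


def rank_hosts(hosts: list[str], inputs: WorkloadInputs) -> list[str]:
    """Return hosts ordered from lowest to highest expected data access cost."""
    if inputs.new_data_intensive:
        # Assumption: a workload that mostly writes new data has no
        # locality preference, so the given host order is kept.
        return list(hosts)

    def expected_cost(reader: str) -> float:
        # Assumed defaults: local access costs 0, unlisted remote pairs cost 1.
        return sum(
            proportion
            * inputs.access_cost.get((reader, holder), 0.0 if reader == holder else 1.0)
            for holder, proportion in inputs.locality.items()
        )

    # sorted() is stable, so hosts with equal cost keep their input order.
    return sorted(hosts, key=expected_cost)
```

With three hosts where most of the data resides on one of them, that host would rank first, followed by hosts that are cheap to reach it from.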
Abstract
Embodiments for workload management with data access awareness in a computing cluster. In response to receiving an input workload for scheduling by a workload manager, a set of inputs is retrieved from a storage system by a data requirements evaluator module. The data requirements evaluator module generates a list of cluster hosts ranked for performing the input workload according to data access considerations and provides the ranked list of cluster hosts to a scheduler module. The scheduler module generates a scheduling of the input workload to certain hosts within the computing cluster where the generated scheduling is optimized with the data access considerations.
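The hand-off from the data requirements evaluator to the scheduler described above can be pictured with a minimal Python sketch. It assumes, purely for illustration, that the scheduler also knows a number of free slots per host and fills hosts in ranked order; the function name `schedule`, the `free_slots` map, and the greedy policy are hypothetical and are not taken from the patent text.

```python
def schedule(ranked_hosts: list[str], free_slots: dict[str, int], tasks: int) -> dict[str, int]:
    """Place `tasks` on hosts, filling the most preferred hosts first.

    ranked_hosts: hosts ordered best-first by data access considerations.
    free_slots:   hypothetical resource view, host -> available task slots.
    Returns a mapping host -> number of tasks placed on that host.
    """
    assignment: dict[str, int] = {}
    remaining = tasks
    for host in ranked_hosts:
        if remaining == 0:
            break
        # Take as many slots as this host offers, up to what is still needed.
        take = min(free_slots.get(host, 0), remaining)
        if take > 0:
            assignment[host] = take
            remaining -= take
    if remaining > 0:
        raise RuntimeError(f"{remaining} task(s) could not be placed")
    return assignment
```

A greedy fill keeps the example short; a real scheduler would likely weigh the ranking against other resource constraints rather than follow it strictly.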
24 Claims
1. A method for workload management with data access awareness in a computing cluster, by a processor, comprising:
- configuring a workload manager within the computing cluster to include a data requirements evaluator module and a scheduler module; and
- in response to receiving an input workload for scheduling by the workload manager:
  - retrieving, by the data requirements evaluator module, a set of inputs from a storage system, wherein the inputs each include at least one of:
    - an indication of whether the input workload is intensive in Input/Output (I/O) of new data or intensive in I/O of existing data,
    - data locality proportions for a set of files associated with the input workload, and
    - data access costs specified for each pair of hosts in the computing cluster;
  - generating, by the data requirements evaluator module, a list of cluster hosts ranked for performing the input workload according to data access considerations;
  - providing the ranked list of cluster hosts to the scheduler module; and
  - generating, by the scheduler module, a scheduling of the input workload to certain hosts within the computing cluster where the generated scheduling is optimized with the data access considerations.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for workload management with data access awareness in a computing cluster, the system comprising:
- a workload manager within the computing cluster, the workload manager including a data requirements evaluator module and a scheduler module; and
- a processor executing instructions stored in a memory device, wherein the processor, upon executing the instructions, in response to receiving an input workload for scheduling by the workload manager:
  - retrieves, by the data requirements evaluator module, a set of inputs from a storage system, wherein the inputs each include at least one of:
    - an indication of whether the input workload is intensive in Input/Output (I/O) of new data or intensive in I/O of existing data,
    - data locality proportions for a set of files associated with the input workload, and
    - data access costs specified for each pair of hosts in the computing cluster;
  - generates, by the data requirements evaluator module, a list of cluster hosts ranked for performing the input workload according to data access considerations;
  - provides the ranked list of cluster hosts to the scheduler module; and
  - generates, by the scheduler module, a scheduling of the input workload to certain hosts within the computing cluster where the generated scheduling is optimized with the data access considerations.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A computer program product for workload management with data access awareness in a computing cluster, by a processor, the computer program product embodied on a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
- an executable portion that configures a workload manager within the computing cluster to include a data requirements evaluator module and a scheduler module; and
- an executable portion that, in response to receiving an input workload for scheduling by the workload manager:
  - retrieves, by the data requirements evaluator module, a set of inputs from a storage system, wherein the inputs each include at least one of:
    - an indication of whether the input workload is intensive in Input/Output (I/O) of new data or intensive in I/O of existing data,
    - data locality proportions for a set of files associated with the input workload, and
    - data access costs specified for each pair of hosts in the computing cluster;
  - generates, by the data requirements evaluator module, a list of cluster hosts ranked for performing the input workload according to data access considerations;
  - provides the ranked list of cluster hosts to the scheduler module; and
  - generates, by the scheduler module, a scheduling of the input workload to certain hosts within the computing cluster where the generated scheduling is optimized with the data access considerations.
Dependent claims: 18, 19, 20, 21, 22, 23, 24
Specification