Sequential cooperation between map and reduce phases to improve data locality
First Claim
Patent Images
1. An apparatus comprising:
- at least one processor; and
a computer readable storage medium having computer readable program code embodied therewith and executable by the at least one processor, the computer readable program code comprising;
computer readable program code configured to assimilate at least one job from at least one node, each job of the at least one job comprising at least a map phase and a reduce phase, each of the map and reduce phases comprising at least one task;
computer readable program code configured to compare a percentage of completed map tasks of a map phase of the at least one job with a percentage of started reducers of a the reduce phase of the at least one job; and
computer readable program code configured to schedule incremental launching of a the at least one task of a reduce phase of at least one job in response to the percentage of started reducers of the reduce phase of the at least one job being less than the percentage of completed map tasks of the map phase of the at least one job.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods and arrangements for task scheduling. At least one job is assimilated from at least one node, each job comprising at least a map phase and a reduce phase, each of the map and reduce phases comprising at least one task. Progress of a map phase of at least one job is compared with progress of a reduce phase of at least one job. Launching of a task of a reduce phase of at least one job is scheduled in response to progress of the reduce phase of at least one job being less than progress of the map phase of at least one job.
46 Citations
13 Claims
-
1. An apparatus comprising:
-
at least one processor; and a computer readable storage medium having computer readable program code embodied therewith and executable by the at least one processor, the computer readable program code comprising; computer readable program code configured to assimilate at least one job from at least one node, each job of the at least one job comprising at least a map phase and a reduce phase, each of the map and reduce phases comprising at least one task; computer readable program code configured to compare a percentage of completed map tasks of a map phase of the at least one job with a percentage of started reducers of a the reduce phase of the at least one job; and computer readable program code configured to schedule incremental launching of a the at least one task of a reduce phase of at least one job in response to the percentage of started reducers of the reduce phase of the at least one job being less than the percentage of completed map tasks of the map phase of the at least one job.
-
-
2. A computer program product comprising:
-
a computer readable storage medium, not including a signal, having computer readable program code embodied therewith, the computer readable program code comprising; computer readable program code configured to assimilate at least one job from at least one node, each job of the at least one job comprising at least a map phase and a reduce phase, each of the map and reduce phases comprising at least one task; computer readable program code configured to compare the percentage of completed map tasks of a map phase of the at least one job with the percentage of started reducers of a the reduce phase of the at least one job; and computer readable program code configured to schedule incremental launching of a the at least one task of a reduce phase of at least one job in response to the percentage of started reducers of the reduce phase of the at least one job being less than the percentage of completed map tasks of the map phase of the at least one job. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
Specification