×

Scheduling mapReduce tasks based on estimated workload distribution

  • US 9,852,012 B2
  • Filed: 08/26/2015
  • Issued: 12/26/2017
  • Est. Priority Date: 08/26/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving a set of task statistics corresponding to task execution within a MapReduce job;

    estimating a completion time for a set of tasks to be executed to provide an estimated completion time;

    calculating a soft decision point based on a convergence of a workload distribution corresponding to a set of executed tasks;

    calculating a hard decision point (HDP) based on the estimated completion time for the set of tasks to be executed according to the equation HDP=max{0, map task completion time−

    shuffle operation completion time};

    determining a selected decision point based on the soft decision point and the hard decision point; and

    scheduling and executing upcoming tasks based on the selected decision point.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×