×

Adaptive datacenter topology for distributed frameworks job control through network awareness

  • US 9,785,522 B2
  • Filed: 11/14/2014
  • Issued: 10/10/2017
  • Est. Priority Date: 11/14/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • receiving a priority of a distributed computing job and a set of candidate compute nodes available to process the distributed computing job, the candidate compute nodes each available to process at least one input split, of a plurality of input splits, wherein the plurality of input splits comprise a data input of the distributed computing job;

    determining, based at least in part on a source code of the distributed computing job, that an intermediate traffic type of the distributed computing job is a first intermediate traffic type, of a plurality of intermediate traffic types, wherein the intermediate traffic type reflects a size of a data output generated by processing the distributed computing job relative to a size of the data input of the distributed computing job; and

    selecting a first compute node of the candidate compute nodes as a mapper node for one of the input splits, wherein the first compute node is selected based on the priority and the intermediate traffic type of the distributed computing job and an overall score computed for the first candidate compute node, wherein the first compute node is further selected upon determining that;

    the first compute node is not experiencing an error; and

    a resource utilization score for the first compute node does not exceed a utilization threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×