×

Identifying data communications algorithms of all other tasks in a single collective operation in a distributed processing system

  • US 8,689,228 B2
  • Filed: 07/19/2011
  • Issued: 04/01/2014
  • Est. Priority Date: 07/19/2011
  • Status: Active Grant
First Claim
Patent Images

1. Apparatus for topology mapping in a distributed processing system, the distributed processing system including a plurality of compute nodes, each compute node executing a plurality of tasks, each task assigned a unique rank, the apparatus comprising a computer processor, a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions that, when executed by the computer processor, cause the apparatus to carry out the steps of:

  • assigning each task to a geometry, wherein the geometry comprises a collection of compute nodes having distinct data communications abilities and resources available to tasks assigned to the geometry, wherein data communications abilities include one or more data communications algorithms supported by each compute nodes; and

    selecting, by each task from a list of possible data communications algorithms, one or more of the possible data communications algorithms supported by the compute node upon which the task is executing in the assigned geometry; and

    identifying, by each task, the selected data communications algorithms of all other tasks in a single collective operation, wherein the identification includes the steps of;

    setting, by each task a string of bits where each bit represents a data communications algorithm available for the task being executed on the compute node, a bit to true for each algorithm selected by the task; and

    performing, by all the tasks, an allreduce operation with a bitwise AND of all string of bits associated with each task.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×