
Performing optimized collective operations in an irregular subcommunicator of compute nodes in a parallel computer

  • US 10,382,527 B2
  • Filed: 10/16/2013
  • Issued: 08/13/2019
  • Est. Priority Date: 10/16/2013
  • Status: Active Grant
First Claim

1. A method of performing optimized collective operations in an irregular subcommunicator of compute nodes in a parallel computer, the method comprising:

  • identifying, within the irregular subcommunicator of the compute nodes in the parallel computer, regular neighborhoods of compute nodes within the irregular subcommunicator of the compute nodes in the parallel computer, wherein the irregular subcommunicator of the compute nodes in the parallel computer has topological communication gaps and the regular neighborhoods of the compute nodes in the parallel computer are logical planes with no topological communication gaps in each neighborhood, wherein identifying regular neighborhoods of compute nodes within the irregular subcommunicator of the compute nodes in the parallel computer comprises:

    establishing, by each respective compute node within the irregular subcommunicator of the compute nodes in the parallel computer, at least one logical plane that includes the respective compute node, wherein establishing the at least one logical plane comprises:

    identifying, in a positive direction of a first dimension, each logical plane that includes the respective compute node and a first compute node of the irregular subcommunicator that is one or more hops away from the respective compute node in a positive direction of a second dimension, wherein the second dimension is orthogonal to the first dimension;

    identifying, in a negative direction of the first dimension, each logical plane that includes the respective compute node and a second compute node of the irregular subcommunicator that is one or more hops away from the respective compute node in the positive direction of the second dimension;

  • selecting, for each neighborhood from the compute nodes of the neighborhood, a local root node;

  • assigning each local root node to a node of a neighborhood-wide tree topology;

  • mapping, for each neighborhood, the compute nodes of the neighborhood to a local tree topology having, at its root, the local root node of the neighborhood; and

  • performing a one way, rooted collective operation within the irregular subcommunicator including:

    performing, in one phase, the collective operation within each neighborhood and performing in another phase, the collective operation amongst the local root nodes.
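
The two-phase structure recited in the claim can be illustrated with a small, single-process sketch. This is not the patented implementation. It assumes a 2D grid of compute nodes given as (x, y) coordinates, and it replaces the claimed plane-establishing procedure with a much simpler stand-in: each neighborhood is a gap-free run of nodes along the x dimension at a fixed y. The function names `find_neighborhoods` and `two_phase_reduce` are hypothetical, the local root is arbitrarily taken to be the smallest coordinate in each neighborhood, and a sequential fold stands in for the tree topologies.

```python
from collections import defaultdict


def find_neighborhoods(nodes):
    """Split an irregular node set into gap-free runs (simplified stand-in).

    Assumption: a neighborhood is a maximal run of consecutive x coordinates
    at a fixed y. The claim describes logical planes; this is only a
    one-dimensional approximation of that idea.
    """
    rows = defaultdict(list)
    for x, y in nodes:
        rows[y].append(x)

    neighborhoods = []
    for y in sorted(rows):
        run = []
        for x in sorted(rows[y]):
            # A missing x coordinate is a topological gap: close the run.
            if run and x != run[-1][0] + 1:
                neighborhoods.append(run)
                run = []
            run.append((x, y))
        neighborhoods.append(run)
    return neighborhoods


def two_phase_reduce(values, neighborhoods, op):
    """Rooted reduction in two phases: within neighborhoods, then among roots."""
    # Phase 1: each neighborhood reduces onto its local root.
    local_results = {}
    for hood in neighborhoods:
        local_root = min(hood)  # hypothetical root-selection rule
        acc = values[local_root]
        for node in hood:
            if node != local_root:
                acc = op(acc, values[node])
        local_results[local_root] = acc

    # Phase 2: the local roots reduce onto a single global root.
    roots = sorted(local_results)
    global_root = roots[0]
    total = local_results[global_root]
    for root in roots[1:]:
        total = op(total, local_results[root])
    return global_root, total, roots


# Irregular subcommunicator: node (2, 0) is absent, leaving a gap in row 0.
nodes = {(0, 0), (1, 0), (3, 0), (0, 1), (1, 1), (2, 1)}
hoods = find_neighborhoods(nodes)
values = {node: node[0] + 10 * node[1] for node in nodes}
root, total, roots = two_phase_reduce(values, hoods, lambda a, b: a + b)
print(hoods)
print(root, total, roots)
```

In a real parallel computer each phase would be a message-passing collective over a tree rather than a loop, and the order of the two phases depends on the operation: a reduce runs the neighborhood phase first, while a broadcast would run the local-root phase first.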
