×

PERFORMING COLLECTIVE OPERATIONS IN A DISTRIBUTED PROCESSING SYSTEM

  • US 20130081037A1
  • Filed: 11/16/2012
  • Published: 03/28/2013
  • Est. Priority Date: 07/13/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method of performing collective operations on a hybrid distributed processing system, the hybrid distributed processing system including a plurality of compute nodes, each compute node having a plurality of tasks, each task assigned a unique rank, each compute node coupled for data communications by at least one data communications network implementing at least two different networking topologies, wherein a first networking topology comprises a tiered tree topology having a root task, and at least two child tasks, where the two child tasks are peers of one another in the same tier, the method comprising:

  • determining by at least one task that a parent of the task has failed to send the task data through the tree topology; and

    determining whether to request the data from a grandparent of the task or a peer of the task in the same tier in the tree topology;

    if the task requests the data from the grandparent, requesting the data and receiving the data from the grandparent of the task through the second networking topology; and

    if the task requests the data from a peer of the task in the same tier in the tree, requesting the data and receiving the data from a peer of the task through the second networking topology.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×