×

System and method for topology-aware job scheduling and backfilling in an HPC environment

DC
  • US 8,910,175 B2
  • Filed: 10/11/2013
  • Issued: 12/09/2014
  • Est. Priority Date: 04/15/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • determining, using one or more computers, available space in a virtual cluster of a plurality of communicatively coupled nodes included in a computing environment, the virtual cluster associated with a group of users that submit similar jobs, and comprising a logical grouping of nodes configured to process related jobs;

    determining an optimum job that is compatible with the available space in the virtual cluster of nodes by;

    determining a number of available nodes in the virtual cluster;

    selecting a first job from a job queue;

    dynamically determining an optimum shape of the first job;

    determining whether the number of available nodes is enough to execute the first job, based on the optimum shape thereof; and

    dynamically allocating one or more of the available nodes for the first job, in the event that the determined number of available nodes is enough to execute the first job;

    wherein the optimum shape comprises one or more of;

    a best fit cube in which the one or more available nodes are allocated in a cubic volume so as to allow cooperating tasks to exchange data with any other tasks by minimizing the distance between any two nodes; and

    a best fit sphere in which the one or more available nodes are allocated in a spherical volume such that a first task is placed in a center node of the sphere with remaining tasks placed on nodes surrounding the center node so as to minimize the distance between the first task and the remaining tasks, wherein the remaining tasks communicate with the first task, but not with each other; and

    executing the optimum job in the available space in the virtual cluster of nodes.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×