×

System and method for cluster management based on HPC architecture

  • US 9,832,077 B2
  • Filed: 11/02/2015
  • Issued: 11/28/2017
  • Est. Priority Date: 04/15/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • identifying, by one or more first hardware processors of a cluster management engine, which nodes in a virtual cluster are available, the virtual cluster including a plurality of communicatively coupled nodes, each node including a cluster agent in communication with the cluster management engine, and the virtual cluster comprising a logical grouping of nodes configured to process jobs;

    identifying, by one or more of the one or more first hardware processors or one or more second hardware processors, a job of the jobs that is compatible with the identified available nodes including;

    selecting the job from a job queue;

    identifying a shape that sufficiently matches a shape of the job, the shape indicating a specific topology of a sub-cluster of nodes from the cluster of nodes suitable to execute the selected job;

    identifying one or more shapes of the available nodes including one or more of (1) a cube in which nodes of the available nodes are allocated in a logical cubic volume so as to allow tasks of the job to exchange data with other tasks of the job so as to minimize a distance between nodes that exchange data, and (2) a sphere in which nodes of the available nodes are allocated in a logical spherical volume so as to allow a first task of the job to be placed in a center node of the sphere with remaining tasks of the job placed on nodes surrounding the center node so as to minimize a distance between the first task and the remaining tasks; and

    determining whether the available nodes are sufficient to execute the job based on the identified shape that sufficiently matches the shape of the job and the one or more identified shapes of the available nodes;

    in response to determining whether the available nodes are sufficient, allocating, by the one or more of the one or more first hardware processors, the one or more second hardware processors, or one or more third hardware processors, a plurality of the available nodes for the job that sufficiently fit the identified shape of the job; and

    executing the job using the allocated nodes.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×