×

Automatically building a locally managed virtual node grouping to handle a grid job requiring a degree of resource parallelism within a grid environment

  • US 7,707,288 B2
  • Filed: 01/06/2005
  • Issued: 04/27/2010
  • Est. Priority Date: 01/06/2005
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for building virtual node groupings within a grid environment, comprising:

  • detecting a grid job at a particular grid manager from among a plurality of grid managers within a grid environment, wherein said grid job requires a particular degree of parallelism for execution, wherein a plurality of resource nodes within said grid environment are identified in physically disparate groups each managed by one from among said plurality of grid managers through a plurality of web services implemented within a web services layer extended by an open grid services infrastructure atop a grid service layer comprising at least one grid service implemented within an open grid services architecture, wherein each of said plurality of grid managers comprises a grid manager communication subsystem for communicating between said plurality of grid managers, wherein said particular grid manager locally manages a first selection of resource nodes from among said plurality of resource nodes within said grid environment within a particular physical location, wherein at least one additional local grid manager manages a second selection of resource nodes from among said plurality of resource nodes within said particular physical location, wherein at least one remote grid manager manages a third selection of resource nodes from among said plurality of resource nodes within a remote physical location;

    responsive to said particular grid manager detecting that insufficient resources are available for a required execution environment for said grid job from said first selection of resource nodes, accessing, from said plurality of grid managers through said grid manager communication subsystem, a current availability, a current wait time, a current run time, and a current cost for each of said plurality of resource nodes within said grid environment;

    responsive to detecting said second selection of resource nodes are available to build said required execution environment for said grid job from said current availability returned from said at least one additional local grid manager, calculating a total local run time from said current wait time and current run time for said second selection of resources and calculating a total local cost from said current cost for said second selection of resources for building said required execution environment with said second selection of resource nodes;

    comparing said total local run time and said total local cost with a remote time calculated from said current wait time and said current run time for said third selection of resources and a remote cost calculated from said current cost for said third selection of resources;

    responsive to determining at least one of said total local run time less than said remote time and said total local cost less than said remote cost, selecting said second selection of resource nodes from among said plurality of resource nodes to build into a virtual node grouping for said required execution environment for executing said grid job;

    building said virtual node grouping by said particular grid manager through said grid manager communication subsystem by adding an Internet Protocol address alias for said virtual node grouping to a separate network card of each of said second selection of resource nodes to acquire temporary management control over said second selection of resource nodes from said at least one additional local grid manager for a duration of execution of said grid job within said virtual node grouping;

    responsive to determining said total run time slower than said remote time and said total local cost greater than said remote cost, selecting said third selection of resource nodes to build into said virtual node grouping for said required execution environment;

    building said virtual node grouping by said particular grid manager through said grid manager communication subsystem by adding said Internet Protocol address alias for said virtual node grouping to each separate network card of each of said third selection of resource nodes to acquire temporary management control over said third selection of resource nodes from said at least one remote grid manager; and

    responsive to the grid job execution completed, deconstructing said virtual node grouping.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×