Peer to peer job monitoring and control in grid computing systems
First Claim
1. A computer system comprising:
- a centralized server having overall control of a batch or grid computing system for receiving and dispatching requests from a submitter computer to an available processing computer, said processing computer being selected by said centralized server; and
means for communicating a batch job request from the submitter computer to the processing computer via the centralized server for processing by the processing computer, said batch job request identifying a monitoring process;
wherein the monitoring process is executed at the submitter computer and the processing computer, the monitoring process establishing a communication link between the submitter computer and the processing computer without participation by the centralized server, the monitoring process being configured to communicate a processing status of the batch job request via said communication link;
wherein said centralized server is only responsible for workload balancing and job initiation and completion while other command and status information is communicated directly between the submitter computer and the processing computer that are processing their respective workloads; and
wherein the submitter computer and the processing computer use event-driven peer to peer communications between each other such that detailed status and control information is passed without overloading the centralized server and without polling.
3 Assignments
0 Petitions
Accused Products
Abstract
A solution for improved monitoring and control of jobs in grid and batch computing systems provides a centralized server'"'"'s batch manager which is only responsible for workload balancing and job initiation and completion, all other command and status information are communicated directly between the plurality of submitter'"'"'s systems and the plurality of client systems that are processing their respective workloads. The computing system and communication process utilizes event-driven peer to peer communications between submitter'"'"'s systems and client systems and enables more detailed status and control information to be passed without overloading the centralized server, and by avoiding polling also provides more immediate feedback of results. Multiple process threads are employed on both the submitter'"'"'s and client systems, and a user interface consolidates and displays results to the submitter allow commands to be sent to processes running on client batch systems.
-
Citations
11 Claims
-
1. A computer system comprising:
-
a centralized server having overall control of a batch or grid computing system for receiving and dispatching requests from a submitter computer to an available processing computer, said processing computer being selected by said centralized server; and means for communicating a batch job request from the submitter computer to the processing computer via the centralized server for processing by the processing computer, said batch job request identifying a monitoring process; wherein the monitoring process is executed at the submitter computer and the processing computer, the monitoring process establishing a communication link between the submitter computer and the processing computer without participation by the centralized server, the monitoring process being configured to communicate a processing status of the batch job request via said communication link; wherein said centralized server is only responsible for workload balancing and job initiation and completion while other command and status information is communicated directly between the submitter computer and the processing computer that are processing their respective workloads; and wherein the submitter computer and the processing computer use event-driven peer to peer communications between each other such that detailed status and control information is passed without overloading the centralized server and without polling. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
Specification