Autonomic monitoring in a grid environment
First Claim
1. A system for performing autonomic monitoring of objects in a networked computing grid having a plurality of resources for executing a plurality of jobs, comprising:
- a configuration module for receiving information on one or more objects to be monitored and associated exception conditions for the one or more objects, the exception conditions being defined by parameters associated with job execution on the grid;
an information collection module in communication with the configuration module, the information collection module being operable to collect job execution information for the one or more objects to be monitored; and
an exception module in communication with the information collection module, the exception module being operable to identify the existence of the one or more exception conditions by evaluating the job execution information for the one or more objects to be monitored.
5 Assignments
0 Petitions
Accused Products
Abstract
A system for performing autonomic monitoring in a computing grid is described. The system includes a plurality of modules, which when implemented into a computing grid, are operable to analyze objects of the grid and identify exception conditions associated with the objects. The system includes a configuration module for receiving information on specified objects to be monitored and exception conditions for the objects, an information collection module to collect job execution data associated with the objects, and an exception module to evaluate the job execution data associated with the objects and identify existing exception conditions. Related methods of performing autonomic monitoring in a grid system are also described.
-
Citations
20 Claims
-
1. A system for performing autonomic monitoring of objects in a networked computing grid having a plurality of resources for executing a plurality of jobs, comprising:
-
a configuration module for receiving information on one or more objects to be monitored and associated exception conditions for the one or more objects, the exception conditions being defined by parameters associated with job execution on the grid;
an information collection module in communication with the configuration module, the information collection module being operable to collect job execution information for the one or more objects to be monitored; and
an exception module in communication with the information collection module, the exception module being operable to identify the existence of the one or more exception conditions by evaluating the job execution information for the one or more objects to be monitored. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for performing autonomic monitoring of jobs in a networked computing grid, comprising:
-
defining one or more exception conditions for one or more jobs to be executed on one or more hosts within the grid;
collecting status information on the one or more jobs during execution of the one or more jobs; and
evaluating the job status information to determine whether the one or more exception condition exists. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A method for performing autonomic monitoring of objects in a computing grid, comprising:
-
defining one or more exception conditions for one or more objects in the grid, the one or more exception conditions being defined by parameters associated with job execution on the grid;
collecting job execution information on the one or more objects; and
evaluating the job execution information to determine whether the one or more exception conditions exist. - View Dependent Claims (17, 18, 19, 20)
-
Specification