Grid computing accounting and statistics management system
First Claim
1. A method of managing a grid computing environment comprising a plurality of resources, said method including steps of:
- determining currently executing and scheduled job portions having particular characteristics for said plurality of resources;
performing job portions of a data processing request using respective heterogeneous and distributed resources of said grid computing environment, said resources including a plurality of platforms having different implementations, semantic behaviors and application programming interfaces and which may exhibit different performance for a given portion of a given job;
periodically capturing and storing as data records, in correspondence with a plurality of characteristics of a respective job portion, a snapshot of current instantaneous operations of individual application environments in said grid computing environment from said respective resources performing said job portions of said data processing request in content-addressable storage, said data records corresponding to performance of respective ones of said respective resources while processing respective ones of said job portions, said performance being monitored in correspondence with said characteristics of respective ones of said job portions wherein said plurality of characteristics include nature of processing being performed during said instantaneous operation, said platform on which said processing is performed, said resource or resources on which said processing is performed and concurrent processing in said grid environment, such that any performance data or other parameter of said data records for respective ones of said resources may be retrieved based on any other performance data or parameter of said data records or logical combination thereof with arbitrarily fine granularity to closely match characteristics of job portions of any other job to assess impact of said other job on said grid environment, determine allocation of said resources to said portions of said other job or predict performance of said other job as performed on available resources;
storing actual performance and predicted performance of said other job as performed by ones of said resources allocated to said other job;
selecting data records in accordance with one or more of said plurality of characteristics stored in said data records as selected data records;
retrieving said selected data records; and
processing data retrieved in said retrieving step to produce processed performance data as historical performance statistics corresponding to said characteristics of said job portions and said resources.
1 Assignment
0 Petitions
Accused Products
Abstract
Performance data is captured periodically from resources and groups of resources in a grid computing environment and stored in a content-addressable data repository from which it can be accessed in response to an arbitrarily complex query in regard to specifics of particular jobs or job portions, particular resources utilized, grid architecture, application environment, concurrent jobs or job portions and the like. The data repository may be distributed or divided in regard to grid environment architecture, security domains or the like and each portion or division may be implemented in a modular fashion including an accounting and statistics management module and additional modules or computing engines for performing particular desired analyses or functions. Results of such analyses or functions may be communicated to a grid workload agent (and associated modules) to improve grid management on a fine-grained basis.
16 Citations
16 Claims
-
1. A method of managing a grid computing environment comprising a plurality of resources, said method including steps of:
-
determining currently executing and scheduled job portions having particular characteristics for said plurality of resources; performing job portions of a data processing request using respective heterogeneous and distributed resources of said grid computing environment, said resources including a plurality of platforms having different implementations, semantic behaviors and application programming interfaces and which may exhibit different performance for a given portion of a given job; periodically capturing and storing as data records, in correspondence with a plurality of characteristics of a respective job portion, a snapshot of current instantaneous operations of individual application environments in said grid computing environment from said respective resources performing said job portions of said data processing request in content-addressable storage, said data records corresponding to performance of respective ones of said respective resources while processing respective ones of said job portions, said performance being monitored in correspondence with said characteristics of respective ones of said job portions wherein said plurality of characteristics include nature of processing being performed during said instantaneous operation, said platform on which said processing is performed, said resource or resources on which said processing is performed and concurrent processing in said grid environment, such that any performance data or other parameter of said data records for respective ones of said resources may be retrieved based on any other performance data or parameter of said data records or logical combination thereof with arbitrarily fine granularity to closely match characteristics of job portions of any other job to assess impact of said other job on said grid environment, determine allocation of said resources to said portions of said other job or predict performance of said other job as performed on available resources; storing actual performance and predicted performance of said other job as performed by ones of said resources allocated to said other job; selecting data records in accordance with one or more of said plurality of characteristics stored in said data records as selected data records; retrieving said selected data records; and processing data retrieved in said retrieving step to produce processed performance data as historical performance statistics corresponding to said characteristics of said job portions and said resources. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for managing a grid computing environment comprising:
-
a plurality of heterogenous and distributed data processing resources, wherein respective ones of said data processing resources include a plurality of platforms having different implementations, semantic behaviors and application programming interfaces and which may exhibit different performance for a given portion of a given job; a grid management system including a grid workload agent to monitor respective ones of said data processing resources of said grid computing environment processing job portions of data processing requests, wherein said job portions have particular characteristics; content-addressable storage means for periodically storing data records forming a snapshot of current instantaneous operations of individual application environments in said grid computing environment which include, in correspondence with a plurality of characteristics of a respective job portion, performance data from respective resources monitored by said grid workload agent, wherein said plurality of characteristics include nature of processing being performed, said platform on which said processing is performed during said instantaneous operation, said resource or resources on which said processing is performed and concurrent processing in said grid environment, said performance data including data identifying a corresponding monitored respective resource while that resource is processing a respective one of said job portions of data processing requests in correspondence with said characteristic of a respective one of said job portions such that any performance data or other parameter of said data records for respective ones of said resources may be retrieved based on any other performance data or parameter of said data records or logical combination thereof with arbitrarily fine granularity to closely match characteristics of job portions of any other job to assess impact of said other job on said grid environment, determine allocation of said data processing resources to said portions of said other job or predict performance of said other job as performed on available ones of said data processing resources; a memory for storing actual performance and predicted performance of said other job as performed by ones of said data processing resources allocated to said other job; a content-addressable memory for retrieving said actual performance data on the basis of a said characteristic or combination of said characteristics, and processing data stored by said content-addressable memory to form processed performance data; and at least one module cooperating with said grid workload agent to manage said grid computing environment based on said processed performance data. - View Dependent Claims (12, 13, 14, 15, 16)
-
Specification