System and method for supporting fault tolerant job management in a cloud platform environment
First Claim
1. A system for supporting fault tolerant job management in a cloud computing environment, comprising:
- a computer, including a cloud environment executing thereon;
one or more service management engines;
wherein each service management engine corresponds to a service type and supports provisioning of an enterprise application as a service instance of the service type on one or more clusters of the cloud environment by generating administrative commands;
a job manager that manages the execution of jobs within the cloud environment including their job states;
a job manager service that provides an application program interface which receives the administrative commands to be processed within the cloud environment as jobs;
wherein the job manager is configured to register and assign identifications to the jobs and is further configured to save or purge results of execution of the administrative commands in accordance with a retention policy; and
wherein the job manager supports fault tolerant job processing includingassociating the jobs with checkpoints,recognizing a failover command for the jobs, andassociating the jobs with work units of the administrative commands, andstoring a state for each job upon processing each work unit of the command.
1 Assignment
0 Petitions
Accused Products
Abstract
In accordance with an embodiment, described herein is a system and method for supporting fault tolerant job management for use with a cloud computing environment. In accordance with an embodiment, the system comprises a job manager that manages the execution of jobs within the cloud environment including their job states, and a job manager service that provides an application program interface which receives administrative commands to be processed within the cloud environment as jobs. The job manager supports fault tolerant job processing including associating the jobs with checkpoints, recognizing a failover command for the jobs, and associating the jobs with work units of the administrative commands, and storing a state for each job upon processing each work unit of the command.
39 Citations
15 Claims
-
1. A system for supporting fault tolerant job management in a cloud computing environment, comprising:
-
a computer, including a cloud environment executing thereon; one or more service management engines; wherein each service management engine corresponds to a service type and supports provisioning of an enterprise application as a service instance of the service type on one or more clusters of the cloud environment by generating administrative commands; a job manager that manages the execution of jobs within the cloud environment including their job states; a job manager service that provides an application program interface which receives the administrative commands to be processed within the cloud environment as jobs; wherein the job manager is configured to register and assign identifications to the jobs and is further configured to save or purge results of execution of the administrative commands in accordance with a retention policy; and wherein the job manager supports fault tolerant job processing including associating the jobs with checkpoints, recognizing a failover command for the jobs, and associating the jobs with work units of the administrative commands, and storing a state for each job upon processing each work unit of the command. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for supporting fault tolerant job management in a cloud computing environment, comprising:
-
providing, at a computer, a cloud environment; providing one or more service management engines, wherein each service management engine corresponds to a service type and supports provisioning of an enterprise application as a service instance of the service type on one or more clusters of the cloud environment by generating administrative commands; providing a job manager that manages the execution of jobs within the cloud environment including their job states; providing a job manager service that provides an application program interface which receives the administrative commands to be processed within the cloud environment as jobs; wherein the job manager is configured to register and assign identifications to the jobs and is further configured to save or purge results of execution of the administrative commands in accordance with a retention policy; and wherein the job manager supports fault tolerant job processing including associating the jobs with checkpoints, recognizing a failover command for the jobs, and associating the jobs with work units of the administrative commands, and storing a state for each job upon processing each work unit of the command. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer readable storage medium, including instructions stored thereon which when read and executed by one or more computers cause the one or more computers to perform the steps comprising:
-
providing, at a computer, a cloud environment; providing one or more service management engines, wherein each service management engine corresponds to a service type and supports provisioning of an enterprise application as a service instance of the service type on one or more clusters of the cloud environment by generating administrative commands; providing a job manager that manages the execution of jobs within the cloud environment including their job states; providing a job manager service that provides an application program interface which receives the administrative commands to be processed within the cloud environment as jobs; wherein the job manager is configured to register and assign identifications to the jobs and is further configured to save or purge results of execution of the administrative commands in accordance with a retention policy; and wherein the job manager supports fault tolerant job processing including associating the jobs with checkpoints, recognizing a failover command for the jobs, and associating the jobs with work units of the administrative commands, and storing a state for each job upon processing each work unit of the command. - View Dependent Claims (12, 13, 14, 15)
-
Specification