Dynamic process recovery in a distributed environment
First Claim
1. A method of process recovery in a distributed environment, comprising:
- detecting a process heartbeat associated with a first instance of a process, said process having services;
determining that said process has failed if said heartbeat was not received at a predetermined next heartbeat time;
releasing resources associated with said process;
restarting a second instance of said process and said services from a last known good state of said first instance of said process.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods for providing dynamic process recovery that supports recovery of event driven services. The present invention addresses situations where a service-hosting process fails as well as all of the services which were running in that process. To recover from this scenario, the present invention restarts the services a different process from a last know good state. The present invention also provides for situations where a process is shut down in a controlled manner, but the services running within it are still able to drive. Here again, the services are restarted in a secondary process. The recovery modal described here, therefore, advantageously addresses a problem in the design of a distributed event-driven environment.
9 Citations
20 Claims
-
1. A method of process recovery in a distributed environment, comprising:
-
detecting a process heartbeat associated with a first instance of a process, said process having services;
determining that said process has failed if said heartbeat was not received at a predetermined next heartbeat time;
releasing resources associated with said process;
restarting a second instance of said process and said services from a last known good state of said first instance of said process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer readable medium having computer-readable instructions stored thereon for process recovery in a distributed environment, said computer-readable instructions performing the method comprising:
-
detecting a process heartbeat associated with a first instance of a process, said process having services;
determining that said process has failed if said heartbeat was not received at a predetermined next heartbeat time;
releasing resources associated with said process;
restarting a second instance of said process and said services from a last known good state of said first instance of said process. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A distributed computing environment, comprising:
-
a first server running a process, said process having services;
a second server having a backup process, said backup process is capable of running services if the first server is unavailable; and
a database server that tracks a heartbeat from said first instance of said process, wherein if it is determined said process has failed because said heartbeat was not received at a predetermined next heartbeat time, said database server releases resources associated with said process and restarts said services in said backup process from a last known good state of said first instance of said process. - View Dependent Claims (18, 19, 20)
-
Specification