Automatic reconnection of partner software processes in a fault-tolerant computer system
First Claim
1. A method for automatically reconnecting software processes that share an interface after one or more said software processes, known as partner processes, fails over to a redundant backup copy, said method comprising ofa. first means for registering each copy of said partner processes with a join manager component when said copy of said partner processes finishes initializing b. second means for controlling the primary or backup status of each said copy of said partner processes that executes on one or a plurality of processing units in a computer system c. third means for said partner processes requesting that said join manager create or release an association between two or more said partner processes, said association being known as a join d. fourth means for detecting the failure of the primary or backup copy of said partner process associated with said join e. fifth means for notifying the remaining partner processes associated with said join of the failure of said primary copy of said partner process associated with said join f. sixth means for said join manager notifying said remaining partner processes when a replacement primary copy of said failed partner process is restored to service whereby said partner processes do not have to be aware of the redundancy or replication strategy used by other said partner processes associated with said join, which may be different for each said partner process associated with said join, and whereby said automatic reconnection of said partner processes is independent of the hardware or operating system support for such reconnection.
3 Assignments
0 Petitions
Accused Products
Abstract
In a fault-tolerant system employing software fault-tolerance techniques, automatic reconnection of partner processes after a failure of one or more of the partner processes requires that the partner processes are aware of the redundancy strategy employed by their partners. The present invention insulates the partner processes from the need for this knowledge, and also enables automatic reconnection between partner processes running in different parts of a heterogeneous distributed system.
-
Citations
26 Claims
-
1. A method for automatically reconnecting software processes that share an interface after one or more said software processes, known as partner processes, fails over to a redundant backup copy, said method comprising of
a. first means for registering each copy of said partner processes with a join manager component when said copy of said partner processes finishes initializing b. second means for controlling the primary or backup status of each said copy of said partner processes that executes on one or a plurality of processing units in a computer system c. third means for said partner processes requesting that said join manager create or release an association between two or more said partner processes, said association being known as a join d. fourth means for detecting the failure of the primary or backup copy of said partner process associated with said join e. fifth means for notifying the remaining partner processes associated with said join of the failure of said primary copy of said partner process associated with said join f. sixth means for said join manager notifying said remaining partner processes when a replacement primary copy of said failed partner process is restored to service whereby said partner processes do not have to be aware of the redundancy or replication strategy used by other said partner processes associated with said join, which may be different for each said partner process associated with said join, and whereby said automatic reconnection of said partner processes is independent of the hardware or operating system support for such reconnection.
Specification