SIP server architecture fault tolerance and failover
First Claim
1. A computer implemented method for providing failover and fault tolerance, comprising:
maintaining a first replica in a partition for storing state information;
maintaining an engine node that writes and reads state information to and from the first replica and uses the state information to process messages;
detecting that the first replica has failed by the engine node;
reporting the failure to a second replica in the partition; and
updating a view of the partition by the second replica in order to reflect the failure of the first replica.
Abstract
The SIP server can be comprised of an engine tier and a state tier distributed on a cluster network. Engine nodes in the engine tier can process SIP messages and can read/write state information from/to the state tier. The state tier can maintain state information in a set of partitions, each made up of one or more replicas that contain duplicate information. The engine nodes can be adapted to detect and report replica failures, and the replicas can in turn be adapted to detect and report engine node failures. A replica can detect a fault with an engine node if the engine node fails to poll the replica for a specified period of time, and can then report the failure. An engine node can detect the failure of a replica when reading or writing state information and can report the failure to another replica, which can be responsible for updating the partition view to exclude dead replicas.
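The replica-failure path described in the abstract can be illustrated with a minimal in-memory sketch. It is not the patented implementation: the class names (`Replica`, `EngineNode`, `ReplicaDown`), the `alive` flag used to simulate a crash, and the choice to report to the first reachable replica are all assumptions made for illustration.

```python
class ReplicaDown(Exception):
    """Raised when a read or write against a replica fails (hypothetical)."""


class Replica:
    def __init__(self, name):
        self.name = name
        self.alive = True   # flipped to False to simulate a crash
        self.store = {}     # state information held by this replica
        self.view = []      # names of replicas this replica believes are live

    def _check(self):
        if not self.alive:
            raise ReplicaDown(self.name)

    def write(self, key, value):
        self._check()
        self.store[key] = value

    def read(self, key):
        self._check()
        return self.store.get(key)

    def report_failure(self, failed_name):
        # The surviving replica updates the partition view to exclude the
        # dead replica and returns the new view to the reporting engine.
        self._check()
        self.view = [n for n in self.view if n != failed_name]
        return list(self.view)


class EngineNode:
    def __init__(self, replicas):
        # One partition only; the engine seeds every replica with the same
        # initial view (a simplification for this sketch).
        self.replicas = {r.name: r for r in replicas}
        self.view = list(self.replicas)
        for r in replicas:
            r.view = list(self.view)

    def write(self, key, value):
        # Write to every replica in the current view; a failed write is how
        # the engine detects that a replica is down.
        for name in list(self.view):
            try:
                self.replicas[name].write(key, value)
            except ReplicaDown:
                self._report(name)

    def _report(self, failed_name):
        # Report the failure to another replica in the same partition.
        for name in self.view:
            if name == failed_name:
                continue
            try:
                self.view = self.replicas[name].report_failure(failed_name)
                return
            except ReplicaDown:
                continue
        raise RuntimeError("no live replica left in the partition")
```

In this sketch, marking one of two replicas as dead and then calling `engine.write(...)` leaves both the engine and the surviving replica with a one-entry view, while the write still lands on the survivor.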
20 Claims
1. A computer implemented method for providing failover and fault tolerance, comprising:
maintaining a first replica in a partition for storing state information;
maintaining an engine node that writes and reads state information to and from the first replica and uses the state information to process messages;
detecting that the first replica has failed by the engine node;
reporting the failure to a second replica in the partition; and
updating a view of the partition by the second replica in order to reflect the failure of the first replica.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer implemented method for providing failover and fault tolerance, comprising:
maintaining a replica for storing state information;
maintaining an engine node that writes and reads state information to and from the replica and uses the state information to process messages;
periodically polling the replica by the engine node;
failing to poll by the engine node for a specified period of time; and
determining that the engine node has failed by the replica upon expiration of the specified period of time.
Dependent claims: 9, 10, 11, 12, 13, 14
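The poll-timeout detection recited in claim 8 can be sketched as follows. This is an illustrative sketch only: the `ReplicaMonitor` class, its method names, and the injectable `clock` parameter are hypothetical and do not come from the patent text. The clock is injectable so the behavior can be exercised without real waiting.

```python
import time


class ReplicaMonitor:
    """Replica-side bookkeeping of engine polls (hypothetical sketch)."""

    def __init__(self, timeout_s, clock=time.monotonic):
        self.timeout_s = timeout_s   # the "specified period of time"
        self.clock = clock           # injectable time source
        self.last_poll = {}          # engine id -> time of most recent poll
        self.failed = set()          # engines already declared failed

    def record_poll(self, engine_id):
        # Called each time an engine node polls this replica.
        self.last_poll[engine_id] = self.clock()
        self.failed.discard(engine_id)

    def check(self):
        # Declare failed every engine whose last poll is at least
        # timeout_s old; each failure is returned only once.
        now = self.clock()
        newly_failed = [
            engine_id
            for engine_id, seen in self.last_poll.items()
            if now - seen >= self.timeout_s and engine_id not in self.failed
        ]
        self.failed.update(newly_failed)
        return sorted(newly_failed)
```

A replica would call `check()` on its own timer and forward the returned engine ids to whatever component handles the failure; that reporting step is left out here because the claim does not specify it.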
15. A system for providing failover and fault tolerance, comprising:
a replica connected to a cluster network and adapted to store state information used for processing messages; and
an engine node connected to the cluster network and adapted to read and write the state information to and from the first replica when processing the messages;
wherein the replica is adapted to detect and report engine node failures in the cluster and the engine node is adapted to detect and report replica failures in the cluster.
Dependent claims: 16, 17, 18, 19, 20
Specification