Systems and methods for providing a quiescing protocol
First Claim
Patent Images
1. A distributed system configured to process messages within the distributed system, the distributed system comprising:
- a plurality of distributed nodes configured to permit communication among the plurality of nodes and to process messages;
at least one of the plurality of nodes acting as a coordinator, the coordinator configured to initiate a quiescing protocol and to determine whether relevant messages have been processed by the plurality of the distributed nodes.
12 Assignments
0 Petitions
Accused Products
Abstract
The systems and methods of the present invention provide a quiescing protocol. In one embodiment, nodes of a system utilize the protocol to complete processing until they reach a consistent state. In one embodiment, a coordinator initiates the quiescing process and the nodes communicate with each other to determine whether their messages have been processed and communicate with the coordinator to determine when all of the messages have been processed
173 Citations
20 Claims
-
1. A distributed system configured to process messages within the distributed system, the distributed system comprising:
-
a plurality of distributed nodes configured to permit communication among the plurality of nodes and to process messages;
at least one of the plurality of nodes acting as a coordinator, the coordinator configured to initiate a quiescing protocol and to determine whether relevant messages have been processed by the plurality of the distributed nodes. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of quiescing a set of messages processed within the system, the method comprising:
-
sending start messages to a plurality of nodes indicating that a quiescing protocol is to begin;
receiving checkpoint messages from the plurality of nodes indicating that the plurality of nodes have received their probe response messages;
sending continue messages to the plurality of nodes requesting received message information from the plurality of nodes;
receiving received message information from the plurality of nodes; and
determining that the received message information indicates that the plurality of nodes have not received any new messages. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A system for quiescing a set of messages processed within the distributed system, the system comprising:
-
means for sending start messages to a plurality of nodes indicating that a quiescing protocol is to begin;
means for receiving checkpoint messages from the plurality of nodes indicating that the plurality of nodes have received their probe response messages;
means for sending continue messages to the plurality of nodes requesting received message information from the plurality of nodes;
means for receiving received message information from the plurality of nodes; and
means for determining that the received message information indicates that the plurality of nodes have not received any new messages.
-
-
13. A method of quiescing a distributed system, the method comprising:
-
setting a counter and counting messages that have been received;
ensuring that all messages previously sent have been processed by their destination;
performing a checkpoint; and
performing a second checkpoint using the value of the counter.
-
-
14. A method of quiescing a distributed system, the method comprising:
-
receiving a start message from a coordinator indicating that a quiescing protocol is to begin;
suspending new messages;
setting a received message information to indicate that there are no messages received;
receiving messages from the plurality of nodes;
updating the received message information to indicate that a message has been received;
sending probe messages to the plurality of nodes;
receiving probe response messages from the plurality of nodes;
receiving probe messages from the plurality of nodes;
for each probe message received from the plurality of nodes, sending corresponding probe response messages to the plurality of nodes;
sending a checkpoint message to the coordinator indicating that the probe response messages have been received from the plurality of nodes;
receiving a continue message from the coordinator requesting the received message information;
sending the received message information to the coordinator; and
receiving a done message from the coordinator. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification