Distributed kernel operating system
First Claim
1. A method of managing communication between multiple nodes in a distributed message-passing operating system comprising:
establishing by a media selection layer, a single node-to-node connection between a local node and a single remote node, the single node-to-node connection associated with a local connection identifier and a remote connection identifier, the local connection identifier associated with the local node and the remote connection identifier associated with the remote node, wherein one or more user-level connections are established for transmission of data over the single node-to-node connection;
maintaining, at the local node, a pool of available communications networks that interconnect the local node with the single remote node, wherein the pool of available communications networks are utilized for the transmission of the data over the single node-to-node connection and the one or more user-level connections;
determining if one of the available communication networks has failed;
removing the failed communication network from the pool;
transmitting, periodically across the failed communication network, maintenance packets from the local node to the single remote node; and
determining whether the failed communication network has recovered.
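The pool-management steps recited in claim 1 can be illustrated with a minimal sketch. The class name `NetworkPool`, its method names, and the `send_maintenance_packet` callback are hypothetical and do not come from the patent. Returning a recovered network to the pool is also an assumption, since claim 1 recites only determining whether the network has recovered.

```python
from dataclasses import dataclass, field


@dataclass
class NetworkPool:
    """Tracks which networks to one remote node are usable (hypothetical model)."""
    available: set = field(default_factory=set)
    failed: set = field(default_factory=set)

    def report_failure(self, network: str) -> None:
        # Remove the failed network from the pool so no data is sent over it.
        if network in self.available:
            self.available.remove(network)
            self.failed.add(network)

    def probe_failed(self, send_maintenance_packet) -> list:
        # Send one maintenance packet over each failed network. A network
        # whose probe succeeds is treated as recovered and, by assumption,
        # placed back in the pool.
        recovered = [n for n in sorted(self.failed) if send_maintenance_packet(n)]
        for n in recovered:
            self.failed.remove(n)
            self.available.add(n)
        return recovered
```

In this sketch the caller would invoke `probe_failed` on a timer to obtain the periodic transmission described in the claim; the callback returns `True` when the remote node answers the maintenance packet.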
Abstract
An improved distributed operating system over a network of computer systems is described. Existing distributed operating systems have transmission performance limitations dictated by their inability to (1) reliably handle transient communication failures and rapid node reboots, (2) provide a transmission protocol that adapts to link reliability, and (3) allow transmissions to occur over an arbitrary combination of communication links. The systems and methods described herein solve these problems by providing a reliable node-to-node session protocol that offers high performance message delivery and multi-interface management and support. This is done by transmitting all data between two nodes of the operating system over a single connection that may dynamically exploit multiple interfaces between the nodes.
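As a rough illustration of sending all traffic between two nodes over a single connection that uses several interfaces, the sketch below multiplexes user-level connections onto one node-to-node connection. The class `NodeConnection`, the header layout, and the round-robin interface choice are assumptions made for illustration; the abstract does not specify how an interface is selected.

```python
from itertools import cycle


class NodeConnection:
    """One connection to a remote node, shared by all user-level connections."""

    def __init__(self, local_id: int, remote_id: int, interfaces: list):
        self.local_id = local_id
        self.remote_id = remote_id
        self._interfaces = cycle(interfaces)  # round-robin is an assumed policy

    def send(self, user_conn: int, payload: bytes) -> tuple:
        # Tag the payload with both connection identifiers and the user-level
        # connection number, then pick the next interface in rotation.
        interface = next(self._interfaces)
        header = (self.local_id, self.remote_id, user_conn)
        return interface, header, payload
```

Here `send` only returns what would be transmitted and on which interface. A real implementation would hand the packet to a driver and could weight the choice by link reliability, which the abstract names as a goal.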
14 Claims
Claim 1 is the independent claim recited in full under First Claim above. Claims 2 through 14 depend on claim 1.
Specification