Implementing redundancy on Infiniband (IB) networks
Abstract
Methods, systems, and physical computer-readable storage media are provided to improve the efficiency of systems operating on internet protocol (IP) over Infiniband (IB) networks. In an embodiment, by way of example only, a method is provided for implementing redundancy for IB networks. The method includes detecting a failure at a first source port of a source node having a plurality of source ports, selecting a new source port, sending an address resolution protocol (ARP) message to one or more other nodes in communication with the source node, the message including an IP address of the source node and an IB address of the new source port of the source node, and re-balancing host traffic.
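The failover sequence described in the abstract can be sketched roughly as follows. This is a minimal illustration in Python, not the patented implementation: the `Port` and `SourceNode` classes, the `fail_over` method, the string-valued IB addresses, and the "first healthy port" selection policy are all assumptions made for the example, and traffic re-balancing is left out.

```python
from dataclasses import dataclass, field


@dataclass
class Port:
    ib_address: str   # hypothetical IB address, modelled as a plain string
    up: bool = True


@dataclass
class SourceNode:
    ip: str
    ports: list                               # list of Port objects
    active: int = 0                           # index of the port in use
    peers: set = field(default_factory=set)   # IPs of nodes we talk to

    def fail_over(self):
        """Handle a failure of the active port.

        Returns the ARP announce messages to send, one per known peer,
        as (peer IP, source IP, new IB address) tuples.
        """
        # Step 1: the failure of the active port has been detected.
        self.ports[self.active].up = False

        # Step 2: select a new source port (assumed policy: first healthy one).
        candidates = [i for i, p in enumerate(self.ports) if p.up]
        if not candidates:
            raise RuntimeError("no healthy source port left")
        self.active = candidates[0]
        new_addr = self.ports[self.active].ib_address

        # Step 3: announce the unchanged IP with the new port's IB address.
        return [(peer, self.ip, new_addr) for peer in sorted(self.peers)]
```

In this sketch a node with ports `ib-a` and `ib-b` that loses `ib-a` would announce its unchanged IP address together with `ib-b` to every peer it knows about, so that peers can redirect their traffic to the surviving port.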
15 Claims
1. A method of implementing redundancy for Infiniband (IB) networks, the method comprising:
- detecting a failure at a first source port of a source node having a plurality of source ports;
- selecting a new source port of the source node;
- determining in a first level of an address resolution protocol (ARP) cache a corresponding IP over IB global unique identifier (GUID) of a destination node to the IP address of the destination node;
- locating a corresponding IB address to the IP over IB GUID of the destination node in a second level of the ARP cache;
- sending an ARP announce message to one or more other nodes in communication with the source node by identifying the one or more other nodes in communication with the source node from a global ARP cache, wherein the ARP announce message includes an IP address of a second source node and an IB address of the new source port of the source node;
- re-balancing host traffic across the plurality of IB nodes; and
- identifying from a plurality of source ports of the source node the new source port of the source node with which to receive traffic from the one or more other nodes, after the step of detecting.
Dependent claims: 2, 3, 4, 5, 6.
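The two-level ARP cache lookup recited in claim 1 (IP address to IP over IB GUID, then GUID to IB address) can be illustrated with a short sketch. The class name `TwoLevelArpCache`, its methods, and the use of plain dictionaries are assumptions for illustration only. The `update_port` helper reflects one assumed benefit of the split, namely that an ARP announce only needs to touch the second level; the claims themselves do not state this.

```python
class TwoLevelArpCache:
    """Hypothetical two-level cache: IP -> GUID, then GUID -> IB address."""

    def __init__(self):
        self.ip_to_guid = {}   # first level
        self.guid_to_ib = {}   # second level

    def learn(self, ip, guid, ib_address):
        """Record a destination node in both levels."""
        self.ip_to_guid[ip] = guid
        self.guid_to_ib[guid] = ib_address

    def resolve(self, ip):
        """Return the IB address for an IP, or None if either level misses."""
        guid = self.ip_to_guid.get(ip)
        if guid is None:
            return None
        return self.guid_to_ib.get(guid)

    def update_port(self, guid, new_ib_address):
        """Assumed handling of an ARP announce: rewrite only the second level."""
        self.guid_to_ib[guid] = new_ib_address
```

With this layout, every IP address that maps to the same GUID picks up a new IB address after a single second-level update, which is why the sketch keeps the two mappings separate.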
7. An Infiniband system comprising:
- a plurality of nodes, each node including more than one port;
- an IB switch connected to the plurality of nodes; and
- a processor configured
  - to detect a failure at a first port of a first node;
  - to select a new source port of the first node;
  - to determine in a first level of an address resolution protocol (ARP) cache a corresponding IP over IB global unique identifier (GUID) of a destination node to the IP address of the destination node;
  - to locate a corresponding IB address to the IP over IB GUID of the destination node in a second level of the ARP cache;
  - to send an ARP announce message to one or more nodes of the plurality of IB nodes in communication with the first node by identifying the one or more other nodes in communication with the source node from a global ARP cache, wherein the ARP announce message includes an IP address of the first node and IB address of the new port of the first node;
  - to re-balance host traffic across the plurality of IB nodes; and
  - to identify from a plurality of source ports of the source node the new source port of the source node with which to receive traffic from the one or more other nodes, after the step of detecting.
Dependent claims: 8, 9.
10. Executing data instructions, by a processor, embodied on a physical non-transitory computer-readable storage medium comprising a computer program product for implementing redundancy for Infiniband (IB) networks, the physical computer-readable storage medium comprising:
- computer code for detecting a failure at a first source port of a source node having a plurality of source ports;
- computer code for selecting a new source port;
- computer code for determining in a first level of an address resolution protocol (ARP) cache a corresponding IP over IB global unique identifier (GUID) of a destination node to the IP address of the destination node;
- computer code for locating a corresponding IB address to the IP over IB GUID of the destination node in a second level of the ARP cache;
- computer code for sending an ARP announce message to one or more other nodes in communication with the source node by identifying the one or more other nodes in communication with the source node from a global ARP cache, wherein the ARP announce message includes an IP address of the source node and IB address of the new source port of the source node;
- computer code for re-balancing host traffic across the plurality of IB nodes; and
- computer code for identifying from a plurality of source ports of the source node the new source port of the source node with which to receive traffic from the one or more other nodes, after the step of detecting.
Dependent claims: 11, 12, 13, 14, 15.
Specification