Link aggregation split-brain detection and recovery
First Claim
Patent Images
1. A network system comprising:
- a set of peers within a link aggregation group (LAG), the set of peers including a first network element and a second network element; and
a status resolution server to connect to the set of peers within the link aggregation group, wherein a subset of the set of peers within the LAG is to query the status resolution server to determine an operational status of another peer in the set of peers in response to detection of an apparent failure of that peer by the subset of the set of peers, the status resolution server is to receive a query from the second network element and determine an operational status of the first network element, wherein the second network element is part of the subset of the set of peers, the LAG is a multi-chassis LAG, the status resolution server is a network management device including a link aggregation agent, the link aggregation agent configured to access a remote connection to a central system database of the first network element to determine the operational status of the first network element.
1 Assignment
0 Petitions
Accused Products
Abstract
Various embodiments are described herein that provide a network system comprising a set of peers within a link aggregation group (LAG), the first set of peers including a first network element and a second network element and a status resolution server to connect to the set of peers within the link aggregation group, wherein one or more peers within the LAG is to query the status resolution server to determine an operational status of a peer in the set of peers in response to detection of an apparent failure of the peer.
47 Citations
33 Claims
-
1. A network system comprising:
-
a set of peers within a link aggregation group (LAG), the set of peers including a first network element and a second network element; and a status resolution server to connect to the set of peers within the link aggregation group, wherein a subset of the set of peers within the LAG is to query the status resolution server to determine an operational status of another peer in the set of peers in response to detection of an apparent failure of that peer by the subset of the set of peers, the status resolution server is to receive a query from the second network element and determine an operational status of the first network element, wherein the second network element is part of the subset of the set of peers, the LAG is a multi-chassis LAG, the status resolution server is a network management device including a link aggregation agent, the link aggregation agent configured to access a remote connection to a central system database of the first network element to determine the operational status of the first network element. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A non-transitory machine readable medium storing instructions which, when executed by one or more control plane processors of a network element, cause the network element to perform operations comprising:
-
configuring the network element as a first peer of a link aggregation group (LAG), the first peer coupled to a second peer via a peer link including one or more network connections, and wherein the LAG is a multi-chassis LAG; in response to a loss of communication over the peer link, assuming operations of the second peer by the network element, and querying a status resolution server connected by the network element via a network to determine an operational status of the second peer, wherein querying the status resolution server includes querying a link aggregation agent of a network management device; and resuming operation as the first peer in response to receiving an indication from the status resolution server that the second peer is operational. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A network element comprising:
-
a control plane including one or more processors and memory, wherein the control plane is to configure the network element as a first peer of a link aggregation group (LAG), wherein the LAG is a multi-chassis LAG; and a data plane coupled to the control plane, the data plane including a network interface having one or more peer ports and one or more non-peer ports, the one or more peer ports to couple to a second peer of the LAG and the one or more non-peer ports to couple to an external device, wherein in response to a loss of communication with the second peer via the one or more peer ports, the control plane is to cause the network element to assume operational duties of the second peer and query a status resolution server for an operational status of the second peer, wherein the status resolution server is a network management device including a link aggregation agent. - View Dependent Claims (22, 23, 24)
-
-
25. A network system comprising:
-
a link aggregation group (LAG) including multiple network elements, the multiple network elements interconnected via a peer link including one or more network connections; a status resolution server to monitor operational status of network elements of the LAG and a link state of the peer link; and wherein the status resolution server is further to detect occurrence of a split brain scenario caused by a failure of the peer link and automatically re-configure one or more network elements of the LAG to resolve the split brain scenario by indicating to each network element coupled to the peer link an operational status of other network elements coupled to the peer link and to establish a communication link between the network elements of the LAG to restore communication lost by the failure of the peer link. - View Dependent Claims (26)
-
-
27. A method of managing operations of one or more network elements of a link aggregation group (LAG), the method comprising:
-
monitoring operational status of one or more network elements of the LAG, wherein the LAG is a multi-chassis LAG; monitoring a link state of a peer link, the peer link including one or more network connections between the network elements of the LAG; detecting occurrence of a split brain scenario caused by a failure of the peer link; and automatically re-configuring one or more network elements of the LAG to resolve the split brain scenario by indicating to each network element coupled to the peer link an operational status of other network elements coupled to the peer link; and establishing a communication link between the network elements of the LAG to restore communication lost by the failure of the peer link. - View Dependent Claims (28, 29, 30, 31, 32, 33)
-
Specification