Detecting intermittent network link failures
First Claim
Patent Images
1. An apparatus for detecting intermittent network link failures, the apparatus comprising:
- a non-tangible computer readable storage medium storing a computer readable program;
a processor executing the computer readable program, the computer readable program comprising;
a detection module detecting link failures of a network link of a network by notifying an Ethernet data link layer of each link failure from an Ethernet physical layer, the Ethernet data link layer notifying a network agent residing at the data link layer of the link failure, and the network agent logging the link failure;
a tracking module tracking the link failures over a specified time interval, wherein a link failure comprises missing a heartbeat message during a heartbeat time interval;
a failure module determining the network link is failing in response to a number of link failures during the specified time interval exceeding a specified failure threshold, wherein the specified failure threshold t is calculated from a service level as t=ksn wherein k is a non-zero constant, s is the service level, and n is a number of heartbeat messages during the specified time interval; and
a mitigation module mitigating communications over the network link by removing the failing network link from the network.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus, system, and method are disclosed for detecting intermittent network link failures. A tracking module tracks link failures of a network link of a network over a specified time interval. A failure module determines the network link is failing in response to a number of link failures exceeding a specified failure threshold. A mitigation module mitigates communications over the network link.
-
Citations
14 Claims
-
1. An apparatus for detecting intermittent network link failures, the apparatus comprising:
-
a non-tangible computer readable storage medium storing a computer readable program; a processor executing the computer readable program, the computer readable program comprising; a detection module detecting link failures of a network link of a network by notifying an Ethernet data link layer of each link failure from an Ethernet physical layer, the Ethernet data link layer notifying a network agent residing at the data link layer of the link failure, and the network agent logging the link failure; a tracking module tracking the link failures over a specified time interval, wherein a link failure comprises missing a heartbeat message during a heartbeat time interval; a failure module determining the network link is failing in response to a number of link failures during the specified time interval exceeding a specified failure threshold, wherein the specified failure threshold t is calculated from a service level as t=ksn wherein k is a non-zero constant, s is the service level, and n is a number of heartbeat messages during the specified time interval; and a mitigation module mitigating communications over the network link by removing the failing network link from the network. - View Dependent Claims (2, 3)
-
-
4. A computer program product comprising a non-tangible computer readable storage medium storing a computer readable program executing on a processor to perform operations for detecting intermittent network link failures, the operations of the computer program product comprising:
-
tracking link failures of a network link of a network over a specified time interval by notifying an Ethernet data link layer of each link failure from an Ethernet physical layer, the Ethernet data link layer notifying a network agent residing at the data link layer of the link failure, and the network agent logging the link failure, wherein a link failure comprises missing a heartbeat message during a heartbeat time interval; determining the network link is failing in response to a number of link failures during the specified time interval exceeding a specified failure threshold, wherein the specified failure threshold t is calculated from a service level as t =ksn wherein k is a non-zero constant, s is the service level, and n is a number of heartbeat messages during the specified time interval; and mitigating communications over the network link by removing the failing network link from the network. - View Dependent Claims (5, 6, 7, 8)
-
-
9. A method for detecting intermittent network link failures, the method comprising:
-
tracking link failures of a network link of a network over a specified time interval by notifying an Ethernet data link layer of each link failure from an Ethernet physical layer, the Ethernet data link layer notifying a network agent residing at the data link layer of the link failure, and the network agent ligging the link failure, wherein a link failure comprises missing a heartbeat message during a heartbeat time interval; determining the network link is failing in response to a number of link failures during the specified time interval exceeding a specified failure threshold, wherein the specified failure threshold t is calculated from a service level as t =ksn wherein k is a non-zero constant, s is the service level, and n is a number of heartbeat messages during the specified time interval; and mitigating communications over the network link by removing the failing network link from the network. - View Dependent Claims (10)
-
-
11. A system to detect intermittent network link failures, the system comprising:
-
a network; a data processing device in communication with network, the data processing device comprising; a tracking module tracking link failures of a network link of the network over a specified time interval by notifying an Ethernet data link layer of each link failure from an Ethernet physical layer, the Ethernet data link layer notifying a network agent residing at the data link layer of the link failure, and the network agent logging the link failure, wherein a link failure comprises missing a heartbeat message during a heartbeat time interval; a failure module determining the network link is failing in response to a number of link failures during the specified time interval exceeding a specified failure threshold, wherein the specified failure threshold t is calculated from a service level as t =ksn wherein k is a non-zero constant, s is the service level, and n is a number of heartbeat messages during the specified time interval; and a mitigation module mitigating communications over the network link by removing the failing network link from the network. - View Dependent Claims (12, 13, 14)
-
Specification