LIVE ERROR RECOVERY
0 Assignments
0 Petitions
Accused Products
Abstract
A packet is identified at a port of a serial data link, and it is determined that the packet is associated with an error. Entry into an error recovery mode is initiated based on the determination that the packet is associated with the error. Entry into the error recovery mode can cause the serial data link to be forced down. In one aspect, forcing the data link down causes all subsequent inbound packets to be dropped and all pending outbound requests and completions to be aborted during the error recovery mode.
-
Citations
56 Claims
-
1-33. -33. (canceled)
-
34. An apparatus comprising:
-
a capability structure associated with an downstream port error containment mode; and a downstream port comprising; input/output (I/O) circuitry to support communication with another device over a serial data link; and error logic comprising hardware circuitry, wherein the error logic is to; determine an uncorrectable error associated with a packet; determine that a particular bit is set within the capability structure to indicate that the downstream port error containment mode is enabled for the downstream port, wherein the port error containment mode is to contain uncorrectable errors at the downstream port; set a downstream port error containment status bit to trigger the downstream port error containment mode based at least in part on the particular bit set to indicate that the downstream port error containment mode is enabled; halt traffic downstream from the downstream port in the downstream port error containment mode to avoid spread of data corruption associated with the uncorrectable error and to permit error recovery; and detect that the downstream port error containment status bit is cleared; wherein the I/O logic is to attempt to retrain the link based on clearing of the downstream port error containment status bit. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48)
-
-
49. A method comprising:
-
receiving, over an interconnect, a packet at a device; detecting, at a downstream port of the device, an uncorrectable error associated with the packet; determining, from a particular bit in an extended capability structure of the device, that a downstream port error containment mode is enabled for the downstream port, wherein the downstream port error containment mode is to contain uncorrectable errors at the downstream port; setting a downstream port error containment status bit to trigger the downstream port error containment mode based the downstream port error containment mode being enabled; halting traffic downstream from the downstream port in the downstream port error containment mode to avoid spread of data corruption associated with the error and to permit error recovery, wherein halting the traffic comprises sending training sequences on the link by the downstream port to force the link into a disabled link state, and the traffic is to be halted without a reset of the device; detecting that the downstream port error containment status bit is cleared; and attempting to retrain the link based on clearing of the downstream port error containment status bit.
-
-
50. A switch device comprising:
-
memory comprising a capability structure associated with a downstream port error containment mode; switching circuitry; error logic comprising hardware circuitry, wherein the error logic is to; determine an uncorrectable error associated with a packet; determine that a particular bit is set within the capability structure to indicate that the downstream port error containment mode is enabled for the downstream port, wherein the port error containment mode is to contain uncorrectable errors at the downstream port; set a downstream port error containment status bit to trigger the downstream port error containment mode based at least in part on the particular bit set to indicate that the downstream port error containment mode is enabled; halt traffic downstream from the downstream port in the downstream port error containment mode to avoid spread of data corruption associated with the uncorrectable error and to permit error recovery, wherein training sequences are to be sent on the link by the downstream port to force the link into a disabled link state and halt the traffic, and the traffic is to be halted without a reset of the downstream port; and detect that the downstream port error containment status bit is cleared; and link training logic to retrain the link based on clearing of the downstream port error containment status bit.
-
-
51. A system comprising:
-
a first device; and a second device connected to the first device by a serial data link, wherein the second device comprises; a capability structure associated with a downstream port error containment mode; and a downstream port, wherein the downstream port comprises hardware-implemented logic comprising; input/output (I/O) logic to support communication on the serial data link with the first device; error logic to; determine an uncorrectable error associated with a packet; determine that a particular bit is set within the capability structure to indicate that the downstream port error containment mode is enabled for the downstream port, wherein the port error containment mode is to contain uncorrectable errors at the downstream port; set a downstream port error containment status bit to trigger the downstream port error containment mode based at least in part on the particular bit set to indicate that the downstream port error containment mode is enabled; halt traffic downstream from the downstream port in the downstream port error containment mode to avoid spread of data corruption associated with the error and to permit error recovery, wherein training sequences are to be sent on the link by the downstream port to force the link into a disabled link state and halt the traffic, and the traffic is to be halted without a reset of the downstream port; and detect that the downstream port error containment status bit is cleared by software, wherein the I/O logic is to attempt to retrain the link based on clearing of the downstream port error containment status bit. - View Dependent Claims (52, 53, 54, 55)
-
-
56. A system comprising:
-
means for receiving a packet on an interconnect, wherein the interconnect couples a set of devices in a computer; means for detecting, at a downstream port of a particular one of the set of devices, an uncorrectable error associated with a packet; means for determining, from a particular bit in an extended capability structure of the device, that a downstream port error containment mode is enabled for the downstream port, wherein the downstream port error containment mode is to contain uncorrectable errors at the downstream port; means for setting a downstream port error containment status bit to trigger the downstream port error containment mode based the downstream port error containment mode being enabled; means for halting traffic downstream from the downstream port in the downstream port error containment mode to avoid spread of data corruption associated with the error and to permit error recovery, wherein halting the traffic comprises sending training sequences on the link by the downstream port to force the link into a disabled link state, and the traffic is to be halted without a reset of the device; means for detecting that the downstream port error containment status bit is cleared; and means for attempting to retrain the link based on clearing of the downstream port error containment status bit.
-
Specification