×

Inter-processor failure detection and recovery

  • US 8,850,262 B2
  • Filed: 10/12/2010
  • Issued: 09/30/2014
  • Est. Priority Date: 10/12/2010
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus comprising:

  • non-transitory computer readable storage medium storing computer readable prongram code executable by a plurality of centaral processing units (CPU), wherein the plurality of CPUs are configured in a ring and each CPUn determines whether a CPUn+1 that is logically adjacent to the CPUn in the ring has failed, the computer readable program code comprising;

    a retrieval module of the CPUn configured to retrieve a timestampn+1 from a shared memory that is shared by the plurality of CPUs, wherein the timestampn+1 is written to the shared memory by the CPUn+1, wherein the CPUn is a first core in a multi-core processor and the CPUn+1 is a second core in a multi-core processor, the multi-core processor comprising a plurality of cores;

    a comparison module of the CPUn configured to compare the timestampn+1 to a timestampn generated by a CPUn checking the CPUn+1 for failure and determine a delta value;

    the comparison module of the CPUn further configured to compare the delta value with a threshold value and determine whether the CPUn+1 has failed; and

    a detection module of the CPUn configured to, in response to the comparison module determining that the CPUn+1 has failed, initiate error handling for the plurality of CPUs.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×