×

TEMPLATE BASED PARALLEL CHECKPOINTING IN A MASSIVELY PARALLEL COMPUTER SYSTEM

  • US 20080215916A1
  • Filed: 04/16/2008
  • Published: 09/04/2008
  • Est. Priority Date: 04/14/2005
  • Status: Active Grant
First Claim
Patent Images

1. ) A massively parallel computer system comprising:

  • a) a plurality of compute nodes;

    b) a plurality of input/output (I/O) nodes connected to the compute nodes;

    c) a network that connects the compute nodes and the I/O nodes that supports a broadcast communication with the compute nodes; and

    d) a checkpoint server that collects a parallel checkpoint of the state of the compute nodes using a rolling checksum algorithm and broadcasts a list of data block checksums for a plurality of data blocks on the server from a previous checkpoint to the plurality of compute nodes and the compute nodes search their own memory image for checksum matches using the rolling checksum algorithm.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×