×

Resilient programming frameworks for handling failures in parallel programs

  • US 9,652,336 B2
  • Filed: 03/13/2015
  • Issued: 05/16/2017
  • Est. Priority Date: 03/13/2015
  • Status: Active Grant
First Claim
Patent Images

1. An information processing system capable of supporting resilient execution of applications written in a programming language with exception handling, the information processing system comprising:

  • memory;

    persistent memory for storing data and computer instructions;

    a resilient store, communicatively coupled with the memory and the persistent memory, wherein application state information stored in the resilient store can be accessed in response to detection of a failure of an application executing in the information processing system;

    a resilient executor, communicatively coupled with the memory and the persistent memory, for executing computations of applications while detecting failures in the execution of the computations; and

    a processor, communicatively coupled with the resilient executor, the resilient store, the memory, the persistent memory, and wherein the processor, responsive to executing computer instructions, performs operations comprising;

    periodically checkpointing an application state in the resilient store;

    executing, with the resilient executor, computations of the application while detecting failures in the execution of the computations, wherein the resilient executor includes computer code which is part of the application;

    restoring, based on the resilient executor detecting a failure in the execution of a computation of the application by catching with said computer code at least one exception, application state information for the application from a checkpoint in the resilient store; and

    resuming, with the resilient executor, execution of the computation of the application with the restored application state information.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×