×

Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network

  • US 6,195,760 B1
  • Filed: 07/20/1998
  • Issued: 02/27/2001
  • Est. Priority Date: 07/20/1998
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer system for fault tolerant computing comprising:

  • a plurality of host computers interconnected on a network;

    one or more copies of an application module each running on a different one of said plurality of host computers;

    one or more idle backup copies of the application module each stored on a different one of said host computers;

    a manager daemon process running on one of said plurality of host computers, the manager daemon process receiving an indication upon a failure of one of said running copies of the application module and initiating failure recovery; and

    means for providing a registration message to said manager daemon process, said registration message specifying said application module and a degree of replication of said application module, said degree of replication indicating the number of running copies of the application module to be maintained in the system;

    wherein the number of running copies of the application module is maintained at the registered degree of replication by executing at least one of said idle backup copies upon detecting one or more failures, respectively, of any of the running copies of said application module.

View all claims
  • 11 Assignments
Timeline View
Assignment View
    ×
    ×