Fault tolerance framework for networks of nodes

  • US 8,527,622 B2
  • Filed: 10/12/2007
  • Issued: 09/03/2013
  • Est. Priority Date: 10/12/2007
  • Status: Active Grant
First Claim

1. A system including instructions recorded on a non-transitory computer-readable storage medium and executable by at least one processor, the system comprising:

  • the at least one processor;

    a first message handler configured to cause the at least one processor to receive first network-related data using a first communications protocol, the first network-related data being associated with a first network of nodes, the nodes of the first network of nodes communicating with one another within the first network using the first communications protocol;

    a second message handler configured to cause the at least one processor to receive second network-related data using a second communications protocol, the second network-related data being associated with a second network of nodes, the nodes of the second network of nodes communicating with one another within the second network using the second communications protocol;

    a message transport system configured to cause the at least one processor to receive the first network-related data and the second network-related data and further configured to route the first network-related data and the second network-related data in a common communications protocol; and

    a fault manager configured to:

    construct and maintain a state model that stores events occurring in the first network of nodes and the second network of nodes, stores related state information, and stores network metadata for the first network of nodes and the second network of nodes,

    cause the at least one processor to receive the network-related data in the common communications protocol, and

    determine a fault associated with an operation of one or more of the first network of nodes and the second network of nodes, based on the network-related data in the common protocol and on the state model,

    cause the at least one processor to determine a recovery method for recovering an operation of the networks of nodes despite the fault, the recovery method being determined from among a plurality of recovery methods including one or more of triggering a workflow, performing a sensor value fusion, and deploying a service within a node of the networks of nodes; and

    cause the at least one processor to implement the determined recovery method; and

    a code distribution manager configured to determine a target node not associated with the fault within the first network of nodes and to determine a service executable for deploying a service in response to the fault and based on a mapping of the service to the target node as part of implementing the determined recovery method when the recovery method includes deploying a service; and

    a service injector configured to deploy the service executable to the target node for continued execution thereon to thereby recover the operation, wherein the service injector is selected from a plurality of service injectors as being compatible with the first communications protocol.
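
The data flow recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, only one possible reading of the claim under stated assumptions. Every class name, field, event kind ("heartbeat", "node_down"), protocol label ("proto-a", "proto-b"), node name, and the service file name is hypothetical. Only the "deploying a service" recovery method is modeled; triggering a workflow and sensor value fusion are left out.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class CommonMessage:
    """Network-related data expressed in a common format (hypothetical)."""
    network: str
    node: str
    kind: str        # assumed event kinds: "heartbeat" or "node_down"
    protocol: str    # native protocol of the originating network


class MessageHandler:
    """Receives data for one network in that network's native protocol."""
    def __init__(self, network: str, protocol: str) -> None:
        self.network, self.protocol = network, protocol

    def receive(self, node: str, kind: str) -> CommonMessage:
        return CommonMessage(self.network, node, kind, self.protocol)


@dataclass
class StateModel:
    """Stores events, per-node state, and network metadata."""
    events: List[CommonMessage] = field(default_factory=list)
    node_state: Dict[str, str] = field(default_factory=dict)
    metadata: Dict[str, List[str]] = field(default_factory=dict)

    def record(self, msg: CommonMessage) -> None:
        self.events.append(msg)
        self.node_state[msg.node] = "failed" if msg.kind == "node_down" else "ok"
        nodes = self.metadata.setdefault(msg.network, [])
        if msg.node not in nodes:
            nodes.append(msg.node)


class ServiceInjector:
    """Deploys a service executable to nodes that speak one protocol."""
    def __init__(self, protocol: str) -> None:
        self.protocol = protocol
        self.deployed: List[Tuple[str, str]] = []

    def deploy(self, executable: str, target: str) -> None:
        self.deployed.append((executable, target))


class CodeDistributionManager:
    """Picks a healthy target node and the executable mapped to it."""
    def __init__(self, service_map: Dict[str, str],
                 injectors: List[ServiceInjector]) -> None:
        self.service_map = service_map    # target node -> service executable
        self.injectors = injectors

    def recover(self, msg: CommonMessage,
                state: StateModel) -> Optional[Tuple[str, str]]:
        # Select the injector compatible with the faulty network's protocol.
        injector = next(
            (i for i in self.injectors if i.protocol == msg.protocol), None)
        if injector is None:
            return None
        # Choose a node in the same network that is not associated with the fault.
        for node in state.metadata.get(msg.network, []):
            if state.node_state.get(node) == "ok" and node in self.service_map:
                executable = self.service_map[node]
                injector.deploy(executable, node)
                return executable, node
        return None


class FaultManager:
    """Maintains the state model, detects faults, and starts recovery."""
    def __init__(self, cdm: CodeDistributionManager) -> None:
        self.state = StateModel()
        self.cdm = cdm

    def handle(self, msg: CommonMessage) -> Optional[Tuple[str, str]]:
        self.state.record(msg)
        if msg.kind != "node_down":       # no fault detected
            return None
        return self.cdm.recover(msg, self.state)


class MessageTransport:
    """Routes common-format messages from any handler to the fault manager."""
    def __init__(self, fault_manager: FaultManager) -> None:
        self.fault_manager = fault_manager

    def route(self, msg: CommonMessage) -> Optional[Tuple[str, str]]:
        return self.fault_manager.handle(msg)


# Usage: two networks with different protocols; node "a1" fails in the first.
first = MessageHandler("net-a", "proto-a")
second = MessageHandler("net-b", "proto-b")
injectors = [ServiceInjector("proto-a"), ServiceInjector("proto-b")]
fm = FaultManager(CodeDistributionManager({"a2": "service.bin"}, injectors))
transport = MessageTransport(fm)

transport.route(first.receive("a1", "heartbeat"))
transport.route(first.receive("a2", "heartbeat"))
transport.route(second.receive("b1", "heartbeat"))
result = transport.route(first.receive("a1", "node_down"))
print(result)  # ('service.bin', 'a2')
```

In this sketch the fault on "a1" is inferred from the incoming event together with the state model, and the service is redeployed to "a2" through the injector that matches the first network's protocol, which mirrors the last two claim elements.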
