×

Dynamic replica failure detection and healing

  • US 9,971,823 B2
  • Filed: 04/04/2016
  • Issued: 05/15/2018
  • Est. Priority Date: 06/13/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • a plurality of compute nodes, each comprising at least one processor and memory, wherein the plurality of compute nodes implement a data store;

    wherein the data store is configured to;

    maintain a plurality of replicas of data on behalf of a client of the data store at different ones of the compute nodes as a replica group for the data;

    obtain individual metadata for different replicas of the replica group to update status metadata stored for the replica group at one or more of the compute nodes remote from the different ones of the compute nodes that maintain the plurality of replicas;

    access, by a replica group status sweeper remote from the different ones of the compute nodes and remote from the one or more compute nodes that store the status metadata, the updated status metadata for the replica group at the one or more compute nodes to evaluate the replica group for compliance with a healthy state definition of a number of replicas for the replica group based, at least in part, on the updated status metadata, wherein the evaluation determines that a number of available replicas for the replica group is not compliant with the healthy state definition; and

    automatically restore the replica group such that the number of available replicas for the replica group is compliant with the healthy state definition for the replica group.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×