Data consistency management in large computing clusters

  • US 10,467,115 B1
  • Filed: 11/03/2017
  • Issued: 11/05/2019
  • Est. Priority Date: 11/03/2017
  • Status: Active Grant
First Claim

1. A method comprising:

  • determining a rebuild time parameter that characterizes a time for copying stored data from a first storage device to a second storage device of a computing system, the computing system comprising a plurality of computing nodes;

    determining a data loss parameter corresponding to a storage device of a computing node of the plurality of computing nodes;

    determining a storage device group having a maximum number of storage devices selected from storage devices of the plurality of computing nodes by:

        identifying a maximum data loss probability value determined based at least in part on the rebuild parameter and a data loss parameter corresponding to the storage device,

        incrementally adding the storage device to the storage device group, and

        comparing an estimated data loss probability value of the storage device group having the storage device added against the maximum data loss probability value, wherein the maximum number of storage devices for the storage device group is determined when the estimated data loss probability value exceeds the maximum data loss probability value; and

    assigning a dataset to the storage device group, wherein the dataset and a replica of the dataset are stored in the storage device group.
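
The incremental sizing step recited in claim 1 can be illustrated with a minimal Python sketch. Everything beyond the claim wording is an assumption made for illustration: the failure model (independent devices with exponentially distributed failures, and data loss whenever two or more devices in the group fail within one rebuild window), the use of an annual failure rate as the "data loss parameter", and all function and parameter names. The claim says the maximum data loss probability value is based on the rebuild parameter and the data loss parameter but does not give a formula here, so the sketch simply takes that threshold as an input.

```python
import math


def failure_prob_during_rebuild(annual_failure_rate, rebuild_hours):
    """Assumed model: exponential failures, so
    P(device fails within the rebuild window) = 1 - exp(-rate * time)."""
    rate_per_hour = annual_failure_rate / (365 * 24)
    return 1.0 - math.exp(-rate_per_hour * rebuild_hours)


def estimated_loss_prob(device_probs):
    """Assumed model: data is lost if at least two devices in the group
    fail during the same rebuild window.
    P(>= 2 fail) = 1 - P(none fail) - P(exactly one fails)."""
    none_fail = math.prod(1.0 - p for p in device_probs)
    exactly_one = sum(
        p * math.prod(1.0 - q for j, q in enumerate(device_probs) if j != i)
        for i, p in enumerate(device_probs)
    )
    return 1.0 - none_fail - exactly_one


def build_device_group(devices, rebuild_hours, max_loss_prob):
    """Add devices one at a time; stop as soon as adding the next device
    would push the group's estimated loss probability above the threshold.

    devices: list of (name, annual_failure_rate) pairs (hypothetical input).
    Returns the names of the devices kept in the group.
    """
    group, probs = [], []
    for name, annual_failure_rate in devices:
        p = failure_prob_during_rebuild(annual_failure_rate, rebuild_hours)
        if estimated_loss_prob(probs + [p]) > max_loss_prob:
            break  # the maximum group size has been reached
        group.append(name)
        probs.append(p)
    return group


if __name__ == "__main__":
    # Hypothetical cluster: 20 identical devices with a 2% annual failure rate.
    candidates = [(f"node{i}-disk0", 0.02) for i in range(20)]
    group = build_device_group(candidates, rebuild_hours=24, max_loss_prob=1e-7)
    print(len(group), group)
```

Under this assumed model, a shorter rebuild time or a lower failure rate reduces each device's per-window failure probability, so more devices fit under the same threshold. A dataset and its replica would then be placed on devices within the returned group.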
