×

Automatic repair of corrupt HBases

  • US 9,842,126 B2
  • Filed: 03/15/2013
  • Issued: 12/12/2017
  • Est. Priority Date: 04/20/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for maintaining table integrity of a datastore table in a distributed data cluster that relies on the datastore table for locating data, the datastore table having rows and being partitioned into regions, each region having a start key and a stop key for identifying which rows map to which region, the distributed data cluster including (1) a number of region servers, each region server maintaining one or more of the regions and (2) a distributed file system (DFS) that stores the data, the method comprising:

  • identifying, by scanning all rows in the datastore table, whether each possible row in the datastore table maps to one and only one region;

    upon identifying that a particular row does not map to one and only one region, determining that a table integrity problem exists;

    determining a type of the table integrity problem;

    deciding a repair option based on the type of the table integrity problem, wherein the repair option is to cause the particular row to become mapped to one and only one correct region, in consistency with the data stored in the DFS; and

    resolving the table integrity problem by executing the repair option.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×