×

Compaction policy

  • US 10,346,432 B2
  • Filed: 03/17/2016
  • Issued: 07/09/2019
  • Est. Priority Date: 03/17/2015
  • Status: Active Grant
First Claim
Patent Images

1. A server-implemented method of selecting rowsets to compact in a tablet storing data associated with a distributed file system (DFS), the method comprising:

  • restricting, by a server, each rowset in a plurality of rowsets included in the tablet to have a size less than a predetermined size threshold, wherein the each rowset stores keys covering a range that is less than or equal to a keyspace of the tablet;

    determining a height of the tablet across the keyspace, wherein the height of the tablet is based on a number of rowsets having key ranges that overlap;

    determining a rowset width of the each rowset in the keyspace of the tablet, wherein the rowset width of the each rowset is proportional to a percentage of the keyspace that is covered by the rowset;

    until a minimum operational cost is reached, iteratively calculating, by the server, an operational cost associated with compaction of two or more rowsets in the keyspace, wherein the cost is calculated by integrating the rowset width of the each rowset included in the tablet across the height of the tablet, wherein the operational cost depends on a size of the each rowset; and

    selecting, by the server, the two or more rowsets for compaction based on the two or more rowsets resulting in the minimum operational cost over the each rowset included in a workspace.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×