
Partition-based index management in hadoop-like data stores

  • US 9,959,306 B2
  • Filed: 06/12/2015
  • Issued: 05/01/2018
  • Est. Priority Date: 06/12/2015
  • Status: Active Grant
First Claim

1. A method for maintaining an index of a processing a dataset in a partitioned distributed storage system after a batch update of a dataset, the partitioned distributed storage system having data stored in a base table and an index stored in an index table, comprising:

    locking the base table and the index table to prevent external update operations;

    receiving base and index table metadata from the partitioned distributed storage system, wherein the base table metadata and the index table metadata include respective base table partition information and index table partition information;

    partitioning the dataset into a set of base-delta files according to the base table metadata;

    generating a set of index-delta files corresponding with the base-delta files according to the index table metadata;

    updating the partitioned distributed storage system with the set of base-delta files and the set of index-delta files, wherein a first update of the base table is synchronous with a second update of the index table; and

    unlocking, subsequent to the update, the base table and the index table.
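
The sequence of steps recited in the claim can be illustrated with a small in-memory sketch. This is not the patented implementation: the `PartitionedTable` class, the `batch_update` function, the use of sorted split points as partition metadata, and the dictionaries standing in for delta files are all hypothetical choices made for illustration. The sketch also assumes that indexed values are unique and that the batch contains only inserts, so it does not remove stale index entries.

```python
import bisect
import threading
from collections import defaultdict


class PartitionedTable:
    """Toy in-memory stand-in for a range-partitioned table (hypothetical)."""

    def __init__(self, split_points):
        self.split_points = sorted(split_points)  # partition metadata
        self.partitions = defaultdict(dict)       # partition id -> {key: value}
        self.lock = threading.Lock()

    def metadata(self):
        # Return the partition information (here: the sorted split points).
        return list(self.split_points)

    def apply_delta(self, partition_id, delta):
        # Merge one delta "file" into the partition it was built for.
        self.partitions[partition_id].update(delta)


def partition_of(split_points, key):
    # Range partitioning: partition i holds keys below split_points[i].
    return bisect.bisect_right(split_points, key)


def build_deltas(rows, split_points):
    # Group rows into one delta per target partition.
    deltas = defaultdict(dict)
    for key, value in rows:
        deltas[partition_of(split_points, key)][key] = value
    return deltas


def batch_update(base, index, dataset):
    # Lock both tables so no external update interleaves with the batch.
    with base.lock, index.lock:
        # Receive base and index table partition metadata.
        base_meta = base.metadata()
        index_meta = index.metadata()
        # Partition the dataset into base-deltas according to base metadata.
        base_deltas = build_deltas(dataset, base_meta)
        # Generate matching index-deltas (indexed value -> row key).
        index_rows = [(value, row_key) for row_key, value in dataset]
        index_deltas = build_deltas(index_rows, index_meta)
        # Apply both sets of deltas before either lock is released.
        for pid, delta in base_deltas.items():
            base.apply_delta(pid, delta)
        for pid, delta in index_deltas.items():
            index.apply_delta(pid, delta)
    # Leaving the with-block unlocks both tables.
    return base_deltas, index_deltas


if __name__ == "__main__":
    base = PartitionedTable(["m"])   # two partitions, split at key "m"
    index = PartitionedTable([50])   # two partitions, split at value 50
    batch_update(base, index, [("apple", 10), ("zebra", 90)])
    print(dict(base.partitions))
    print(dict(index.partitions))
```

In this sketch the base and index updates are applied inside the same locked region, which is one simple way to read the claim's requirement that the two updates be synchronous; a real distributed store would need a coordination mechanism that the claim text quoted here does not specify.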
