Data arrangement management in a distributed data cluster environment of a shared pool of configurable computing resources

  • US 10,387,415 B2
  • Filed: 06/28/2016
  • Issued: 08/20/2019
  • Est. Priority Date: 06/28/2016
  • Status: Active Grant
First Claim

1. A computer-implemented method for data arrangement management in a distributed data cluster environment of a shared pool of configurable computing resources, the method comprising:

    monitoring, in the distributed data cluster environment, a set of data for a data redistribution candidate trigger;

    detecting, in the distributed data cluster environment, the data redistribution candidate trigger with respect to the set of data, wherein detecting the data redistribution candidate trigger comprises:

    detecting a data structure which indicates a workload pattern;

    building a new distribution key for the data structure to change the workload pattern to reduce data movement during a query operation;

    determining, based on the new distribution key, a new data arrangement associated with the set of data, and comparing the new data arrangement with a current data arrangement to determine which data arrangement is more efficient based on resource usage; and

    in response to determining that the new data arrangement is more efficient than the current data arrangement, establishing, based on the new distribution key, the new data arrangement in the distributed data cluster environment such that at least a portion of the set of data comprises a different physical location in the new data arrangement.
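
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it (`node_for`, `arrangement`, `detect_workload_pattern`, `estimated_movement`, `manage_arrangement`) is hypothetical. It assumes that rows are placed on nodes by hashing a single distribution-key column, that the "workload pattern" is simply the column most often used in joins, and that "resource usage" can be approximated by the number of rows a query would have to move between nodes.

```python
from collections import Counter
from zlib import crc32


def node_for(value, num_nodes):
    """Map a key value to a node with a deterministic hash (assumed placement rule)."""
    return crc32(str(value).encode("utf-8")) % num_nodes


def arrangement(rows, key, num_nodes):
    """Return the node index of every row when the data is distributed on `key`."""
    return [node_for(row[key], num_nodes) for row in rows]


def detect_workload_pattern(query_join_columns):
    """Treat the most frequently joined column as the workload pattern (an assumption)."""
    column, _count = Counter(query_join_columns).most_common(1)[0]
    return column


def estimated_movement(rows, key, query_join_columns, num_nodes):
    """Count rows that would be shuffled across nodes to serve the observed joins."""
    placed = arrangement(rows, key, num_nodes)
    moved = 0
    for column in query_join_columns:
        needed = arrangement(rows, column, num_nodes)
        moved += sum(1 for have, want in zip(placed, needed) if have != want)
    return moved


def manage_arrangement(rows, current_key, query_join_columns, num_nodes):
    """Build a candidate key, compare arrangements, and switch only if cheaper.

    Returns the chosen distribution key and the number of rows whose
    physical location (node) changes as a result.
    """
    candidate_key = detect_workload_pattern(query_join_columns)
    current_cost = estimated_movement(rows, current_key, query_join_columns, num_nodes)
    new_cost = estimated_movement(rows, candidate_key, query_join_columns, num_nodes)
    if new_cost < current_cost:
        old_nodes = arrangement(rows, current_key, num_nodes)
        new_nodes = arrangement(rows, candidate_key, num_nodes)
        relocated = sum(1 for old, new in zip(old_nodes, new_nodes) if old != new)
        return candidate_key, relocated
    return current_key, 0


if __name__ == "__main__":
    rows = [{"order_id": i, "customer_id": i % 7} for i in range(100)]
    workload = ["customer_id"] * 9 + ["order_id"]  # joins seen while monitoring
    key, relocated = manage_arrangement(rows, "order_id", workload, num_nodes=4)
    print(key, relocated)
```

In this sketch the table stays on its current key whenever the candidate key is not strictly cheaper, which mirrors the claim's condition that the new arrangement is established only in response to determining that it is more efficient.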
