Functionality of decomposition data skew in asymmetric massively parallel processing databases

  • US 9,355,127 B2
  • Filed: 10/12/2012
  • Issued: 05/31/2016
  • Est. Priority Date: 10/12/2012
  • Status: Expired due to Fees
First Claim

1. A method of restructuring a table having data skew in a computer system, the computer system storing data from a database in partitions on one or more nodes of the computer system, the method comprising:

  • determining whether original data values of a distribution key column of the table include frequent data values that cause data skew in the table;

  • after the original data values of the distribution key column have been determined to include the frequent data values, copying only the original data values of the distribution key column that comprise the frequent data values to a switch column added to the table;

  • after the original data values of the distribution key column that comprise the frequent data values have been copied to the switch column, replacing only the original data values in the distribution key column that comprise the frequent data values with modified data values that reduce the data skew in the table during partitioning, wherein the original data values that are copied and replaced comprise a subset of the original data values and the subset of the original data values comprises one or more of the frequent data values that cause the data skew in the table;

  • after the original data values in the distribution key column that comprise the frequent data values have been replaced, partitioning the rows of the table across the nodes of the computer system using the distribution key column with the modified data values; and

  • performing database operations other than the partitioning using the original data values, but not the modified data values.
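
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation; it only mirrors the order of operations in the claim on an in-memory list of rows. Everything named here is an assumption made for illustration: the function names (`find_frequent`, `restructure`, `original_value`, `partition`), the frequency threshold, the `value#row_index` form of the modified data values, and the CRC32-based hash that stands in for the database's partitioning function. The sketch also assumes that the distribution key column never holds `None`.

```python
import zlib
from collections import Counter


def stable_hash(value):
    # Deterministic stand-in for a database's partitioning hash (assumption).
    return zlib.crc32(repr(value).encode("utf-8"))


def find_frequent(rows, key, threshold):
    # Step 1: values whose share of the rows exceeds the threshold are
    # treated as the "frequent data values" that cause skew.
    counts = Counter(row[key] for row in rows)
    limit = threshold * len(rows)
    return {value for value, count in counts.items() if count > limit}


def restructure(rows, key, switch, threshold=0.2):
    # Steps 2 and 3: copy only the frequent values into the switch column,
    # then replace only those values in the distribution key column.
    frequent = find_frequent(rows, key, threshold)
    result = []
    for index, row in enumerate(rows):
        new_row = dict(row)
        new_row[switch] = None
        if row[key] in frequent:
            new_row[switch] = row[key]           # copy the original value
            new_row[key] = f"{row[key]}#{index}"  # hypothetical modified value
        result.append(new_row)
    return result


def original_value(row, key, switch):
    # Step 5: operations other than partitioning read the original value.
    return row[switch] if row[switch] is not None else row[key]


def partition(rows, key, num_nodes):
    # Step 4: distribute rows across nodes by hashing the distribution key.
    nodes = [[] for _ in range(num_nodes)]
    for row in rows:
        nodes[stable_hash(row[key]) % num_nodes].append(row)
    return nodes
```

Called as `partition(restructure(rows, "key", "switch"), "key", 4)`, the sketch spreads the rows that share a frequent key across several nodes, while `original_value` still returns the unmodified key for joins, filters, or aggregates. Appending the row index is only one possible way to build modified values; the claim does not prescribe how they are generated.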
