SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR OPTIMIZATION OF MULTIPLE PARALLEL JOIN OPERATIONS ON SKEWED DATA

  • US 20100088298A1
  • Filed: 10/06/2008
  • Published: 04/08/2010
  • Est. Priority Date: 10/06/2008
  • Status: Active Grant
First Claim

1. A method of performing a multiple join operation in a parallel processing system, comprising:

    distributing a respective set of rows of a first table, a second table, and a third table involved in a join operation to each of a plurality of processing modules, wherein the join operation comprises a join on a column of the first table, a column of the second table, and a column of the third table;

    receiving, by each of the plurality of processing modules, a list of skewed values of the column of the first table;

    redistributing, by each of the plurality of processing modules, any row of the respective set of rows that has a value of the column of the first table that does not match any of the skewed values, any row of the respective set of rows that has a value of the column of the second table that does not match any of the skewed values, and any row of the respective set of rows of the third table;

    locally maintaining, by a first processing module of the plurality of processing modules, any row of the respective set of rows that has a value of the column of the first table that matches any of the skewed values; and

    duplicating, by a second processing module of the plurality of processing modules, any row of the respective set of rows that has a value of the column of the second table that matches any of the skewed values.
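
The routing decisions recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation. It assumes that "redistributing" means hashing the join-column value to pick a destination module and that "duplicating" means sending a copy to every module; neither is spelled out in the claim text. The names `target_module`, `route_rows`, and the `key` parameter are hypothetical and exist only for this illustration.

```python
import zlib
from collections import defaultdict


def target_module(value, num_modules):
    """Hypothetical helper: deterministically hash a join value to a module index."""
    return zlib.crc32(repr(value).encode("utf-8")) % num_modules


def route_rows(module_id, num_modules, t1_rows, t2_rows, t3_rows,
               skewed_values, key=lambda row: row[0]):
    """Decide where one module sends each row it currently holds.

    Returns a dict mapping destination module -> list of (table_name, row).
    Assumption: the join column is the first field of every row (see `key`).
    """
    skewed = set(skewed_values)
    outbox = defaultdict(list)

    for row in t1_rows:
        if key(row) in skewed:
            # First table, skewed value: keep the row on this module.
            outbox[module_id].append(("t1", row))
        else:
            # First table, non-skewed value: redistribute by hash.
            outbox[target_module(key(row), num_modules)].append(("t1", row))

    for row in t2_rows:
        if key(row) in skewed:
            # Second table, skewed value: duplicate to every module (assumed).
            for dest in range(num_modules):
                outbox[dest].append(("t2", row))
        else:
            # Second table, non-skewed value: redistribute by hash.
            outbox[target_module(key(row), num_modules)].append(("t2", row))

    for row in t3_rows:
        # Third table: every row is redistributed, skewed or not.
        outbox[target_module(key(row), num_modules)].append(("t3", row))

    return dict(outbox)


if __name__ == "__main__":
    plan = route_rows(
        module_id=0,
        num_modules=3,
        t1_rows=[(1, "a"), (7, "b")],
        t2_rows=[(7, "x"), (2, "y")],
        t3_rows=[(7, "p")],
        skewed_values=[7],
    )
    for dest in sorted(plan):
        print(dest, plan[dest])
```

In this sketch, first-table rows carrying a skewed value never leave the module that holds them, while the matching second-table rows are copied to every module so that each locally kept row can still find its join partners. That is the load-balancing intent suggested by the claim: a heavily repeated value is not hashed onto a single module.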
