Embracing and exploiting data skew during a join or groupby

  • US 10,489,403 B2
  • Filed: 05/06/2015
  • Issued: 11/26/2019
  • Est. Priority Date: 10/08/2014
  • Status: Active Grant
First Claim

1. A method, comprising:

    during a query optimization for a database query involving a join operation, obtaining a distribution of data values in a join column of an inner table;

    using the distribution, identifying one or more data ranges containing skew;

    performing a cost-benefit analysis for a skew specific join scheme, wherein the cost benefit analysis is based on a tradeoff between a number of the data values in a data range of the one or more data ranges and additional overhead costs of processing the number of data values using the skew specific join scheme;

    for each data range identified as containing skew, performing the join operation using the skew specific join scheme based on the cost-benefit analysis; and

    for each data range not identified as containing skew, performing the join operation using a non-skew specific join scheme.
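
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the fixed-width bucketing of integer join keys, the `skew_factor` threshold, the linear cost model (`per_row_saving`, `fixed_overhead`), and the stand-in `skew_join` scheme (building the hash index on the outer side instead of the heavily populated inner side) are all assumptions chosen for illustration, and every function name is hypothetical.

```python
from collections import Counter, defaultdict


def find_skewed_ranges(inner_keys, bucket_width, skew_factor):
    """Bucket the inner join-column values and flag over-populated ranges.

    Assumption: integer keys, fixed-width ranges, and "skew" meaning a
    range holds more than skew_factor times the mean rows per range.
    """
    counts = Counter(k // bucket_width for k in inner_keys)
    if not counts:
        return {}
    mean = sum(counts.values()) / len(counts)
    return {b: n for b, n in counts.items() if n > skew_factor * mean}


def worth_special_handling(row_count, per_row_saving, fixed_overhead):
    """Hypothetical cost-benefit test: rows in the range vs. added overhead."""
    return row_count * per_row_saving > fixed_overhead


def hash_join(outer, inner):
    """Non-skew-specific scheme: build a hash index on the inner table."""
    index = defaultdict(list)
    for key, payload in inner:
        index[key].append(payload)
    return [(key, o, i) for key, o in outer for i in index.get(key, [])]


def skew_join(outer, inner):
    """Stand-in skew-specific scheme (an assumption): index the outer rows
    and stream the heavily populated inner rows past that index."""
    index = defaultdict(list)
    for key, payload in outer:
        index[key].append(payload)
    return [(key, o, i) for key, i in inner for o in index.get(key, [])]


def skew_aware_join(outer, inner, bucket_width=10, skew_factor=2.0,
                    per_row_saving=1.0, fixed_overhead=5.0):
    """Route each range to a join scheme, then concatenate the results."""
    skewed = find_skewed_ranges([k for k, _ in inner], bucket_width, skew_factor)
    chosen = {b for b, n in skewed.items()
              if worth_special_handling(n, per_row_saving, fixed_overhead)}

    def in_chosen(row):
        return row[0] // bucket_width in chosen

    hot = skew_join([r for r in outer if in_chosen(r)],
                    [r for r in inner if in_chosen(r)])
    cold = hash_join([r for r in outer if not in_chosen(r)],
                     [r for r in inner if not in_chosen(r)])
    return hot + cold, chosen


# Example data: key 5 dominates the inner table, so range 0 is skewed.
inner = [(5, f"i{n}") for n in range(20)] + [(12, "a"), (25, "b"), (37, "c")]
outer = [(5, "x"), (12, "y"), (99, "z")]
result, chosen = skew_aware_join(outer, inner)
```

Both schemes yield the same joined rows; only the execution strategy differs per range, which is why the combined result can be checked against a plain `hash_join` over the full tables.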
