×

EFFICIENT JOIN WITH ONE OR MORE LARGE DIMENSION TABLES

  • US 20140214796A1
  • Filed: 01/31/2013
  • Published: 07/31/2014
  • Est. Priority Date: 01/31/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for efficient hash joins with a large dimension table, comprising:

  • gathering table characteristics to support a query, the gathering including identifying a first set of at least one dimension table having a hash table size estimated to exceed a defined threshold, and identifying a second set of remaining tables having at least one local predicate;

    executing a query responsive to the gathered table characteristics, the query execution including;

    reducing each identified large dimension table in the first set, including;

    building a join filter from each dimension table of the second set;

    scanning a fact table and reducing the fact table by applying predicates local to the fact table, and applying each join predicate to each table in the second set by probing its corresponding join filter; and

    for each large dimension table, creating a pre-join output filter from columns of the reduced fact table that join with each large dimension table;

    joining all the dimension tables and the fact table in a star join while exploiting each pre-join output filter, including;

    scanning each large dimension table of the first set, applying the created pre-join output filter as a local predicate during the scan, and for each qualifying row of the dimension table, creating an entry in a hash table;

    scanning each dimension table in the second set, including applying predicates local to each remaining dimension table, and for each qualifying row of each remaining dimension table, creating an entry in an associated hash table; and

    joining the fact table with all dimension tables, including scanning the fact table and probing each corresponding hash table for each reduced dimension table.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×