×

Efficient query processing in columnar databases using bloom filters

  • US 8,972,337 B1
  • Filed: 02/21/2013
  • Issued: 03/03/2015
  • Est. Priority Date: 02/21/2013
  • Status: Active Grant
First Claim
Patent Images

1. A distributed data warehouse system, comprising:

  • a plurality of nodes, wherein at least some nodes of the plurality of nodes each comprise;

    storage for a columnar database table, wherein said storage comprises a plurality of data blocks;

    a bloom filter generator, configured to;

    generate a bloom filter for each of one or more data blocks storing data for a column of the columnar database table, wherein each bloom filter is represented as a bitmap, wherein different patterns of set bits in the bitmap indicate data values not stored in the data block;

    a read module;

    a query engine, configured to;

    receive an indication of a query directed to the column of the columnar database table for select data;

    evaluate the indication of the query to determine one or more predicate data values that identify the select data;

    in response to receiving and evaluating the indication of the query;

    analyze the bitmap representing the bloom filter for the one or more predicate data values for each of the one or more data blocks to determine particular ones of the one or more data blocks which do not need to be read in order to service the query for the select data; and

    direct the read module to read the one or more data blocks storing data for the column excepting the particular ones of the one or more data blocks which do not need to be read.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×