×

Query execution systems and methods

  • US 8,935,232 B2
  • Filed: 02/22/2011
  • Issued: 01/13/2015
  • Est. Priority Date: 06/04/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing data in a database system containing a plurality of nodes, the method comprising the steps of:

  • receiving a query for processing of data, wherein the data is stored in a first table in a plurality of tables, wherein the first table is stored on at least one node within the database system;

    determining an attribute of the first table and second table in the plurality of tables, the attribute including a join key, the first table having a smaller size than the second table, wherein the second table is partitioned into a plurality partitions, and at least one partition of the second table is stored on at least one node within the database system;

    providing the first table to each node in the database system storing at least one partition of the second table; and

    joining, on each node storing at least one partition of the second table, the first table and at least one partition of the second table using the determined attribute;

    whereina map phase in a plurality of map phases of a MapReduce process includes the determining, the providing, and the joining, the MapReduce process including a plurality of map phases and reduce phases, anda reduce phase in the plurality of reduce phases includes aggregating data generated as a result of the joining and, optionally, performing at least one operation on the generated data;

    each partition of the second table is joined with the first table during a different map processing task in the plurality of map processing tasks of a MapReduce process.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×