System and method for distributed database query engines

  • US 9,361,344 B2
  • Filed: 06/02/2015
  • Issued: 06/07/2016
  • Est. Priority Date: 01/07/2013
  • Status: Active Grant
First Claim

1. A system, comprising:

  • a gateway server configured to generate a plurality of partial queries from a database query for a database containing data stored in a distributed storage cluster that has a plurality of data nodes, and to construct a query result based on a plurality of intermediate results; and

    a plurality of worker nodes, the worker nodes being separate from the data nodes, wherein each worker node of the plurality of worker nodes is configured to process a respective partial query of the plurality of partial queries by scanning data related to the respective partial query and stored on at least one data node of the distributed storage cluster, and wherein each worker node of the plurality of worker nodes is further configured to generate an intermediate result of the plurality of intermediate results that is stored in a memory of that worker node;

    wherein the gateway server is further configured to identify a straggling worker node and further divide a partial query that is assigned to the straggling worker node into a plurality of subordinate partial queries and assign the plurality of subordinate partial queries to some of the plurality of worker nodes, and wherein the partial query is divided into the subordinate partial queries based on quantity and location information of input file blocks of the query.
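
The straggler handling recited in the last clause of claim 1 can be illustrated with a minimal sketch. The claim does not say how a straggling worker node is identified or exactly how quantity and location information is used, so everything below is an assumption made for illustration: the names `FileBlock`, `PartialQuery`, `find_stragglers`, and `split_partial_query` are hypothetical, a straggler is taken to be a worker whose reported progress falls below a fraction of the median, and blocks are ordered by data node (location) and then cut into near-equal slices (quantity).

```python
from dataclasses import dataclass
from statistics import median
from typing import Dict, List


@dataclass(frozen=True)
class FileBlock:
    """Hypothetical input file block; data_node is its location information."""
    block_id: str
    data_node: str


@dataclass
class PartialQuery:
    """Hypothetical partial query: an id plus the blocks it must scan."""
    query_id: str
    blocks: List[FileBlock]


def find_stragglers(progress: Dict[str, float], ratio: float = 0.5) -> List[str]:
    """Return workers whose progress is below ratio * median progress.

    This detection rule is an assumption; the claim only requires that a
    straggling worker node be identified.
    """
    if not progress:
        return []
    cutoff = ratio * median(progress.values())
    return sorted(w for w, p in progress.items() if p < cutoff)


def split_partial_query(pq: PartialQuery, num_parts: int) -> List[PartialQuery]:
    """Divide a partial query into subordinate partial queries.

    Blocks are sorted by data node so co-located blocks stay together
    (location), then cut into contiguous slices whose sizes differ by at
    most one block (quantity).
    """
    if num_parts < 1:
        raise ValueError("num_parts must be at least 1")
    ordered = sorted(pq.blocks, key=lambda b: (b.data_node, b.block_id))
    num_parts = max(1, min(num_parts, len(ordered)))
    base, extra = divmod(len(ordered), num_parts)
    subordinates = []
    start = 0
    for i in range(num_parts):
        size = base + (1 if i < extra else 0)
        subordinates.append(
            PartialQuery(f"{pq.query_id}.{i}", ordered[start:start + size])
        )
        start += size
    return subordinates


if __name__ == "__main__":
    # Worker w3 lags far behind the others and is flagged as a straggler.
    progress = {"w1": 0.9, "w2": 0.8, "w3": 0.1}
    print(find_stragglers(progress))

    # The partial query assigned to w3 is divided into two subordinate queries.
    nodes = ["dn1", "dn2", "dn1", "dn2", "dn1"]
    pq = PartialQuery("q7", [FileBlock(f"b{i}", n) for i, n in enumerate(nodes)])
    for sub in split_partial_query(pq, 2):
        print(sub.query_id, [(b.block_id, b.data_node) for b in sub.blocks])
```

In this sketch the gateway would then hand the subordinate partial queries to some of the remaining worker nodes; how those nodes are chosen is left open here, as it is in the claim.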

  • 2 Assignments