Optimizing pipelining result sets with fault tolerance in distributed query execution

  • US 10,366,084 B2
  • Filed: 09/15/2016
  • Issued: 07/30/2019
  • Est. Priority Date: 09/15/2016
  • Status: Active Grant
First Claim

1. A computer system comprising:

    one or more hardware processors;

    system memory coupled to the one or more hardware processors, the system memory storing instructions that are executable by the one or more hardware processors;

    the one or more hardware processors configured to execute the instructions stored in the system memory to pipeline result sets with fault tolerance in distributed query execution, including the following:

    access a job graph, the job graph indicating a plurality of bubbles, each bubble including one or more supervertices from a plurality of supervertices, each supervertex including one or more vertices of a same vertex type, the job graph divided into the plurality of bubbles based on determined resource consumption for each of the plurality of supervertices and dependencies between supervertices within the plurality of supervertices; and

    execute the job graph using resources of a distributed system including:

    for a bubble in the plurality of bubbles, streaming results from one supervertex within the bubble to another supervertex within the bubble via one of: memory or a network connection;

    for another bubble in the plurality of bubbles, storing other results from a supervertex within the another bubble to durable storage; and

    for a further bubble in the plurality of bubbles, accessing the other results from the durable storage.
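The execution pattern recited in claim 1 can be illustrated with a small sketch. This is not the patented implementation. It assumes a linear chain of supervertices (so dependencies are captured by list order), a hypothetical per-supervertex `cost` estimate, a greedy budget-based split into bubbles, and a local JSON file standing in for durable storage. All names (`Supervertex`, `partition_into_bubbles`, `run_bubble`, `execute`) are invented for illustration.

```python
import json
import os
import tempfile
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass
class Supervertex:
    """Hypothetical stand-in for a group of vertices of the same type."""
    name: str
    cost: int  # assumed resource-consumption estimate, arbitrary units
    transform: Callable[[Iterable[int]], Iterator[int]]


def partition_into_bubbles(chain: List[Supervertex], budget: int) -> List[List[Supervertex]]:
    """Greedy split (an assumption): close a bubble when the budget would be exceeded."""
    bubbles, current, used = [], [], 0
    for sv in chain:
        if current and used + sv.cost > budget:
            bubbles.append(current)
            current, used = [], 0
        current.append(sv)
        used += sv.cost
    if current:
        bubbles.append(current)
    return bubbles


def run_bubble(bubble: List[Supervertex], rows: Iterable[int]) -> Iterator[int]:
    """Within a bubble, results stream between supervertices via chained in-memory generators."""
    stream = iter(rows)
    for sv in bubble:
        stream = sv.transform(stream)
    return stream


def execute(bubbles: List[List[Supervertex]], source: Iterable[int], workdir: str) -> List[int]:
    """Between bubbles, results are written to a file and read back by the next bubble."""
    rows = list(source)
    for i, bubble in enumerate(bubbles):
        out = list(run_bubble(bubble, rows))
        path = os.path.join(workdir, f"bubble_{i}.json")
        with open(path, "w") as f:
            json.dump(out, f)    # store this bubble's results
        with open(path) as f:
            rows = json.load(f)  # the following bubble accesses the stored results
    return rows


# Illustrative three-stage chain; the transforms are arbitrary examples.
chain = [
    Supervertex("scan", 2, lambda rows: (r * 2 for r in rows)),
    Supervertex("filter", 2, lambda rows: (r for r in rows if r > 4)),
    Supervertex("project", 3, lambda rows: (r + 1 for r in rows)),
]
bubbles = partition_into_bubbles(chain, budget=4)

with tempfile.TemporaryDirectory() as workdir:
    result = execute(bubbles, [1, 2, 3, 4], workdir)

print([[sv.name for sv in b] for b in bubbles])  # [['scan', 'filter'], ['project']]
print(result)                                    # [7, 9]
```

With a budget of 4, `scan` and `filter` share one bubble and stream in memory, while `project` runs in a second bubble that reads the first bubble's output from the file. In a sketch like this, the stored file is what would let a failed later bubble be rerun without recomputing the earlier one.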
