×

BACKGROUND FORMAT OPTIMIZATION FOR ENHANCED SQL-LIKE QUERIES IN HADOOP

  • US 20150095308A1
  • Filed: 10/01/2013
  • Published: 04/02/2015
  • Est. Priority Date: 10/01/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system for performing queries on stored data in a distributed computing cluster of a plurality of data nodes, comprising:

  • a query engine for each data node, having;

    a query planner that parses a query from a client to create query fragments based on a schema specifying one or more formats in which data is stored on the data nodes,wherein, when data in a target format is stored, the query fragments are created for the target format, and when data in the target format is not stored, the query fragments are created for another format;

    a query coordinator that distributes the query fragments among the plurality of data nodes; and

    a query execution engine comprising;

    a transformation module that transforms the data in the format for which the query fragments are created based on the schema; and

    an execution module that executes the query fragments on the transformed data to obtain intermediate results that are aggregated and returned to the client.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×