Apparatus and Method for Operating a Distributed Database with Foreign Tables
First Claim
1. A system, comprising:
- a coordinator node; and
a plurality of worker nodes in communication with the coordinator node, wherein each worker node stores data blocks, wherein each data block has data in a semi-structured format and each data block has an associated foreign table declaration specifying conversion of the data in the semi-structured format into a tabular format interpretable by a query language;
a query processor executed by the coordinator node to produce a distributed query plan in response to a query language query, wherein the distributed query plan includes sub-queries;
wherein the sub-queries are executed by selected worker nodes of the plurality of worker nodes; and
wherein the selected worker nodes use foreign table declarations to convert data in semi-structured formats into tabular formats of a distributed database to provide tabular data in response to the query language query.
2 Assignments
0 Petitions
Accused Products
Abstract
A system includes a coordinator node and worker nodes in communication with the coordinator node. Each worker node stores data blocks. Each data block has data in a semi-structured format and each data block has an associated foreign table declaration specifying conversion of the data in the semi-structured format into a tabular format interpretable by a query language. A query processor executed by the coordinator node produces a distributed query plan in response to a query language query. The distributed query plan includes sub-queries. The sub-queries are executed by selected worker nodes of the worker nodes. The selected worker nodes use foreign table declarations to convert data in semi-structured formats into tabular formats of a distributed database to provide tabular data in response to the query language query.
111 Citations
37 Claims
-
1. A system, comprising:
-
a coordinator node; and a plurality of worker nodes in communication with the coordinator node, wherein each worker node stores data blocks, wherein each data block has data in a semi-structured format and each data block has an associated foreign table declaration specifying conversion of the data in the semi-structured format into a tabular format interpretable by a query language; a query processor executed by the coordinator node to produce a distributed query plan in response to a query language query, wherein the distributed query plan includes sub-queries; wherein the sub-queries are executed by selected worker nodes of the plurality of worker nodes; and wherein the selected worker nodes use foreign table declarations to convert data in semi-structured formats into tabular formats of a distributed database to provide tabular data in response to the query language query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer implemented method, comprising:
-
storing data blocks on networked worker nodes, wherein the data blocks have data in semi-structured formats; associating the data blocks with foreign table declarations specifying conversions of the data in the semi-structured formats into tabular formats interpretable by a query language; producing a distributed query plan in response to a query language query, wherein the distributed query plan includes sub-queries; directing the sub-queries to selected worker nodes of the networked worker nodes; and executing the sub-queries at the selected worker nodes, wherein executing includes using the foreign table declarations to convert data in semi-structured formats into tabular formats to produce tabular data responsive to the query language query; and merging the tabular data to produce a query result. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
Specification