Hierarchy of Servers for Query Processing of Column Chunks in a Distributed Column Chunk Data Store
First Claim
1. A method comprising:
- partitioning, by one or more computing devices, each column of a data table comprising one or more columns and one or more rows into two or more column chunks of a plurality of column chunks;
constructing, by the one or more computing devices, a plurality of parity column chunks, wherein each parity column chunk is computed using two or more column chunks of the plurality of column chunks;
individually distributing and storing, by the one or more computing devices, the plurality of column chunks and the plurality of parity column chunks across a plurality of storage servers.
3 Assignments
0 Petitions
Accused Products
Abstract
An improved system and method for query processing in a distributed column chunk data store is provided. A distributed column chunk data store may be provided by multiple storage servers operably coupled to a network. A storage server provided may include a database engine for partitioning a data table into the column chunks for distributing across multiple storage servers, a storage shared memory for storing the column chunks during processing of semantic operations performed on the column chunks, and a storage services manager for striping column chunks of a partitioned data table across multiple storage servers. Query processing may be performed by storage servers or query processing servers operably coupled by a network to storage servers in the column chunk data store. To do so, a hierarchy of servers may be dynamically determined to process execution steps of a query transformed for distributed processing.
-
Citations
20 Claims
-
1. A method comprising:
-
partitioning, by one or more computing devices, each column of a data table comprising one or more columns and one or more rows into two or more column chunks of a plurality of column chunks; constructing, by the one or more computing devices, a plurality of parity column chunks, wherein each parity column chunk is computed using two or more column chunks of the plurality of column chunks; individually distributing and storing, by the one or more computing devices, the plurality of column chunks and the plurality of parity column chunks across a plurality of storage servers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising:
-
partitioning, by a first server, each column of a data table comprising one or more columns and one or more rows into two or more column chunks of a plurality of column chunks; constructing, by the first server, a plurality of parity column chunks, wherein each parity column chunk is computed using two or more column chunks of the plurality of column chunks; individually distributing and storing, by the first server, the plurality of column chunks and the plurality of parity column chunks across a plurality of second servers; receiving, at the first server, a request for one or more data of the data table; transforming, by the first server, the query into a plurality of sub-queries; distributing, by the first server, the plurality of sub-queries across a plurality of third servers; receiving, at the first server, a plurality of sub-results from the plurality of third servers; constructing, by the first server, a result comprising the one or more data by combining the plurality of sub-results; transmitting, by the first server, the result in response to the query. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. One or more computer-readable non-transitory storage media embodying software operable when executed by one or more computer systems to:
-
partition each column of a data table comprising one or more columns and one or more rows into two or more column chunks of a plurality of column chunks; construct a plurality of parity column chunks, wherein each parity column chunk is computed using two or more column chunks of the plurality of column chunks; individually distribute and store the plurality of column chunks and the plurality of parity column chunks across a plurality of storage servers. - View Dependent Claims (19, 20)
-
Specification