COLUMN ORDERING FOR INPUT/OUTPUT OPTIMIZATION IN TABULAR DATA
First Claim
1. A system comprising:
- a processor;
a memory storing a column ordering framework, for actuation by the processor, the column ordering framework comprising;
a workload manager module configured process data regarding a first column layout of a tabular storage to determine a query workload associated with the first column layout; and
an order generating module configured to calculate a second column layout for the tabular storage based at least in part on the query workload.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer-readable media for determining column ordering of a data storage table for search optimization are described herein. In some examples, a computing system is configured to receive input containing statistics of a plurality of queries. The computing system can then determine a new column order (i.e., layout) based at least in part on the statistics. In some example techniques described herein, the computing system can determine the new column order based at least in part on the hardware components storing the data storage table, storage system parameters, and/or user preference information. Example techniques described herein can apply the new column order to data subsequently added to the data storage table. Example techniques described herein can apply the new column order to existing data in the data storage table.
-
Citations
10 Claims
-
1. A system comprising:
-
a processor; a memory storing a column ordering framework, for actuation by the processor, the column ordering framework comprising; a workload manager module configured process data regarding a first column layout of a tabular storage to determine a query workload associated with the first column layout; and an order generating module configured to calculate a second column layout for the tabular storage based at least in part on the query workload. - View Dependent Claims (2, 3)
-
-
4. A computer-implemented method, comprising:
-
ascertaining a first order for columns of a table, the table comprising a plurality of columns; receiving statistics associated with one or more queries; determining a first cost for the one or more queries based at least in part on the statistics; determining a second order for columns of the table, the second order having a lower cost than the first cost; and applying the second order to create an ordered table. - View Dependent Claims (5, 6, 7)
-
-
8. A computer-readable medium having thereon computer-executable instructions, the computer-executable instructions responsive to execution configuring a device to perform operations comprising:
-
accessing a table comprising a first plurality of columns; identifying at least two related columns of the first plurality of columns; determining whether the at least two related columns of the first plurality of columns are close to one another; responsive to determining the at least two related columns of the first plurality of columns are not close to one another, determining a new order for the first plurality of columns based at least in part on locating the at least two related columns close to one another; and applying the new order to a second plurality of columns to create an ordered table, the second plurality of columns being added to the table subsequent to the determining the new order for the first plurality of columns. - View Dependent Claims (9, 10)
-
Specification