Reordering of database records for improved compression
First Claim
1. A system comprising:
a memory; and
a processor configured to:
prioritize a plurality of fields of an unordered set of data records from a structured database based on an impact of those fields to a performance metric for accessing data stored in a column-oriented compressed database;
select one or more of the prioritized fields based on priority of the prioritized fields, wherein the priority is based on a frequency of access of the fields;
sort the unordered set of data records based on the selected one or more prioritized fields for compression and optimal performance; and
compress data of the sorted data records according to a compression scheme to increase performance for accessing the data stored in the column-oriented compressed database.
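The four claimed steps (prioritize, select, sort, compress) can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that access frequency is read from a hypothetical log of field names, that records are plain dictionaries, and that run-length encoding stands in for the unspecified "compression scheme". The names `prioritize_fields`, `rle`, `sort_and_compress`, and `num_sort_fields` are illustrative only.

```python
from collections import Counter
from itertools import groupby


def prioritize_fields(access_log):
    """Rank fields from most to least frequently accessed.

    `access_log` is a hypothetical list of field names, one entry per access.
    """
    return [field for field, _ in Counter(access_log).most_common()]


def rle(column):
    """Run-length encode a list of values into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(column)]


def sort_and_compress(records, access_log, num_sort_fields=1):
    """Sort records on the highest-priority fields, then compress each column."""
    ranked = prioritize_fields(access_log)           # prioritize
    selected = ranked[:num_sort_fields]              # select
    ordered = sorted(records, key=lambda r: tuple(r[f] for f in selected))  # sort
    fields = list(records[0].keys()) if records else []
    # Compress column by column, as a column-oriented store would (RLE is an assumption).
    columns = {f: rle([r[f] for r in ordered]) for f in fields}
    return selected, columns


records = [
    {"region": "EU", "status": "open"},
    {"region": "US", "status": "closed"},
    {"region": "EU", "status": "closed"},
    {"region": "US", "status": "open"},
    {"region": "EU", "status": "open"},
]
access_log = ["region", "status", "region", "region"]

selected, columns = sort_and_compress(records, access_log)
runs_before = len(rle([r["region"] for r in records]))
runs_after = len(columns["region"])
print(selected, runs_before, runs_after)
```

In this toy data set the most frequently accessed field, `region`, collapses from five runs to two once the records are sorted on it, so a scan of that column touches fewer encoded entries.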
Abstract
According to embodiments of the present invention, apparatus, systems, methods, and computer program products are provided for sorting and compressing an unordered set of data records from a structured database. Fields of the unordered set of data records are prioritized based on an impact of those fields to a compression scheme for column-oriented compression. The unordered set of data records is sorted based on the prioritized field(s) with the greatest impact on the performance metric. Data of the sorted data records are compressed according to a compression scheme. In some embodiments, prioritizing the fields may be based on an anticipated level of usage of data within those fields and/or a cost function associated with a performance metric as well as optimization of compression. A performance metric may include a faster computational time, reduced I/O computation, faster scan time, etc.
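One way to read the "cost function associated with a performance metric" is as a usage-weighted estimate of how much encoded data a scan must touch. The sketch below is an assumption made for illustration, not the method disclosed in the specification. It scores each candidate sort field by the weighted number of runs left in every column after sorting on that field, and picks the field with the lowest score. The weights in `usage_weights` are hypothetical anticipated-usage values, and `run_count`, `weighted_cost`, and `best_sort_field` are illustrative names.

```python
from itertools import groupby


def run_count(values):
    """Count runs of equal adjacent values (a proxy for run-length-encoded size)."""
    return sum(1 for _ in groupby(values))


def weighted_cost(records, sort_field, usage_weights):
    """Usage-weighted total run count over all columns after sorting on one field."""
    ordered = sorted(records, key=lambda r: r[sort_field])
    return sum(
        weight * run_count([r[field] for r in ordered])
        for field, weight in usage_weights.items()
    )


def best_sort_field(records, usage_weights):
    """Pick the candidate field whose sort order minimizes the weighted cost."""
    return min(usage_weights, key=lambda f: weighted_cost(records, f, usage_weights))


records = [
    {"city": "Oslo", "year": 2021},
    {"city": "Rome", "year": 2020},
    {"city": "Oslo", "year": 2020},
    {"city": "Rome", "year": 2021},
]
# Hypothetical anticipated usage: "city" is read nine times as often as "year".
usage_weights = {"city": 9, "year": 1}

cost_city = weighted_cost(records, "city", usage_weights)
cost_year = weighted_cost(records, "year", usage_weights)
chosen = best_sort_field(records, usage_weights)
print(cost_city, cost_year, chosen)
```

Both fields compress equally well when sorted on, so the usage weights decide the outcome here: sorting on `city` leaves the heavily used column with the fewest runs.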
12 Claims
1. A system comprising:
a memory; and
a processor configured to:
prioritize a plurality of fields of an unordered set of data records from a structured database based on an impact of those fields to a performance metric for accessing data stored in a column-oriented compressed database;
select one or more of the prioritized fields based on priority of the prioritized fields, wherein the priority is based on a frequency of access of the fields;
sort the unordered set of data records based on the selected one or more prioritized fields for compression and optimal performance; and
compress data of the sorted data records according to a compression scheme to increase performance for accessing the data stored in the column-oriented compressed database.
Dependent claims: 2, 3, 4, 5, 6
7. A computer program product for providing information to a user, comprising a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code, when executed by a processor, causes the processor to:
prioritize a plurality of fields of an unordered set of data records from a structured database based on an impact of those fields to a performance metric for accessing data stored in a column-oriented compressed database;
select one or more of the prioritized fields based on priority of the prioritized fields, wherein the priority is based on a frequency of access of the fields;
sort the unordered set of data records based on the selected one or more prioritized fields for compression and optimal performance; and
compress data of the sorted data records according to a compression scheme to increase performance for accessing the data stored in the column-oriented compressed database.
Dependent claims: 8, 9, 10, 11, 12
Specification