Multi-dimensional selectivity estimation method using compressed histogram information
First Claim
1. A multi-dimensional selectivity estimation method using compressed histogram information to obtain the statistics approximating data distribution of a database for the estimation of database query selectivity, said method comprising the steps of:
- dividing the data distribution to generate a large number of small-sized multi-dimensional histogram buckets;
compressing the histogram information from the multi-dimensional histogram buckets using a multi-dimensional discrete cosine transform(DCT) and storing the compressed information; and
estimating the query selectivity by using the compressed and stored histogram information as the statistics.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is a multi-dimensional selectivity estimation method using compressed histogram information which the database query optimizer in a database management system uses to find the most efficient execution plan among all possible plans. The method includes the several steps to generate a large number of small-sized multi-dimensional histogram buckets, sampling DCT coefficients which have high values with high probability, compressing information from the multi-dimensional histogram buckets using a multi-dimensional discrete cosine transform(DCT) and storing compressed information, and estimating the query selectivity by using compressed and stored histogram information as the statistics.
131 Citations
6 Claims
-
1. A multi-dimensional selectivity estimation method using compressed histogram information to obtain the statistics approximating data distribution of a database for the estimation of database query selectivity, said method comprising the steps of:
-
dividing the data distribution to generate a large number of small-sized multi-dimensional histogram buckets;
compressing the histogram information from the multi-dimensional histogram buckets using a multi-dimensional discrete cosine transform(DCT) and storing the compressed information; and
estimating the query selectivity by using the compressed and stored histogram information as the statistics. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification