Query selectivity estimation with confidence interval
First Claim
Patent Images
1. A method that estimates a selectivity of a query expression on a database comprising:
- deriving a probability distribution for possible selectivity values for the query expression; and
evaluating the probability distribution using a desired selectivity confidence to derive the estimated selectivity.
3 Assignments
0 Petitions
Accused Products
Abstract
Selectivity estimates are produced that meet a desired confidence threshold. To determine the confidence level of a given selectivity estimate for a query expression, the query expression is evaluated on a sample tuples. A probability density function is derived based on the number of tuples in the sample that satisfy the query expression. The cumulative distribution for the probability density function is solved for the given threshold to determine a selectivity estimate at the given confidence value.
43 Citations
29 Claims
-
1. A method that estimates a selectivity of a query expression on a database comprising:
-
deriving a probability distribution for possible selectivity values for the query expression; and
evaluating the probability distribution using a desired selectivity confidence to derive the estimated selectivity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for estimating a selectivity of a query expression executed on a database given a desired selectivity confidence comprising:
-
a query expression evaluator that evaluates the query expression on a sample of database tuples to determine a an observed selectivity;
a probability distribution derivation module that derives a probability distribution of possible selectivity values for the query expression based on the observed selectivity; and
a probability distribution resolver that determines an estimated selectivity based on the desired selectivity confidence and the probability distribution. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. An apparatus for estimating a selectivity of a query expression executed on a database that stores tuples given a desired selectivity confidence comprising:
-
means for evaluating the query expression on a sample of database tuples to determine an observed selectivity;
means for deriving a probability distribution of possible selectivity values for the query expression based on the observed selectivity; and
means for determining a selectivity based on the desired selectivity confidence and the probability distribution. - View Dependent Claims (19, 20, 21, 22, 23)
-
-
24. One or more computer readable media having computer-executable instructions stored thereon for estimating a selectivity with a desired selectivity confidence for a query expression executed on a database, the instructions comprising:
-
evaluating the query expression on a sample of database tuples to determine an observed selectivity;
deriving a probability distribution of possible selectivity values for the query expression based on the observed selectivity; and
determining a selectivity based on the desired selectivity confidence and the probability distribution. - View Dependent Claims (25, 26, 27, 28, 29)
-
Specification