Database aggregation query result estimator
First Claim
1. A method of providing an estimated aggregation result for a set of values having first values and second values, the method comprising:
- obtaining a first result of aggregating the first values and a second result of aggregating the second values; and
estimating an aggregation result for the set of values based on the first and second results.
1 Assignment
0 Petitions
Accused Products
Abstract
Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data is extrapolated and added to the aggregated outlier values to provide an estimate for each aggregation query. Outlier values are identified by selecting values outside of a selected sliding window of data having the lowest variance. An index is created for the outlier values. The outlier data is removed from the window of data, and separately aggregated. The remaining data without the outliers is then sampled to provide a statistically relevant sample that is then aggregated and extrapolated to provide an estimate for the remaining data. This sampled estimate is combined with the outlier aggregate to form an estimate for the entire set of data.
33 Citations
10 Claims
-
1. A method of providing an estimated aggregation result for a set of values having first values and second values, the method comprising:
-
obtaining a first result of aggregating the first values and a second result of aggregating the second values; and
estimating an aggregation result for the set of values based on the first and second results. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
Specification