Query Size Estimation by Adaptive Sampling
作者:
Highlights:
•
摘要
We present an adaptive, random sampling algorithm for estimating the size of general queries. The algorithm can be used for any query D over a database D such that (1) for some n, the answer to L can be partitioned into n disjoint subsets L1, L2, ..., Ln, and (2) for 1 ≤ i ≤ n, the size of Li, is bounded by some function b(D, L), and (3) there is some algorithm by which we can compute the size of Li, where i is chosen randomly. We consider the performance of the algorithm on three special cases of the algorithm: join queries, transitive closure queries, and general recursive Datalog queries.
论文关键词:
论文评审过程:Available online 25 May 2002.
论文官网地址:https://doi.org/10.1006/jcss.1995.1050