A relational model for XML structural joins and their size estimations

作者:Cheng Luo, Zhewei Jiang, Wen-Chi Hou, Feng Yan, Qiang Zhu

摘要

XML structural joins, which evaluate the containment (ancestor-descendant) relationships between XML elements, are important operations of XML query processing. Estimating structural join size accurately and quickly is crucial to the success of XML query plan selection and the query optimization. XML structural joins are essentially complex θ-joins, which render well-known estimation techniques for relational equijoins, such as discrete cosine transform, wavelet transform, and sketch, not applicable. In this paper, we model structural joins from a relational point of view and convert the complex θ-joins to equijoins so that those well-known estimation techniques become applicable to structural join size estimation. Theoretical analyses and extensive experiments have been performed on these estimation methods. It is shown that discrete cosine transform requires the least memory and yields the best estimates among the three techniques. Compared with state-of-the-art method IM-DA-Est, discrete cosine transform is much faster, requires less memory, and yields comparable estimates.

论文关键词:Semi-structured databases, XML databases, Query optimization, Selectivity estimation

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-007-0089-z