GoFast: Graph-based optimization for efficient and scalable query evaluation
Authors:
Highlights:
• We formalize a logical model describing query execution in RDF systems.
• We present the essential statistics collected for each graph fragment.
• We detail a cost model that compares execution plans based on disk and network costs.
• We study the problem of choosing the optimal execution plan.
• We establish the problem’s complexity and provide a branch-and-bound solution.
• We present a new strategy to eliminate irrelevant data fragments upstream.
Keywords: Optimization, RDF, SPARQL, Cardinality estimation, Cost model
Review history: Received 22 July 2020, Revised 16 December 2020, Accepted 7 February 2021, Available online 17 February 2021, Version of Record 26 February 2021.
Official paper URL: https://doi.org/10.1016/j.is.2021.101738