LCA-based algorithms for efficiently processing multiple keyword queries over XML streams
作者:
Highlights:
•
摘要
In a stream environment, differently from traditional databases, data arrive continuously, unindexed and potentially unbounded, whereas queries must be evaluated for producing results on the fly. In this article, we propose two new algorithms (called SLCAStream and ELCAStream) for processing multiple keyword queries over XML streams. Both algorithms process keyword-based queries that require minimal or no schema knowledge to be formulated, follow the lowest common ancestor (LCA) semantics, and provide optimized methods to improve the overall performance. Moreover, SLCAStream, which implements the smallest LCA (SLCA) semantics, outperforms the state-of-the-art, with up to 49% reduction in response time and 36% in memory usage. In turn, ELCAStream is the first to explore the exclusive LCA (ELCA) semantics over XML streams.A comprehensive set of experiments evaluates several aspects related to performance and scalability of both algorithms, which shows they are effective alternatives to search services over XML streams.
论文关键词:Multi-query processing,Keyword-based queries,XML streams,LCA semantics
论文评审过程:Received 5 September 2014, Revised 18 December 2015, Accepted 13 March 2016, Available online 31 March 2016, Version of Record 16 May 2016.
论文官网地址:https://doi.org/10.1016/j.datak.2016.03.001