Efficient top-k processing in large-scaled distributed environments

作者:

Highlights:

摘要

The rapid development of networking technologies has made it possible to construct a distributed database that involves a huge number of sites. Query processing in such a large-scaled system poses serious challenges beyond the scope of traditional distributed algorithms. In this paper, we propose a new algorithm BRANCA for performing top-k retrieval in these environments. Integrating two orthogonal methodologies “semantic caching” and “routing indexes”, BRANCA is able to solve a query by accessing only a small number of servers. Our algorithmic findings are accompanied with a solid theoretical analysis, which rigorously proves the effectiveness of BRANCA. Extensive experiments verify that our technique outperforms the existing methods significantly.

论文关键词:Top-k,Distributed database,Caching

论文评审过程:Received 5 October 2006, Revised 30 January 2007, Accepted 2 March 2007, Available online 2 April 2007.

论文官网地址:https://doi.org/10.1016/j.datak.2007.03.012