A machine learning approach for result caching in web search engines

作者:

Highlights:

• To the best of our knowledge, our work is therst in literature to apply machine learning techniques to the result caching problem in search engines, for both static, dynamic, and state-of-the-art static-dynamic cache organizations.

• We evaluate a large set of features and illustrate that they can be exploited to increase the hit rate of result caches.

• We evaluate various oracle caching strategies to illustrate the potential room for improvement in the result caching problem.

• We show that the proposed machine learning framework can improve the hit rate of result caches, potentially reducing the energy consumption in search engines.

摘要

•To the best of our knowledge, our work is therst in literature to apply machine learning techniques to the result caching problem in search engines, for both static, dynamic, and state-of-the-art static-dynamic cache organizations.•We evaluate a large set of features and illustrate that they can be exploited to increase the hit rate of result caches.•We evaluate various oracle caching strategies to illustrate the potential room for improvement in the result caching problem.•We show that the proposed machine learning framework can improve the hit rate of result caches, potentially reducing the energy consumption in search engines.

论文关键词:Query result caching,Machine learning,Feature-based caching,Static caching,Static-dynamic caching

论文评审过程:Received 5 September 2016, Revised 4 January 2017, Accepted 9 February 2017, Available online 11 March 2017, Version of Record 11 March 2017.

论文官网地址:https://doi.org/10.1016/j.ipm.2017.02.006