Mining web logs to improve hit ratios of prefetching and caching

作者:

Highlights:

摘要

In the Internet, proxy servers play the key roles between users and web sites, which could reduce the response time of user requests and save network bandwidth. Basically, an efficient buffer manager should be built in a proxy server to cache frequently accessed documents in the buffer, thereby achieving better response time. In the paper, we developed an access sequence miner to mine popular surfing 2-sequences with their conditional probabilities from the proxy log, and stored them in the rule table. Then, according to buffer contents and the rule table, a prediction-based buffer manager also developed here will make appropriate actions such as document caching, document prefetching, and even cache/prefetch buffer size adjusting to achieve better buffer utilization. Through the simulation, we found that our approach has much better performance than the other ones, in the quantitative measures such as hit ratios and byte hit ratios of accessed documents.

论文关键词:Web mining,Proxy servers,Caching,Prefetching,Web access prediction

论文评审过程:Received 4 August 2006, Revised 5 November 2006, Accepted 16 November 2006, Available online 8 December 2006.

论文官网地址:https://doi.org/10.1016/j.knosys.2006.11.004