Combined mining of Web server logs and web contents for classifying user navigation patterns and predicting users’ future requests

作者:

Highlights:

摘要

We present a study of the automatic classification of web user navigation patterns and propose a novel approach to classifying user navigation patterns and predicting users’ future requests. The approach is based on the combined mining of Web server logs and the contents of the retrieved web pages. The textual content of web pages is captured through extraction of character N-grams, which are combined with Web server log files to derive user navigation profiles. The approach is implemented as an experimental system, and its performance is evaluated based on two tasks: classification and prediction. The system achieves the classification accuracy of nearly 70% and the prediction accuracy of about 65%, which is about 20% higher than the classification accuracy by mining Web server logs alone. This approach may be used to facilitate better web personalization and website organization.

论文关键词:Web usage mining,Web content mining,User navigation profiles,Classification,Prediction

论文评审过程:Received 19 August 2005, Revised 3 May 2006, Accepted 10 June 2006, Available online 7 July 2006.

论文官网地址:https://doi.org/10.1016/j.datak.2006.06.001