Discovering better navigation sequences for the session construction problem

Authors:

Highlights:

Abstract

In this paper, we propose a novel page-view-based session model and session construction method to address the Web Usage Mining (WUM) problem. Simple session models treat a session as a sequence of web pages requested from the server (or served from a browser or proxy cache) and viewed in the browser, which does not guarantee a direct relationship between consecutive web pages in the session. We instead define a more realistic session model in which a session is a set of paths traversed in the web graph, corresponding to user navigation performed by following links on web pages. We define the session construction process from raw server logs as a new graph problem and present a novel algorithm, Smart-SRA (Smart Session Reconstruction Algorithm), to solve this problem efficiently. An experimental evaluation based on data collected from real web access scenarios showed that Smart-SRA produces more accurate user sessions than the session construction methods found in the literature.
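
The link-following property that distinguishes this session model from simple ones can be illustrated with a minimal sketch. It is not Smart-SRA and is not taken from the paper: the function name `is_link_path`, the adjacency-dictionary representation of the web graph, and the toy pages `A` to `D` are all assumptions made for illustration. The sketch only checks whether each consecutive pair of pages in a candidate session is joined by a hyperlink.

```python
from typing import Dict, List, Set


def is_link_path(pages: List[str], links: Dict[str, Set[str]]) -> bool:
    """Return True if every consecutive page pair is joined by a hyperlink.

    `links` maps a page to the set of pages it links to (hypothetical
    representation of the web graph, assumed for this sketch).
    """
    return all(b in links.get(a, set()) for a, b in zip(pages, pages[1:]))


# Hypothetical toy web graph: A links to B and C, B links to D.
links = {"A": {"B", "C"}, "B": {"D"}, "C": set(), "D": set()}

# A -> B -> D follows hyperlinks at every step.
print(is_link_path(["A", "B", "D"], links))

# A -> B -> C does not: B has no link to C, so a simple time-ordered
# model could still emit this sequence, but it is not a path in the graph.
print(is_link_path(["A", "B", "C"], links))
```

Under this assumed representation, a time-ordered request sequence such as A, B, C would be rejected as a single navigation path, which is the kind of case the abstract says simple session models do not rule out.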

Keywords: Web mining, Mining methods and algorithms

Article history: Received 23 June 2011, Revised 14 November 2011, Accepted 15 November 2011, Available online 2 December 2011.

Official paper URL: https://doi.org/10.1016/j.datak.2011.11.005