Combining evidence for automatic Web session identification

作者:

Highlights:

摘要

Contextual information provides an important basis for identifying and understanding users' information needs. Our previous work in traditional information retrieval systems has shown how using contextual information could improve retrieval performance. With the vast quantity and variety of information available on the Web, and the short query lengths within Web searches, it becomes even more crucial that appropriate contextual information is extracted to facilitate personalized services. However, finding users' contextual information is not straightforward, especially in the Web search environment where less is known about the individual users. In this paper, we will present an approach that has significant potential for studying Web users' search contexts. The approach automatically groups a user's consecutive search activities on the same search topic into one session. It uses Dempster–Shafer theory to combine evidence extracted from two sources, each of which is based on the statistical data from Web search logs. The evaluation we have performed demonstrates that our approach has achieved a significant improvement over previous methods of session identification.

论文关键词:Session identification,Search context,Dempster–Shafer theory,Web user logs

论文评审过程:Accepted 25 October 2001, Available online 6 December 2001.

论文官网地址:https://doi.org/10.1016/S0306-4573(01)00060-7