Exploring features for the automatic identification of user goals in web search

作者:

Highlights:

摘要

Queries submitted to search engines can be classified according to the user goals into three distinct categories: navigational, informational, and transactional. Such classification may be useful, for instance, as additional information for advertisement selection algorithms and for search engine ranking functions, among other possible applications. This paper presents a study about the impact of using several features extracted from the document collection and query logs on the task of automatically identifying the users’ goals behind their queries. We propose the use of new features not previously reported in literature and study their impact on the quality of the query classification task. Further, we study the impact of each feature on different web collections, showing that the choice of the best set of features may change according to the target collection.The results obtained indicate the new proposed set of features improves the quality of the classification task when compared to previous proposals. We report experiments with two web collections where we were able to obtain 82.5% and 77.67% of overall accuracy when classifying queries according to the three distinct user goals studied.

论文关键词:Information retrieval,Query classification,Machine learning,SVM,Web search

论文评审过程:Received 21 June 2008, Revised 26 August 2009, Accepted 3 September 2009, Available online 13 October 2009.

论文官网地址:https://doi.org/10.1016/j.ipm.2009.09.003