A survey of statistical approaches for query expansion

Authors: Muhammad Ahsan Raza, Rahmah Mokhtar, Noraziah Ahmad

Abstract

A major obstacle to effective information retrieval is vocabulary mismatch. Query expansion addresses this problem by reformulating each search query with additional terms that better define the user's information needs. Many researchers have contributed to improving the accuracy of information retrieval systems through different approaches to query expansion. In this article, we primarily discuss statistical query expansion approaches, which include document analysis, search and browse log analysis, and web knowledge analysis. In addition to proposing a comprehensive classification of these approaches, we briefly analyse the pros and cons of each technique. Finally, we evaluate these techniques using five functional features and experimental settings, such as the TREC collections used and the performance metrics reported. An in-depth survey of the different statistical query expansion approaches suggests that the choice of the best approach depends on the type of search query, the nature and availability of data resources, and performance efficiency requirements.

Keywords: Information retrieval, Statistical approaches, Query expansion, Document analysis, Query log analysis

Paper URL: https://doi.org/10.1007/s10115-018-1269-8