Hybrid online–offline learning to rank using simulated annealing strategy based on dependent click model
作者:Osman Ali Sadek Ibrahim, Eman M. G. Younis
摘要
Learning to rank (LTR) is the process of constructing a model for ranking documents or objects. It is useful for many applications such as Information retrieval (IR) and recommendation systems. This paper introduces a comparison between Offline and Online (LTR) for IR. It also proposes a novel Offline (1 + 1)-Simulated Annealing Strategy (SAS-Rank) and introduces the first Hybrid Online–Offline LTR techniques using SAS-Rank and ES-Rank with Online Dependent Click Model (DCM). SAS-Rank is a combination of Simulated Annealing method and Evolutionary Strategy. From the obtained experimental results, we can conclude that the Offline LTR techniques outperformed the well-known Online Dependent Click Model (DCM) technique. Moreover, the Hybrid Online–Offline SAS-Click outperformed the predictive ranking results on unseen data in most evaluation fitness metrics using LETOR 4 dataset compared to other approaches. On the other hand, Hybrid ES-Click is a competitive approach with SAS-Click in evolving ranking models for training and validation data. Regarding Offline LTR, the SAS-Rank outperformed the well-known ES-Rank which has been compared in previous studies with fourteen machine learning techniques. This research uses the best available Linear LTR approaches existing in the literature which are offline ES-Rank with Online DCM. The linear LTR approach output is a linear ranking model which can be represented as a vector of feature importance weights. This paper demonstrated the results and findings obtained using the LETOR 4 dataset, and Java Archive Package is provided for facilitating reproducible research.
论文关键词:Learning to rank, Simulated annealing strategy, Dependent click model, Hybrid online–offline learning
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10115-022-01726-0