AMRank: An adversarial Markov ranking model combining short- and long-term returns
作者:
Highlights:
• AMRank, a novel document ranking model, is proposed, which combines MDP and GAN.
• AMRank employs long- and short-term returns to improve decision making.
• A sequence discriminator is presented to generate long-term returns.
• AMRank can realize one-step update and output return with less variance.
摘要
•AMRank, a novel document ranking model, is proposed, which combines MDP and GAN.•AMRank employs long- and short-term returns to improve decision making.•A sequence discriminator is presented to generate long-term returns.•AMRank can realize one-step update and output return with less variance.
论文关键词:Document ranking,Learning to rank,Reinforcement learning,Discriminator
论文评审过程:Received 27 November 2021, Revised 24 July 2022, Accepted 9 August 2022, Available online 22 August 2022, Version of Record 30 August 2022.
论文官网地址:https://doi.org/10.1016/j.eswa.2022.118512