An empirical study of the design choices for local citation recommendation systems
作者:
Highlights:
• Certain design choices affect the accuracy of local citation recommendation systems.
• Training reranker model with a strict regime improves the model’s performance.
• Triplet-based reranking models benefit from non-random negative sampling strategies.
• The best negative sampling strategy for triplet construction depends on the dataset.
摘要
•Certain design choices affect the accuracy of local citation recommendation systems.•Training reranker model with a strict regime improves the model’s performance.•Triplet-based reranking models benefit from non-random negative sampling strategies.•The best negative sampling strategy for triplet construction depends on the dataset.
论文关键词:Natural language processing,Citation recommendation,Information retrieval,BM25,SPECTER,Negative sampling
论文评审过程:Received 30 July 2021, Revised 18 January 2022, Accepted 7 March 2022, Available online 19 March 2022, Version of Record 29 March 2022.
论文官网地址:https://doi.org/10.1016/j.eswa.2022.116852