Reddit entity linking dataset
Authors:
Highlights:
• We release a new entity linking dataset taken from Reddit.
• Human annotators perform well at the task even when given a broad definition of the goal.
• Thorough evaluation found that existing entity linkers perform poorly on this new dataset.
• New models are needed to extract information from social media commentary.
Keywords: Entity linking, Dataset, Natural language processing
Review history: Received 3 September 2020, Revised 10 December 2020, Accepted 17 December 2020, Available online 5 February 2021, Version of Record 5 February 2021.
Official paper URL: https://doi.org/10.1016/j.ipm.2020.102479