Hate speech detection is not as easy as you may think: A closer look at model validation (extended version)
作者:
Highlights:
• The state-of-the-art results are highly overestimated due to experimental issues.
• User distribution on datasets has an impact on the classification results.
• Better user-distributed datasets lead to better generalization.
• Improving English models generalization is a first step toward crosslingual models.
摘要
•The state-of-the-art results are highly overestimated due to experimental issues.•User distribution on datasets has an impact on the classification results.•Better user-distributed datasets lead to better generalization.•Improving English models generalization is a first step toward crosslingual models.
论文关键词:Hate speech classification,Experimental evaluation,Social media,Deep learning
论文评审过程:Received 14 November 2019, Revised 13 May 2020, Accepted 26 June 2020, Available online 30 June 2020, Version of Record 24 December 2021.
论文官网地址:https://doi.org/10.1016/j.is.2020.101584