Effective hate-speech detection in Twitter data using recurrent neural networks

作者:Georgios K. Pitsilis, Heri Ramampiaro, Helge Langseth

摘要

This paper addresses the important problem of discerning hateful content in social media. We propose a detection scheme that is an ensemble of Recurrent Neural Network (RNN) classifiers, and it incorporates various features associated with user-related information, such as the users’ tendency towards racism or sexism. This data is fed as input to the above classifiers along with the word frequency vectors derived from the textual content. We evaluate our approach on a publicly available corpus of 16k tweets, and the results demonstrate its effectiveness in comparison to existing state-of-the-art solutions. More specifically, our scheme can successfully distinguish racism and sexism messages from normal text, and achieve higher classification quality than current state-of-the-art algorithms.

论文关键词:Text classification, Micro-blogging, Hate-speech, Deep learning, Recurrent neural networks, Twitter

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-018-1242-y