A probabilistic clustering model for hate speech classification in twitter

作者：

Highlights：

• A probabilistic clustering model for hate speech classification in twitter was developed.

• The use of a naïve Bayes model to improve features representation.

• The use of a modified Jaccard similarity measure for clustering real-time tweet into topic clusters.

• The use of 4-level scale fuzzy model for hate speech classification.

• The Paired Sample t-Test validated the efficiency of the developed model.

摘要

•A probabilistic clustering model for hate speech classification in twitter was developed.•The use of a naïve Bayes model to improve features representation.•The use of a modified Jaccard similarity measure for clustering real-time tweet into topic clusters.•The use of 4-level scale fuzzy model for hate speech classification.•The Paired Sample t-Test validated the efficiency of the developed model.

论文关键词：Twitter,Hate speech,Fuzzy logic,Combinatorial algorithm,Bayesian function,Sentiment analysis

论文评审过程：Received 14 February 2020, Revised 12 February 2021, Accepted 18 February 2021, Available online 24 February 2021, Version of Record 8 March 2021.

论文官网地址：https://doi.org/10.1016/j.eswa.2021.114762