A probabilistic clustering model for hate speech classification in twitter

作者:

Highlights:

• A probabilistic clustering model for hate speech classification in twitter was developed.

• The use of a naïve Bayes model to improve features representation.

• The use of a modified Jaccard similarity measure for clustering real-time tweet into topic clusters.

• The use of 4-level scale fuzzy model for hate speech classification.

• The Paired Sample t-Test validated the efficiency of the developed model.

摘要

•A probabilistic clustering model for hate speech classification in twitter was developed.•The use of a naïve Bayes model to improve features representation.•The use of a modified Jaccard similarity measure for clustering real-time tweet into topic clusters.•The use of 4-level scale fuzzy model for hate speech classification.•The Paired Sample t-Test validated the efficiency of the developed model.

论文关键词:Twitter,Hate speech,Fuzzy logic,Combinatorial algorithm,Bayesian function,Sentiment analysis

论文评审过程:Received 14 February 2020, Revised 12 February 2021, Accepted 18 February 2021, Available online 24 February 2021, Version of Record 8 March 2021.

论文官网地址:https://doi.org/10.1016/j.eswa.2021.114762