Detecting health misinformation in online health communities: Incorporating behavioral features into machine learning based approaches

作者：

Highlights：

• Based on the Elaboration Likelihood Model (ELM), the features of online health misinformation can be classified into two levels: central-level and peripheral-level.

• Four types of misinformation appear in online health communities: advertising, propaganda, misleading information, and unrelated information.

• We built a health misinformation detection model integrating the linguistic features, the topic features, the sentiment features, and the behavioral features.

• The proposed model, as well as the features, were validated on a real-world dataset, being able to correctly detect about 85% of health misinformation.

• The behavioral features are more informative than linguistic features in detecting health misinformation in online health communities.

摘要

•Based on the Elaboration Likelihood Model (ELM), the features of online health misinformation can be classified into two levels: central-level and peripheral-level.•Four types of misinformation appear in online health communities: advertising, propaganda, misleading information, and unrelated information.•We built a health misinformation detection model integrating the linguistic features, the topic features, the sentiment features, and the behavioral features.•The proposed model, as well as the features, were validated on a real-world dataset, being able to correctly detect about 85% of health misinformation.•The behavioral features are more informative than linguistic features in detecting health misinformation in online health communities.

论文关键词：Health misinformation,Misinformation detection,Online health community

论文评审过程：Received 28 January 2020, Revised 9 September 2020, Accepted 15 September 2020, Available online 6 October 2020, Version of Record 6 October 2020.

论文官网地址：https://doi.org/10.1016/j.ipm.2020.102390