Polarity Classification of Twitter Messages using Audio Processing

作者：

Highlights：

•

摘要

Polarity classification is one of the most fundamental problems in sentiment analysis. In this paper, we propose a novel method, Sound Cosine Similaritye Matching, for polarity classification of Twitter messages which incorporates features based on audio data rather than on grammar or other text properties, i.e., eliminates the dependency on external dictionaries. It is useful especially for correctly identifying misspelled or shortened words that are frequently encountered in text from online social media. Method performance is evaluated in two levels: i) capture rate of the misspelled and shortened words, ii) classification performance of the feature set. Our results show that classification accuracy is improved, compared to two other models in the literature, when the proposed features are used.

论文关键词：Sentiment analysis,Twitter,Audio processing,Machine learning,Text normalization

论文评审过程：Received 21 November 2019, Revised 18 May 2020, Accepted 15 June 2020, Available online 15 July 2020, Version of Record 15 July 2020.

论文官网地址：https://doi.org/10.1016/j.ipm.2020.102346