Evaluation of online emoji description resources for sentiment analysis purposes

作者:

Highlights:

摘要

Emoji sentiment analysis is a relevant research topic nowadays, for which emoji sentiment lexica are key assets. Manual annotation affects directly their quality (where high quality usually corresponds to high self-agreement and inter-agreement).In this work we present an unsupervised methodology to evaluate emoji sentiment lexica generated from online resources, based on a correlation analysis between a gold standard and the scores resulting from the sentiment analysis of the emoji descriptions in those resources. We consider in our study four such online resources of emoji descriptions: Emojipedia, Emojis.wiki, CLDR emoji character annotations and iEmoji. These resources provide knowledge about real (intended) emoji meanings from different author approaches and perspectives. We also present the automatic creation of a joint lexicon where the sentiment of a given emoji is obtained by averaging its scores from the unsupervised analysis of all the resources involved. The results for the joint lexicon are highly promising, suggesting that valuable subjective information can be inferred from authors’ descriptions in online resources.

论文关键词:Natural Language Processing,Sentiment analysis,Emoji,Emoji lexica,Social media

论文评审过程:Received 30 July 2020, Revised 15 January 2021, Accepted 22 May 2021, Available online 26 June 2021, Version of Record 12 July 2021.

论文官网地址:https://doi.org/10.1016/j.eswa.2021.115279