Discovering emotion and reasoning its flip in multi-party conversations using masked memory network and transformer

Abstract

Efficiently discovering a speaker’s emotional states in a multi-party conversation is important for designing human-like conversational agents. During a conversation, a speaker’s cognitive state often changes in response to certain past utterances, which may cause their emotional state to flip. Discovering the reasons (triggers) behind a speaker’s emotion flip is therefore essential for explaining the emotion labels of individual utterances. In this paper, along with addressing the task of emotion recognition in conversations (ERC), we introduce a novel task, Emotion-Flip Reasoning (EFR), which aims to identify the past utterances that triggered a speaker’s emotional state to flip at a given time. We propose a masked memory network for ERC and a Transformer-based network for EFR. To this end, we use MELD, a benchmark dataset for emotion recognition in multi-party conversations, for the ERC task and augment it with new ground-truth labels for EFR. An extensive comparison with five state-of-the-art models suggests that our models perform better on both tasks. We further present anecdotal evidence and both qualitative and quantitative error analyses to support the superiority of our models over the baselines.
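
To make the notion of an emotion flip concrete, the following minimal Python sketch locates flips in a labelled dialogue. It assumes, for illustration only, that a flip occurs whenever a speaker's emotion label differs from the label of that same speaker's previous utterance; the function name `find_emotion_flips`, the tuple layout, and the toy dialogue are hypothetical and are not taken from the paper. The sketch only finds where flips occur. It does not identify the trigger utterances, which is the part EFR addresses with a learned model.

```python
from typing import Dict, List, Tuple

# Hypothetical record layout: (speaker, emotion label), one per utterance, in order.
Utterance = Tuple[str, str]


def find_emotion_flips(dialogue: List[Utterance]) -> List[Tuple[int, str, str, str]]:
    """Return (utterance index, speaker, previous emotion, new emotion) per flip.

    Assumption: a flip is a change in a speaker's emotion label relative to
    that same speaker's most recent earlier utterance.
    """
    last_emotion: Dict[str, str] = {}  # most recent label seen for each speaker
    flips: List[Tuple[int, str, str, str]] = []
    for idx, (speaker, emotion) in enumerate(dialogue):
        previous = last_emotion.get(speaker)
        if previous is not None and previous != emotion:
            flips.append((idx, speaker, previous, emotion))
        last_emotion[speaker] = emotion
    return flips


# Made-up toy dialogue, not drawn from any dataset.
dialogue = [
    ("A", "neutral"),
    ("B", "joy"),
    ("A", "neutral"),
    ("B", "anger"),
    ("A", "sadness"),
]

flips = find_emotion_flips(dialogue)
for idx, speaker, before, after in flips:
    print(f"utterance {idx}: speaker {speaker} flips from {before} to {after}")
```

In this toy dialogue, speaker B flips at utterance 3 and speaker A at utterance 4; an EFR system would then have to decide which of the earlier utterances acted as triggers for each flip.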

Keywords: Emotion recognition, Emotion-Flip Reasoning, Multi-party conversations

Review history: Received 23 March 2021, Revised 29 October 2021, Accepted 30 December 2021, Available online 10 January 2022, Version of Record 22 January 2022.

Official paper URL: https://doi.org/10.1016/j.knosys.2021.108112