Incorporating appraisal expression patterns into topic modeling for aspect and sentiment word identification
作者:
Highlights:
•
摘要
With the considerable growth of user-generated content, online reviews are becoming extremely valuable sources for mining customers’ opinions on products and services. However, most of the traditional opinion mining methods are coarse-grained and cannot understand natural languages. Thus, aspect-based opinion mining and summarization are of great interest in academic and industrial research. In this paper, we study an approach to extract product and service aspect words, as well as sentiment words, automatically from reviews. An unsupervised dependency analysis-based approach is presented to extract Appraisal Expression Patterns (AEPs) from reviews, which represent the manner in which people express opinions regarding products or services and can be regarded as a condensed representation of the syntactic relationship between aspect and sentiment words. AEPs are high-level, domain-independent types of information, and have excellent domain adaptability. An AEP-based Latent Dirichlet Allocation (AEP-LDA) model is also proposed. This is a sentence-level, probabilistic generative model which assumes that all words in a sentence are drawn from one topic – a generally true assumption, based on our observation. The model also assumes that every review corpus is composed of several mutually corresponding aspect and sentiment topics, as well as a background word topic. The AEP information is incorporated into the AEP-LDA model for mining aspect and sentiment words simultaneously. The experimental results on reviews of restaurants, hotels, MP3 players, and cameras show that the AEP-LDA model outperforms other approaches in identifying aspect and sentiment words.
论文关键词:Opinion mining,Appraisal expression pattern,Dependency analysis,Topic modeling,Aspect and sentiment analysis
论文评审过程:Received 12 March 2013, Revised 1 February 2014, Accepted 2 February 2014, Available online 11 February 2014.
论文官网地址:https://doi.org/10.1016/j.knosys.2014.02.003