SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis
作者:
Highlights:
•
摘要
Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.
论文关键词:Sentiment analysis,Natural Language Processing (NLP),Movie reviews,Cornell,Feature selection,Support Vector Machine
论文评审过程:Received 22 August 2015, Revised 13 February 2016, Accepted 18 February 2016, Available online 27 February 2016, Version of Record 2 April 2016.
论文官网地址:https://doi.org/10.1016/j.knosys.2016.02.011