A valences-totaling model for English sentiment classification

作者:Vo Ngoc Phu, Vo Thi Ngoc Chau, Nguyen Duy Dat, Vo Thi Ngoc Tran, Tuan A. Nguyen

摘要

Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.

论文关键词:English document semantic classification, Cosine measure, Valences-totaling model

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-017-1054-0