Two feature weighting approaches for naive Bayes text classifiers

作者:

Highlights:

摘要

This paper works on feature weighting approaches for naive Bayes text classifiers. Almost all existing feature weighting approaches for naive Bayes text classifiers have some defects: limited improvement to classification performance of naive Bayes text classifiers or sacrificing the simplicity and execution time of the final models. In fact, feature weighting is not new for machine learning community, and many researchers have made fruitful efforts in the field of feature weighting. This paper reviews some simple and efficient feature weighting approaches designed for standard naive Bayes classifiers, and adapts them for naive Bayes text classifiers. As a result, this paper proposes two adaptive feature weighting approaches for naive Bayes text classifiers. Experimental results based on benchmark and real-world data show that, compared to their competitors, our feature weighting approaches show higher classification accuracy, yet at the same time maintain the simplicity and lower execution time of the final models.

论文关键词:Naive Bayes text classifiers,Feature weighting,Gain ratio,Decision tree

论文评审过程:Received 23 November 2015, Revised 22 February 2016, Accepted 23 February 2016, Available online 2 March 2016, Version of Record 2 April 2016.

论文官网地址:https://doi.org/10.1016/j.knosys.2016.02.017