Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values

作者:

Highlights:

摘要

Each type of classifier has its own advantages as well as certain shortcomings. In this paper, we take the advantages of the associative classifier and the Naïve Bayes Classifier to make up the shortcomings of each other, thus improving the accuracy of text classification. We will classify the training cases with the Naïve Bayes Classifier and set different confidence threshold values for different class association rules (CARs) to different classes by the obtained classification accuracy rate of the Naïve Bayes Classifier to the classes. Since the accuracy rates of all selected CARs of the class are higher than that obtained by the Naïve Bayes Classifier, we could further optimize the classification result through these selected CARs. Moreover, for those unclassified cases, we will classify them with the Naïve Bayes Classifier. The experimental results show that combining the advantages of these two different classifiers better classification result can be obtained than with a single classifier.

论文关键词:Association classification,Text classification,Text mining,Text categorization

论文评审过程:Received 24 November 2009, Revised 25 March 2010, Accepted 4 April 2010, Available online 28 April 2010.

论文官网地址:https://doi.org/10.1016/j.knosys.2010.04.004