Instance weighting versus threshold adjusting for cost-sensitive classification

作者:Huimin Zhao

摘要

In real-world classification problems, different types of misclassification errors often have asymmetric costs, thus demanding cost-sensitive learning methods that attempt to minimize average misclassification cost rather than plain error rate. Instance weighting and post hoc threshold adjusting are two major approaches to cost-sensitive classifier learning. This paper compares the effects of these two approaches on several standard, off-the-shelf classification methods. The comparison indicates that the two approaches lead to similar results for some classification methods, such as Naïve Bayes, logistic regression, and backpropagation neural network, but very different results for other methods, such as decision tree, decision table, and decision rule learners. The findings from this research have important implications on the selection of the cost-sensitive classifier learning approach as well as on the interpretation of a recently published finding about the relative performance of Naïve Bayes and decision trees.

论文关键词:Classification, Cost-sensitive learning, Instance weighting, Threshold adjusting, ROC curve

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-007-0079-1