Learning from Imbalanced Data Sets with Weighted Cross-Entropy Function

作者:Yuri Sousa Aurelio, Gustavo Matheus de Almeida, Cristiano Leite de Castro, Antonio Padua Braga

摘要

This paper presents a novel approach to deal with the imbalanced data set problem in neural networks by incorporating prior probabilities into a cost-sensitive cross-entropy error function. Several classical benchmarks were tested for performance evaluation using different metrics, namely G-Mean, area under the ROC curve (AUC), adjusted G-Mean, Accuracy, True Positive Rate, True Negative Rate and F1-score. The obtained results were compared to well-known algorithms and showed the effectiveness and robustness of the proposed approach, which results in well-balanced classifiers given different imbalance scenarios.

论文关键词:Multilayer perceptron, Imbalanced data, Classification problem, Back-propagation, Cost-sensitive function

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11063-018-09977-1