Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets

作者:

Highlights:

• A thorough analysis of oversampling for handling multi-class imbalanced datasets.

• Proposition to detect underlying structures and example types in considered classes.

• Smart oversampling based on extracted knowledge about imbalance distribution types.

• In-depth insight into the importance of selecting proper examples for oversampling.

• Guidelines that allow to design efficient classifiers for multi-class imbalanced data.

摘要

Highlights•A thorough analysis of oversampling for handling multi-class imbalanced datasets.•Proposition to detect underlying structures and example types in considered classes.•Smart oversampling based on extracted knowledge about imbalance distribution types.•In-depth insight into the importance of selecting proper examples for oversampling.•Guidelines that allow to design efficient classifiers for multi-class imbalanced data.

论文关键词:Machine learning,Imbalanced classification,Multi-class imbalance,Oversampling,Minority class types

论文评审过程:Received 29 July 2015, Revised 5 March 2016, Accepted 8 March 2016, Available online 16 March 2016, Version of Record 6 May 2016.

论文官网地址:https://doi.org/10.1016/j.patcog.2016.03.012