An automatic sampling ratio detection method based on genetic algorithm for imbalanced data classification

作者:

Highlights:

• Random sampling ratio leads to poor and unstable classification performance.

• We propose algorithms for automatically determining sampling ratio of resampling.

• Three scenarios demonstrate proposed algorithms outperformed random sampling ratio.

摘要

•Random sampling ratio leads to poor and unstable classification performance.•We propose algorithms for automatically determining sampling ratio of resampling.•Three scenarios demonstrate proposed algorithms outperformed random sampling ratio.

论文关键词:Imbalanced data classification,Sampling methods,Sampling ratio,Genetic algorithm

论文评审过程:Received 12 April 2020, Revised 12 January 2021, Accepted 16 January 2021, Available online 2 February 2021, Version of Record 9 February 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.106800