Cancer classification with data augmentation based on generative adversarial networks

作者:Kaimin Wei, Tianqi Li, Feiran Huang, Jinpeng Chen, Zefan He

摘要

Accurate diagnosis is a significant step in cancer treatment. Machine learning can support doctors in prognosis decision-making, and its performance is always weakened by the high dimension and small quantity of genetic data. Fortunately, deep learning can effectively process the high dimensional data with growing. However, the problem of inadequate data remains unsolved and has lowered the performance of deep learning. To end it, we propose a generative adversarial model that uses non target cancer data to help target generator training. We use the reconstruction loss to further stabilize model training and improve the quality of generated samples. We also present a cancer classification model to optimize classification performance. Experimental results prove that mean absolute error of cancer gene made by our model is 19.3% lower than DC-GAN, and the classification accuracy rate of our produced data is higher than the data created by GAN. As for the classification model, the classification accuracy of our model reaches 92.6%, which is 7.6% higher than the model without any generated data.

论文关键词:data mining, cancer data analysis, deep learning, generative adversarial networks

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11704-020-0025-x