A complete framework for accurate recognition and prognosis of COVID-19 patients based on deep transfer learning and feature classification approach

Authors: Hossam Magdy Balaha, Eman M. El-Gendy, Mahmoud M. Saafan

Abstract

The sudden appearance of COVID-19 has put the world in a serious situation. Due to the rapid spread of the virus and the rising numbers of infected patients and deaths, COVID-19 was declared a pandemic. The pandemic has had a destructive effect not only on people but also on the economy. Despite the development and availability of several COVID-19 vaccines, scientists still warn of new severe waves of the virus, so fast diagnosis of COVID-19 remains a critical issue. Chest imaging has proved to be a powerful tool for the early detection of COVID-19. This study introduces a complete framework for the early detection of COVID-19 and the early prognosis of its severity in diagnosed patients using laboratory test results. It consists of two phases: (1) the Early Diagnostic Phase (EDP) and (2) the Early Prognostic Phase (EPP). In EDP, COVID-19 patients are diagnosed using chest computed tomography (CT) images. In the current study, 5,159 COVID-19 and 10,376 normal CT images of Egyptians were used as a dataset to train 7 different convolutional neural networks using transfer learning. Normal data augmentation techniques and generative adversarial networks (GANs), namely CycleGAN and CCGAN, were used to increase the number of images in the dataset and avoid overfitting. In total, 28 experiments were conducted and multiple performance metrics were recorded. Classification with no augmentation yielded \(99.61\%\) accuracy with the EfficientNetB7 architecture. With CycleGAN and CCGAN augmentation, the maximum reported accuracies were \(99.57\%\) and \(99.14\%\), obtained by the MobileNetV1 and VGG-16 architectures, respectively. In EPP, the severity of COVID-19 in patients is predicted early using laboratory test results. In this study, 25 different classification techniques were applied; the highest accuracies were \(98.70\%\) and \(97.40\%\), reported by the Ensemble Bagged Trees and Tree (Fine, Medium, and Coarse) techniques, respectively.
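
The abstract does not say how the 28 experiments are distributed. One reading consistent with its numbers is that each of the 7 architectures was trained under 4 augmentation settings (none, normal augmentation, CycleGAN, CCGAN). The sketch below only enumerates that assumed grid. The abstract names just three of the seven architectures, so the remaining entries are hypothetical placeholders, and the setting labels are illustrative rather than taken from the paper.

```python
from itertools import product

# Only the first three architectures are named in the abstract;
# the "placeholder_*" entries are hypothetical stand-ins for the other four.
ARCHITECTURES = [
    "EfficientNetB7", "MobileNetV1", "VGG-16",
    "placeholder_4", "placeholder_5", "placeholder_6", "placeholder_7",
]

# Assumed augmentation settings (illustrative labels, not the paper's).
AUGMENTATIONS = ["none", "normal", "CycleGAN", "CCGAN"]


def experiment_grid(architectures, augmentations):
    """Return every (architecture, augmentation) pair as a list of tuples."""
    return list(product(architectures, augmentations))


experiments = experiment_grid(ARCHITECTURES, AUGMENTATIONS)
print(len(experiments))  # 7 architectures x 4 settings
```

Under this assumption, the three accuracies quoted in the abstract correspond to the best cell in the "none", "CycleGAN", and "CCGAN" columns of the grid.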

Keywords: Computed tomography (CT), Convolutional neural network (CNN), COVID-19, Data augmentation, Generative adversarial networks (GAN), Image classification, Machine learning (ML), Transfer learning (TL)

Official paper URL: https://doi.org/10.1007/s10462-021-10127-8