A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine

作者:

Highlights:

摘要

The aim of this study is to diagnosis of diabetes disease, which is one of the most important diseases in medical field using Generalized Discriminant Analysis (GDA) and Least Square Support Vector Machine (LS-SVM). Also, we proposed a new cascade learning system based on Generalized Discriminant Analysis and Least Square Support Vector Machine. The proposed system consists of two stages. The first stage, we have used Generalized Discriminant Analysis to discriminant feature variables between healthy and patient (diabetes) data as pre-processing process. The second stage, we have used LS-SVM in order to classification of diabetes dataset. While LS-SVM obtained 78.21% classification accuracy using 10-fold cross validation, the proposed system called GDA–LS-SVM obtained 82.05% classification accuracy using 10-fold cross validation. The robustness of the proposed system is examined using classification accuracy, k-fold cross-validation method and confusion matrix. The obtained classification accuracy is 82.05% and it is very promising compared to the previously reported classification techniques.

论文关键词:Generalized discriminant analysis,Least Square Support Vector Machine,Expert systems,Pima Indians diabetes dataset

论文评审过程:Available online 9 October 2006.

论文官网地址:https://doi.org/10.1016/j.eswa.2006.09.012