Statistical analysis of mammographic features and its classification using support vector machine

Abstract

This study aims to design a support vector machine (SVM)-based classifier for breast cancer detection with a high degree of accuracy. It introduces a best-possible training scheme for the features extracted from mammograms: the kernel function is selected first, and a suitable training-test partition is then chosen. Prior to classification, a detailed statistical analysis (tests of significance and density estimation) is performed to identify how well the features discriminate between the malignant and benign classes. A comparative study is carried out with respect to diagnostic measures, namely the confusion matrix, sensitivity, and specificity. Two data sets from the UCI machine learning database are considered, with nine- and ten-dimensional feature spaces. The overall classification accuracy obtained with the proposed strategy is 99.385% for dataset-I and 93.726% for dataset-II.
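
The diagnostic measures named in the abstract can be illustrated with a minimal sketch. It assumes the usual definitions, with malignant taken as the positive class; the function names and the toy labels are hypothetical and are not taken from the paper or its data sets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    tp: int  # malignant cases predicted malignant
    fn: int  # malignant cases predicted benign
    fp: int  # benign cases predicted malignant
    tn: int  # benign cases predicted benign


def confusion_matrix(y_true, y_pred, positive="malignant"):
    """Count the four outcomes for a two-class problem."""
    tp = fn = fp = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return ConfusionMatrix(tp, fn, fp, tn)


def sensitivity(cm):
    # Fraction of malignant cases that were detected.
    return cm.tp / (cm.tp + cm.fn)


def specificity(cm):
    # Fraction of benign cases that were correctly cleared.
    return cm.tn / (cm.tn + cm.fp)


def accuracy(cm):
    # Fraction of all cases that were classified correctly.
    return (cm.tp + cm.tn) / (cm.tp + cm.fn + cm.fp + cm.tn)


# Toy labels, invented purely for illustration (not the UCI data).
y_true = ["malignant", "malignant", "benign", "benign", "benign"]
y_pred = ["malignant", "benign", "benign", "benign", "malignant"]
cm = confusion_matrix(y_true, y_pred)
```

Reporting sensitivity and specificity alongside overall accuracy matters in this setting because a missed malignant case and a false alarm on a benign case carry very different costs, and accuracy alone hides that difference.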

Keywords: Mammogram-based data, Statistical analysis, Support vector machine, Kernel function, Diagnostic measures

Review history: Available online 22 May 2009.

Paper URL: https://doi.org/10.1016/j.eswa.2009.05.045