Selecting features in microarray classification using ROC curves

作者:

Highlights:

摘要

We present a new method based on the ROC (Receiver Operating Characteristic) curve to efficiently select a feature subset in classifying a high-dimensional microarray dataset with a limited number of observations. Our method has two steps: (1) selecting the most relevant features to the target label using the ROC curve and (2) iteratively eliminating a redundant feature using the ROC curves. The ROC curve is strongly related with a non-parametric hypothesis testing, which must be effective for a dataset with small numerical observations. Experiments with real datasets revealed the significant performance advantage of our method over two competing feature subset selection methods.

论文关键词:Feature subset selection,cDNA microarray,ROC (Receiver Operating Characteristic) curve,Area between the ROC curve and the diagonal line (ARD),Area between the ROC curves (ABR),Non-parametric hypothesis testing,Binary classification

论文评审过程:Author links open overlay panelHiroshiMamitsuka

论文官网地址:https://doi.org/10.1016/j.patcog.2006.07.010