Analysis of data complexity measures for classification
作者:
Highlights:
•
摘要
The study of data complexity metrics is an emergent area in the field of data mining and is focused on the analysis of several data set characteristics to extract knowledge from them. This information can be used to support the election of the proper classification algorithm.This paper addresses the analysis of the relationship between data complexity measures and classifiers behavior. Each one of the metrics is evaluated covering its range of values and studying the classifiers accuracy on these values.The results offer information about the usefullness of these measures, and which of them allow us to analyze the nature of the input data set and help us to decide which classification method could be the most promising one.
论文关键词:Data complexity,Class overlapping,Class separability,Classification
论文评审过程:Available online 7 March 2013.
论文官网地址:https://doi.org/10.1016/j.eswa.2013.02.025