Cost-conscious comparison of supervised learning algorithms over multiple data sets

作者:

Highlights:

摘要

In the literature, there exist statistical tests to compare supervised learning algorithms on multiple data sets in terms of accuracy but they do not always generate an ordering. We propose Multi2Test, a generalization of our previous work, for ordering multiple learning algorithms on multiple data sets from “best” to “worst” where our goodness measure is composed of a prior cost term additional to generalization error. Our simulations show that Multi2Test generates orderings using pairwise tests on error and different types of cost using time and space complexity of the learning algorithms.

论文关键词:Machine learning,Statistical tests,Classifier comparison,Model selection,Model complexity

论文评审过程:Received 12 October 2010, Revised 25 September 2011, Accepted 7 October 2011, Available online 15 October 2011.

论文官网地址:https://doi.org/10.1016/j.patcog.2011.10.005