Multilabel classifiers with a probabilistic thresholding strategy

作者:

Highlights:

摘要

In multilabel classification tasks the aim is to find hypotheses able to predict, for each instance, a set of classes or labels rather than a single one. Some state-of-the-art multilabel learners use a thresholding strategy, which consists in computing a score for each label and then predicting the set of labels whose score is higher than a given threshold. When this score is the estimated posterior probability, the selected threshold is typically 0.5.In this paper we introduce a family of thresholding strategies which take into account the posterior probability of all possible labels to determine a different threshold for each instance. Thus, we exploit some kind of interdependence among labels to compute this threshold, which is optimal regarding a given expected loss function. We found experimentally that these strategies outperform other thresholding options for multilabel classification. They provide an efficient method to implement a learner which considers the interdependence among labels in the sense that the overall performance of the prediction of a set of labels prevails over that of each single label.

论文关键词:Multilabel classification,Thresholding strategies,Posterior probability,Expected loss

论文评审过程:Received 26 January 2011, Revised 7 July 2011, Accepted 5 August 2011, Available online 11 August 2011.

论文官网地址:https://doi.org/10.1016/j.patcog.2011.08.007