Self-labeling techniques for semi-supervised time series classification: an empirical study

作者:Mabel González, Christoph Bergmeir, Isaac Triguero, Yanet Rodríguez, José M. Benítez

摘要

An increasing amount of unlabeled time series data available render the semi-supervised paradigm a suitable approach to tackle classification problems with a reduced quantity of labeled data. Self-labeled techniques stand out from semi-supervised classification methods due to their simplicity and the lack of strong assumptions about the distribution of the labeled and unlabeled data. This paper addresses the relevance of these techniques in the time series classification context by means of an empirical study that compares successful self-labeled methods in conjunction with various learning schemes and dissimilarity measures. Our experiments involve 35 time series datasets with different ratios of labeled data, aiming to measure the transductive and inductive classification capabilities of the self-labeled methods studied. The results show that the nearest-neighbor rule is a robust choice for the base classifier. In addition, the amending and multi-classifier self-labeled-based approaches reveal a promising attempt to perform semi-supervised classification in the time series context.

论文关键词:Semi-supervised classification, Self-labeled, Time series classification, Semi-supervised learning, Self-training

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-017-1090-9