Multi-label classification via incremental clustering on an evolving data stream

作者:

Highlights:

• An incremental clustering-based multi-label online classification algorithm for multi-label data stream is proposed.

• To handle concept drift, our algorithm evolves with time, giving higher attention to more recent samples than older samples through a weight decay mechanism.

• Our algorithm dynamically determines the number of predicted labels based on Hoeffding inequality and the label cardinality.

• Extensive comparative experiments with the state-of-the-art algorithms validated the superior performance of our algorithm in both the stationary and concept drift settings.

摘要

•An incremental clustering-based multi-label online classification algorithm for multi-label data stream is proposed.•To handle concept drift, our algorithm evolves with time, giving higher attention to more recent samples than older samples through a weight decay mechanism.•Our algorithm dynamically determines the number of predicted labels based on Hoeffding inequality and the label cardinality.•Extensive comparative experiments with the state-of-the-art algorithms validated the superior performance of our algorithm in both the stationary and concept drift settings.

论文关键词:Multi-label classification,Incremental learning,Online learning,Clustering,Data stream,Concept drift

论文评审过程:Received 31 October 2018, Revised 5 March 2019, Accepted 1 June 2019, Available online 3 June 2019, Version of Record 11 June 2019.

论文官网地址:https://doi.org/10.1016/j.patcog.2019.06.001