Labelset topic model for multi-label document classification

作者:Ximing Li, Jihong Ouyang, Xiaotang Zhou

摘要

It has recently been suggested that assuming independence between labels is not suitable for real-world multi-label classification. To account for label dependencies, this paper proposes a supervised topic modeling algorithm, namely labelset topic model (LsTM). Our algorithm uses two labelset layers to capture label dependencies. LsTM offers two major advantages over existing supervised topic modeling algorithms: it is straightforward to interpret and it allows words to be assigned to combinations of labels, rather than a single label. We have performed extensive experiments on several well-known multi-label datasets. Experimental results indicate that the proposed model achieves performance on par with and often exceeding that of state-of-the-art methods both qualitatively and quantitatively.

论文关键词:Multi-label classification, Topic model, Labelset, Label dependency

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-014-0352-1