Latent linkage semantic kernels for collective classification of link data

作者:Yonghong Tian, Tiejun Huang, Wen Gao

摘要

Generally, links among objects demonstrate certain patterns and contain rich semantic clues. These important clues can be used to improve classification accuracy. However, many real-world link data may exhibit more complex regularity. For example, there may be some noisy links that carry no human editorial endorsement about semantic relationships. To effectively capture such regularity, this paper proposes latent linkage semantic kernels (LLSKs) by first introducing the linkage kernels to model the local and global dependency structure of a link graph and then applying the singular value decomposition (SVD) in the kernel-induced space. For the computational efficiency on large datasets, we also develop a block-based algorithm for LLSKs. A kernel-based contextual dependency network (KCDN) model is then presented to exploit the dependencies in a network of objects for collective classification. We provide experimental results demonstrating that the KCDN model, together with LLSKs, demonstrates relatively high robustness on the datasets with the complex link regularity, and the block-based computation method can scale well with varying sizes of the problem.

论文关键词:Kernel methods, Link regularity, Latent linkage semantic kernel, Kernel-based contextual dependency networks, Block-based link analysis, Collective classification

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-006-2208-9