A tag-topic model for blog mining

作者:

Highlights:

摘要

Blog mining addresses the problem of mining information from blog data. Although mining blogs may share many similarities to Web and text documents, existing techniques need to be reevaluated and adapted for the multidimensional representation of blog data, which exhibit dimensions not present in traditional documents, such as tags. Blog tags are semantic annotations in blogs which can be valuable sources of additional labels for the myriad of blog documents. In this paper, we present a tag-topic model for blog mining, which is based on the Author-Topic model and Latent Dirichlet Allocation. The tag-topic model determines the most likely tags and words for a given topic in a collection of blog posts. The model has been successfully implemented and evaluated on real-world blog data.

论文关键词:Blog mining,Weblog,Tags,Author-Topic model,Isomap,Latent Dirichlet Allocation

论文评审过程:Available online 31 October 2010.

论文官网地址:https://doi.org/10.1016/j.eswa.2010.10.025