Approximating web communities using subspace decomposition

作者:

Highlights:

摘要

Herein, we propose an algorithm to approximate web communities from the topic related web pages. The approximation is achieved by subspace factorization of the topic related web pages. The factorization process reveals existing association between web pages such that the closely related web pages are extracted. We vary the approximation values to identify varied degrees of relationship between web pages. Experiments on real data sets show that the proposed algorithm reduces the impact of unrelated links and therefore can be used to control spam links in web pages.

论文关键词:Web communities,Web graphs,Subspace decomposition,Information retrieval,Community detection,Spam detection

论文评审过程:Received 22 October 2013, Revised 10 June 2014, Accepted 11 June 2014, Available online 19 June 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.06.017