Generation of pornographic blacklist and its incremental update using an inverse chi-square based method

作者:

Highlights:

摘要

This study presented an inverse chi-square based web content classification system that works along with an incremental update mechanism for incremental generation of pornographic blacklist. The proposed system, as indicated from the experimental results, can classify bilingual (English and Chinese) web pages at an average precision rate of 97.11%; while maintaining a favorably low false positive rate. Such satisfactory performance was obtained under a cost-effective parameter configuration used in inverse chi-square calculations. The proposed incremental update mechanism operates on the linking structure of pornographic hubs to locate newly added pornographic sites. The resulting blacklist has been empirically verified to be comparatively responsive to the growth dynamics of pornography sites than three public domain blacklists.

论文关键词:Pornographic blacklist,Incremental update,Web content classification,Inverse chi-square function

论文评审过程:Received 28 January 2008, Revised 29 April 2008, Accepted 2 May 2008, Available online 17 June 2008.

论文官网地址:https://doi.org/10.1016/j.ipm.2008.05.001