Comparisons of machine learning techniques for detecting malicious webpages

作者:

Highlights:

• 3 supervised and 2 unsupervised techniques are modeled to detect malicious webpages.

• Supervised machine learning (ML) techniques accuracies are above 89%.

• Unsupervised ML techniques accuracies have at least a silhouette coefficient of 0.87.

• Information obtained from URLs, page links, semantics and visual features of webpages.

• Chrome extension, lightweight and heavyweight classifiers and online learning are used.

摘要

•3 supervised and 2 unsupervised techniques are modeled to detect malicious webpages.•Supervised machine learning (ML) techniques accuracies are above 89%.•Unsupervised ML techniques accuracies have at least a silhouette coefficient of 0.87.•Information obtained from URLs, page links, semantics and visual features of webpages.•Chrome extension, lightweight and heavyweight classifiers and online learning are used.

论文关键词:K-Nearest Neighbor,Support Vector Machine,Naive Bayes,Affinity Propagation,K-Means,Supervised and unsupervised learning

论文评审过程:Available online 16 September 2014.

论文官网地址:https://doi.org/10.1016/j.eswa.2014.08.046