Scalable density-based clustering with quality guarantees using random projections

作者:Johannes Schneider, Michail Vlachos

摘要

Clustering offers significant insights in data analysis. Density-based algorithms have emerged as flexible and efficient techniques, able to discover high-quality and potentially irregularly shaped clusters. Here, we present scalable density-based clustering algorithms using random projections. Our clustering methodology achieves a speedup of two orders of magnitude compared with equivalent state-of-art density-based techniques, while offering analytical guarantees on the clustering quality in Euclidean space. Moreover, it does not introduce difficult to set parameters. We provide a comprehensive analysis of our algorithms and comparison with existing density-based algorithms.

论文关键词:Density-based clustering, Random projections, Nearest neighbors

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10618-017-0498-x