Supporting K nearest neighbors query on high-dimensional data in P2P systems
作者:Mei Li, Wang-Chien Lee, Anand Sivasubramaniam, Jizhong Zhao
摘要
Peer-to-peer systems have been widely used for sharing and exchanging data and resources among numerous computer nodes. Various data objects identifiable with high dimensional feature vectors, such as text, images, genome sequences, are starting to leverage P2P technology. Most of the existing works have been focusing on queries on data objects with one or few attributes and thus are not applicable on high dimensional data objects. In this study, we investigate K nearest neighbors query (KNN) on high dimensional data objects in P2P systems. Efficient query algorithm and solutions that address various technical challenges raised by high dimensionality, such as search space resolution and incremental search space refinement, are proposed. An extensive simulation using both synthetic and real data sets demonstrates that our proposal efficiently supports KNN query on high dimensional data in P2P systems.
论文关键词:peer-to-peer network, K nearest neighbor algorithm, distributed systems, high dimensionality
论文评审过程:
论文官网地址:https://doi.org/10.1007/s11704-008-0026-7