Some experiments in the use of clustering for data validation

作者：

Highlights：

•

摘要

Previous work has demonstrated the feasibility of using clustering to detect errors in small databases. Singleton clusters, containing only one record, often represent errors. Use of a new clustering algorithm oriented to this application provided improvements in time complexity without degradation of the error detection performance in a larger collection of data.

论文关键词：Clustering,data errors,data validation

论文评审过程：Received 25 October 1989, Revised 1 February 1990, Available online 10 June 2003.

论文官网地址：https://doi.org/10.1016/0306-4379(90)90026-L

原文链接
谷歌学术
必应学术
百度学术