Some experiments in the use of clustering for data validation

Authors:

Abstract

Previous work has demonstrated the feasibility of using clustering to detect errors in small databases. Singleton clusters, which contain only one record, often represent errors. A new clustering algorithm designed for this application improved time complexity without degrading error-detection performance on a larger collection of data.
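The idea of treating singleton clusters as error candidates can be illustrated with a minimal sketch. The abstract does not describe the paper's algorithm, so the code below substitutes a generic single-pass "leader" clustering with a Euclidean distance threshold; the function names, the sample records, and the threshold value are all assumptions made for illustration.

```python
from math import dist


def leader_clusters(records, threshold):
    """Single-pass leader clustering (illustrative, not the paper's method).

    Each record joins the first cluster whose leader (its first member)
    lies within `threshold`; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of record indices.
    """
    clusters = []
    for i, rec in enumerate(records):
        for cluster in clusters:
            if dist(rec, records[cluster[0]]) <= threshold:
                cluster.append(i)
                break
        else:
            # No existing leader is close enough: start a new cluster.
            clusters.append([i])
    return clusters


def singleton_suspects(records, threshold):
    """Return indices of records that end up alone in a cluster."""
    return [c[0] for c in leader_clusters(records, threshold) if len(c) == 1]


# Hypothetical numeric records; the fourth has an implausible first field.
records = [(1.0, 2.0), (1.1, 2.1), (0.9, 1.9), (50.0, 2.0), (5.0, 5.0), (5.1, 4.9)]
suspects = singleton_suspects(records, threshold=1.0)
print(suspects)
```

A flagged record is only a candidate for review, since a singleton may also be a valid but unusual entry, and the result depends on the chosen threshold and on the order in which records are processed.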

Keywords: Clustering, data errors, data validation

Review history: Received 25 October 1989, Revised 1 February 1990, Available online 10 June 2003.

Official paper URL: https://doi.org/10.1016/0306-4379(90)90026-L