Some experiments in the use of clustering for data validation
作者:
Highlights:
•
摘要
Previous work has demonstrated the feasibility of using clustering to detect errors in small databases. Singleton clusters, containing only one record, often represent errors. Use of a new clustering algorithm oriented to this application provided improvements in time complexity without degradation of the error detection performance in a larger collection of data.
论文关键词:Clustering,data errors,data validation
论文评审过程:Received 25 October 1989, Revised 1 February 1990, Available online 10 June 2003.
论文官网地址:https://doi.org/10.1016/0306-4379(90)90026-L