MMR: An algorithm for clustering categorical data using Rough Set Theory

作者:

Highlights:

摘要

A variety of cluster analysis techniques exist to group objects having similar characteristics. However, the implementation of many of these techniques is challenging due to the fact that much of the data contained in today’s databases is categorical in nature. While there have been recent advances in algorithms for clustering categorical data, some are unable to handle uncertainty in the clustering process while others have stability issues. This research proposes a new algorithm for clustering categorical data, termed Min–Min-Roughness (MMR), based on Rough Set Theory (RST), which has the ability to handle the uncertainty in the clustering process.

论文关键词:Cluster analysis,Categorical data,Rough Set Theory,Data mining

论文评审过程:Received 5 August 2006, Revised 18 April 2007, Accepted 29 May 2007, Available online 13 June 2007.

论文官网地址:https://doi.org/10.1016/j.datak.2007.05.005