MAGE: A semantics retaining K-anonymization method for mixed data
Authors:
Abstract
K-anonymity is an effective approach to protecting privacy when microdata are released for data mining. Microaggregation and generalization are two typical methods for implementing k-anonymity, but both have shortcomings when anonymizing mixed microdata. To address this problem, we propose a novel anonymization method, named MAGE, which retains more semantics than either generalization or microaggregation when dealing with mixed microdata. The idea of MAGE is to combine the mean vector of the numerical data with the generalized values of the categorical data to form a clustering centroid, and to use that centroid as the representative of the tuples in the corresponding cluster. We also propose an efficient algorithm, TSCKA, for anonymizing mixed data. Experimental results show that MAGE anonymizes mixed microdata effectively and that TSCKA achieves a better trade-off between data quality and algorithm efficiency than two well-known anonymization algorithms, Incognito and KACA.
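The centroid idea described in the abstract can be illustrated with a minimal sketch. It is not the paper's TSCKA algorithm: it assumes the clusters have already been formed, and the taxonomy `PARENT`, the root symbol `ROOT`, and the helper names `ancestors`, `generalize`, `mixed_centroid`, and `anonymize_cluster` are all hypothetical names introduced here for illustration. Numerical attributes are replaced by the cluster mean, and categorical attributes by their lowest common ancestor in the taxonomy.

```python
from statistics import mean

# Hypothetical generalization taxonomy (child -> parent); not from the paper.
ROOT = "*"
PARENT = {
    "Engineer": "Technical",
    "Programmer": "Technical",
    "Nurse": "Medical",
    "Doctor": "Medical",
    "Technical": ROOT,
    "Medical": ROOT,
}


def ancestors(value):
    """Return the path from value up to ROOT, starting with value itself."""
    path = [value]
    while path[-1] != ROOT:
        # Values missing from the taxonomy generalize directly to ROOT.
        path.append(PARENT.get(path[-1], ROOT))
    return path


def generalize(values):
    """Return the lowest common ancestor of the categorical values."""
    common = ancestors(values[0])
    for v in values[1:]:
        seen = set(ancestors(v))
        common = [node for node in common if node in seen]
    # Every path ends at ROOT, so common is never empty.
    return common[0]


def mixed_centroid(cluster, numeric_idx, categorical_idx):
    """Combine the numerical mean with generalized categorical values."""
    centroid = {}
    for i in numeric_idx:
        centroid[i] = mean(row[i] for row in cluster)
    for i in categorical_idx:
        centroid[i] = generalize([row[i] for row in cluster])
    return centroid


def anonymize_cluster(cluster, numeric_idx, categorical_idx):
    """Replace the listed attributes of every tuple with the centroid."""
    c = mixed_centroid(cluster, numeric_idx, categorical_idx)
    return [
        tuple(c.get(i, row[i]) for i in range(len(row)))
        for row in cluster
    ]


cluster = [(30, "Engineer"), (34, "Programmer"), (38, "Engineer")]
released = anonymize_cluster(cluster, numeric_idx=[0], categorical_idx=[1])
```

With this toy cluster of three tuples, every released tuple carries the mean age 34 and the generalized occupation `Technical`, so the three records are indistinguishable on these attributes.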
Keywords: K-anonymity, Generalization, Microaggregation, Privacy preservation
Article history: Received 15 July 2011, Revised 5 October 2013, Accepted 6 October 2013, Available online 17 October 2013.
Official URL: https://doi.org/10.1016/j.knosys.2013.10.009