(α, k)-anonymous data publishing

作者:Raymond Wong, Jiuyong Li, Ada Fu, Ke Wang

摘要

Privacy preservation is an important issue in the release of data for mining purposes. The k-anonymity model has been introduced for protecting individual identification. Recent studies show that a more sophisticated model is necessary to protect the association of individuals to sensitive information. In this paper, we propose an (α, k)-anonymity model to protect both identifications and relationships to sensitive information in data. We discuss the properties of (α, k)-anonymity model. We prove that the optimal (α, k)-anonymity problem is NP-hard. We first present an optimal global-recoding method for the (α, k)-anonymity problem. Next we propose two scalable local-recoding algorithms which are both more scalable and result in less data distortion. The effectiveness and efficiency are shown by experiments. We also describe how the model can be extended to more general cases.

论文关键词:Privacy, Data mining, Anonymity, Privacy preservation, Data publishing

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-008-0075-2