From t-closeness to differential privacy and vice versa in data anonymization

作者:

Highlights:

摘要

k-anonymity and ε-differential privacy are two mainstream privacy models, the former introduced to anonymize data sets and the latter to limit the knowledge gain that results from including one individual in the data set. Whereas basic k-anonymity only protects against identity disclosure, t-closeness was presented as an extension of k-anonymity that also protects against attribute disclosure. We show here that, if not quite equivalent, t-closeness and ε-differential privacy are strongly related to one another when it comes to anonymizing data sets. Specifically, k-anonymity for the quasi-identifiers combined with ε-differential privacy for the confidential attributes yields stochastic t-closeness (an extension of t-closeness), with t a function of k and ε. Conversely, t-closeness can yield ε-differential privacy when and the assumptions made by t-closeness about the prior and posterior views of the data hold.

论文关键词:t-closeness,ε-differential privacy,Data anonymization,Statistical disclosure control,Syntactic anonymization,Semantic anonymization

论文评审过程:Received 8 July 2014, Revised 4 November 2014, Accepted 11 November 2014, Available online 20 November 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.11.011