Attribute susceptibility and entropy based data anonymization to improve users community privacy and utility in publishing data
作者:Abdul Majeed, Sungchang Lee
摘要
User attributes affect community (i.e., a group of people with some common properties/attributes) privacy in users’ data publishing because some attributes may expose multiple users’ identities and their associated sensitive information during published data analysis. User attributes such as gender, age, and race, may allow an adversary to form users’ communities based on their values, and launch sensitive information inference attack subsequently. As a result, explicit disclosure of private information of a specific users’ community can occur from the privacy preserved published data. Each item of user attributes impacts users’ community privacy differently, and some types of attributes are highly susceptible. More susceptible types of attributes enable multiple users’ unique identifications and sensitive information inferences more easily, and their presence in published data increases users’ community privacy risks. Most of the existing privacy models ignore the impact of susceptible attributes on user’s community privacy and they mainly focus on preserving the individual privacy in the released data. This paper presents a novel data anonymization algorithm that significantly improves users’ community privacy without sacrificing the guarantees on anonymous data utility in publishing data. The proposed algorithm quantifies the susceptibility of each attribute present in user’s dataset to effectively preserve users’ community privacy. Data generalization is performed adaptively by considering both user attributes’ susceptibility and entropy simultaneously. The proposed algorithm controls over-generalization of the data to enhance anonymous data utility for the legitimate information consumers. Due to the widespread applications of social networks (SNs), we focused on the SN users’ community privacy preserved and utility enhanced anonymous data publishing. The simulation results obtained from extensive experiments, and comparisons with the existing algorithms show the effectiveness of the proposed algorithm and verify the aforementioned claims.
论文关键词:User attributes, Anonymization, Susceptibility, Entropy, Community privacy, Utility, Social network
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10489-020-01656-w