Vertical federated learning-based feature selection with non-overlapping sample utilization

作者:

Highlights:

• In this paper, we bridge this gap by proposing a novel VFL-based feature selection method—Vertical Federated Learning-based Feature Selection (VFLFS). To the best of our knowledge, this is the first deep learning-based vertical federated learning approach with feature selection.

• A strategy to make use of non-overlapping samples is also proposed to improve feature selection effectiveness.

• The proposed VFLFS approach has been evaluated extensively based on real-world datasets. The results show that VFLFS can significantly improve model performance under VFL settings compared to four state of the art baselines, especially in conditions where a large proportion of the data samples do not overlap across data owners.

摘要

•In this paper, we bridge this gap by proposing a novel VFL-based feature selection method—Vertical Federated Learning-based Feature Selection (VFLFS). To the best of our knowledge, this is the first deep learning-based vertical federated learning approach with feature selection.•A strategy to make use of non-overlapping samples is also proposed to improve feature selection effectiveness.•The proposed VFLFS approach has been evaluated extensively based on real-world datasets. The results show that VFLFS can significantly improve model performance under VFL settings compared to four state of the art baselines, especially in conditions where a large proportion of the data samples do not overlap across data owners.

论文关键词:Feature selection,Vertical federated learning,Privacy protection,Deep learning

论文评审过程:Received 17 February 2022, Revised 13 June 2022, Accepted 5 July 2022, Available online 13 July 2022, Version of Record 18 July 2022.

论文官网地址:https://doi.org/10.1016/j.eswa.2022.118097