Vertical Federated Learning Hybrid Local Pre-training
Abstract: Vertical Federated Learning (VFL), which has a broad range of real-world applications, has received much attention in both academia and industry. Enterprises aspire to exploit more valuable features of the same users from diverse departments to boost their model prediction skills. VFL addresses this demand and concurrently secures individual parties from exposing their raw data. However, conventional VFL encounters a bottleneck as it only leverages aligned samples, whose size shrinks with more parties involved, resulting in data scarcity and the waste of unaligned data. To address this problem, we propose a novel VFL Hybrid Local Pre-training (VFLHLP) approach. VFLHLP first pre-trains local networks on the local data of participating parties. Then it utilizes these pre-trained networks to adjust the sub-model for the labeled party or enhance representation learning for other parties during downstream federated learning on aligned data, boosting the performance of federated models. The experimental results on real-world advertising datasets, demonstrate that our approach achieves the best performance over baseline methods by large margins. The ablation study further illustrates the contribution of each technique in VFLHLP to its overall performance.
- Federated learning: Challenges, methods, and future directions. IEEE signal processing magazine, 37(3):50–60, 2020a.
- A survey on federated learning. Knowledge-Based Systems, 216:106775, 2021a.
- Vertical federated learning. arXiv preprint arXiv:2211.12814, 2022.
- Vertical federated learning: Challenges, methodologies and experiments. arXiv preprint arXiv:2202.04309, 2022.
- A vertical federated learning method for interpretable scorecard and its application in credit scoring. arXiv preprint arXiv:2009.06218, 2020.
- Confederated machine learning on horizontally and vertically separated medical data for large-scale health system intelligence. arXiv preprint arXiv:1910.02109, 2019a.
- Vertical federated learning based privacy-preserving cooperative sensing in cognitive radio networks. In 2020 IEEE Globecom Workshops (GC Wkshps, pages 1–6. IEEE, 2020.
- Fedcvt: Semi-supervised vertical federated learning with cross-view training. ACM Transactions on Intelligent Systems and Technology (TIST), 13(4):1–16, 2022.
- Self-supervised vertical federated learning. In Workshop on Federated Learning: Recent Advances and New Challenges (in Conjunction with NeurIPS 2022), 2022a.
- A hybrid self-supervised learning framework for vertical federated learning. arXiv preprint arXiv:2208.08934, 2022.
- A communication efficient collaborative learning framework for distributed features. arXiv preprint arXiv:1912.11187, 2019b.
- Asysqn: Faster vertical federated learning algorithms with better computation resource utilization. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 3917–3927, 2021b.
- Communication-efficient vertical federated learning. Algorithms, 15(8):273, 2022.
- Efficient asynchronous vertical federated learning via gradient prediction and double-end sparse compression. In 2020 16th international conference on control, automation, robotics and vision (ICARCV), pages 291–296. IEEE, 2020b.
- Efficient batch homomorphic encryption for vertically federated xgboost. arXiv preprint arXiv:2112.04261, 2021.
- Compressed-vfl: Communication-efficient learning with vertically partitioned data. In International Conference on Machine Learning, pages 2738–2766. PMLR, 2022b.
- Semi-supervised cross-silo advertising with partial knowledge transfer. arXiv e-prints, pages arXiv–2205, 2022.
- Secure and efficient federated transfer learning. In 2019 IEEE international conference on big data (Big Data), pages 2569–2576. IEEE, 2019.
- Siwei Feng. Vertical federated learning-based feature selection with non-overlapping sample utilization. Expert Systems with Applications, 208:118097, 2022.
- Multi-view federated learning with data collaboration. In 2022 14th International Conference on Machine Learning and Computing (ICMLC), pages 178–183, 2022.
- Vertical federated knowledge transfer via representation distillation. In FL-IJCAI workshop, 2022.
- Semi-supervised federated heterogeneous transfer learning. Knowledge-Based Systems, 252:109384, 2022.
- A survey on contrastive self-supervised learning. Technologies, 9(1):2, 2020.
- Self-supervised learning: Generative or contrastive. IEEE transactions on knowledge and data engineering, 35(1):857–876, 2021.
- Pseudo-labeling and confirmation bias in deep semi-supervised learning. In 2020 International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, 2020.
- Vime: Extending the success of self-and semi-supervised learning to tabular domain. Advances in Neural Information Processing Systems, 33:11033–11043, 2020.
- Tabnet: Attentive interpretable tabular learning. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 6679–6687, 2021.
- Subtab: Subsetting features of tabular data for self-supervised representation learning. Advances in Neural Information Processing Systems, 34:18853–18865, 2021.
- Scarf: Self-supervised contrastive learning using random feature corruption. arXiv preprint arXiv:2106.15147, 2021.
- Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2):1–19, 2019.
- Contrastive representation learning: A framework and review. Ieee Access, 8:193907–193934, 2020.
- Click-through rate prediction. kaggle., 2014. URL https://kaggle.com/competitions/avazu-ctr-prediction.
- Criteo Challenge. Criteo display advertising challenge, 2014. URL https://www.kaggle.com/datasets/mrkmakr/criteo-dataset. Access on 28 Oct, 2022.
- Exploring simple siamese representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 15750–15758, 2021.
- Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems, 33:21271–21284, 2020.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9729–9738, 2020.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.