Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting
Abstract: One-shot Federated Learning (OFL) has become a promising learning paradigm, enabling the training of a global server model via a single communication round. In OFL, the server model is aggregated by distilling knowledge from all client models (the ensemble), which are also responsible for synthesizing samples for distillation. In this regard, advanced works show that the performance of the server model is intrinsically related to the quality of the synthesized data and the ensemble model. To promote OFL, we introduce a novel framework, Co-Boosting, in which synthesized data and the ensemble model mutually enhance each other progressively. Specifically, Co-Boosting leverages the current ensemble model to synthesize higher-quality samples in an adversarial manner. These hard samples are then employed to promote the quality of the ensemble model by adjusting the ensembling weights for each client model. Consequently, Co-Boosting periodically achieves high-quality data and ensemble models. Extensive experiments demonstrate that Co-Boosting can substantially outperform existing baselines under various settings. Moreover, Co-Boosting eliminates the need for adjustments to the client's local training, requires no additional data or model transmission, and allows client models to have heterogeneous architectures.
- Federated learning based on dynamic regularization. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=B7v4QMR6Z9w.
- Improving generalization in federated learning by seeking flat minima. In European Conference on Computer Vision, pp. 654–672. Springer, 2022.
- Learning style-invariant robust representation for generalizable visual instance retrieval. In Proceedings of the 31st ACM International Conference on Multimedia, pp. 6171–6180, 2023a.
- Domain generalized stereo matching via hierarchical visual transformation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9559–9568, June 2023b.
- Data-free learning of student networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3514–3522, 2019.
- On the importance and applicability of pre-training for federated learning. In The Eleventh International Conference on Learning Representations, 2022.
- Dispfl: Towards communication-efficient personalized federated learning via decentralized sparse training. In International Conference on Machine Learning, pp. 4587–4604. PMLR, 2022.
- Fedgamma: Federated learning with global sharpness-aware minimization. IEEE Transactions on Neural Networks and Learning Systems, pp. 1–14, 2023. doi: 10.1109/TNNLS.2023.3304453.
- Heterogeneity for the win: One-shot federated clustering. In International Conference on Machine Learning, pp. 2611–2620. PMLR, 2021.
- Heterofl: Computation and communication efficient federated learning for heterogeneous clients. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=TNkPBBYFkXg.
- Towards addressing label skews in one-shot federated learning. In The Eleventh International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=rzrqh85f4Sc.
- What can be transferred: Unsupervised domain adaptation for endoscopic lesions segmentation. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 4023–4032, 2020.
- Domain-adversarial training of neural networks. The journal of machine learning research, 17(1):2096–2030, 2016.
- Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572, 2014.
- One-shot federated learning. arXiv preprint arXiv:1902.11175, 2019.
- Deep residual learning for image recognition. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 770–778, 2016.
- Data-free one-shot federated learning under very high statistical heterogeneity. In The Eleventh International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=_hb4vM3jspB.
- Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
- Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314–1324, 2019.
- Advances and open problems in federated learning. Foundations and Trends® in Machine Learning, 14(1–2):1–210, 2021.
- Scaffold: Stochastic controlled averaging for federated learning. In International Conference on Machine Learning, pp. 5132–5143. PMLR, 2020.
- Learning multiple layers of features from tiny images. Master thesis, 2009.
- Ya Le and Xuan Yang. Tiny imagenet visual recognition challenge. CS 231N, 7(7):3, 2015.
- Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- Gradient harmonized single-stage detector. In Proceedings of the AAAI conference on artificial intelligence, volume 33, pp. 8577–8584, 2019.
- Deeper, broader and artier domain generalization. In Proceedings of the IEEE international conference on computer vision, pp. 5542–5550, 2017.
- Hard sample matters a lot in zero-shot quantization. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 24417–24426, 2023a.
- Practical one-shot federated learning for cross-silo setting. In Zhi-Hua Zhou (ed.), Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, pp. 1484–1490. International Joint Conferences on Artificial Intelligence Organization, 8 2021. doi: 10.24963/ijcai.2021/205. URL https://doi.org/10.24963/ijcai.2021/205. Main Track.
- Federated learning: Challenges, methods, and future directions. IEEE signal processing magazine, 37(3):50–60, 2020a.
- Federated optimization in heterogeneous networks. Proceedings of Machine Learning and Systems, 2:429–450, 2020b.
- Federated domain generalization: A survey. arXiv preprint arXiv:2306.01334, 2023b.
- Ensemble distillation for robust model fusion in federated learning. Advances in Neural Information Processing Systems, 33:2351–2363, 2020.
- Ppgan: Privacy-preserving generative adversarial network. In 2019 IEEE 25Th international conference on parallel and distributed systems (ICPADS), pp. 985–989. IEEE, 2019.
- Shufflenet v2: Practical guidelines for efficient cnn architecture design. In Proceedings of the European conference on computer vision (ECCV), pp. 116–131, 2018.
- Communication-efficient learning of deep networks from decentralized data. In Artificial Intelligence and statistics, pp. 1273–1282. PMLR, 2017.
- A survey on security and privacy of federated learning. Future Generation Computer Systems, 115:619–640, 2021.
- Reading digits in natural images with unsupervised feature learning. 2011.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019.
- Sade: A self-adaptive expert for multi-dataset question answering. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. IEEE, 2023.
- Deep co-training for semi-supervised image recognition. In Proceedings of the european conference on computer vision (eccv), pp. 135–152, 2018.
- Generalized federated learning via sharpness aware minimization. In International Conference on Machine Learning, pp. 18250–18280. PMLR, 2022.
- Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
- Emotion-prior awareness network for emotional video captioning. In Proceedings of the 31st ACM International Conference on Multimedia, pp. 589–600, 2023.
- Virtual homogeneity learning: Defending against data heterogeneity in federated learning. In International Conference on Machine Learning, pp. 21111–21132. PMLR, 2022.
- Diversity can be transferred: Output diversification for white-and black-box attacks. Advances in neural information processing systems, 33:4536–4548, 2020.
- Modeldb: a system for machine learning model management. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, pp. 1–3, 2016.
- Man-in-the-middle attacks against machine learning classifiers via malicious generative models. IEEE Transactions on Dependable and Secure Computing, 18(5):2074–2087, 2021. doi: 10.1109/TDSC.2020.3021008.
- Dafkd: Domain-aware federated knowledge distillation. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 20412–20421, 2023.
- Tackling the objective inconsistency problem in heterogeneous federated optimization. Advances in neural information processing systems, 33:7611–7623, 2020.
- Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747, 2017.
- Exploring one-shot semi-supervised federated learning with a pre-trained diffusion model. arXiv preprint arXiv:2305.04063, 2023.
- Deconfounded video moment retrieval with causal intervention. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1–10, 2021.
- Video moment retrieval with cross-modal neural architecture search. IEEE Transactions on Image Processing, 31:1204–1216, 2022.
- Dreaming to distill: Data-free knowledge transfer via deepinversion. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 8715–8724, 2020.
- See through gradients: Image batch recovery via gradinversion. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 16337–16346, 2021.
- Dense: Data-free one-shot federated learning. Advances in Neural Information Processing Systems, 35:21414–21428, 2022a.
- Fine-tuning global model via data-free knowledge distillation for non-iid federated learning. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 10174–10183, 2022b.
- Federated domain generalization with generalization adjustment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3954–3963, 2023.
- Adversarial robustness through the lens of causality. In International Conference on Learning Representations, 2022c.
- Exploring sparse spatial relation in graph inference for text-based vqa. IEEE Transactions on Image Processing, 32:5060–5074, 2023. doi: 10.1109/TIP.2023.3310332.
- Distilled one-shot federated learning. arXiv preprint arXiv:2009.07999, 2020.
- Data-free knowledge distillation for heterogeneous federated learning. In International conference on machine learning, pp. 12878–12889. PMLR, 2021.
- When foundation model meets federated learning: Motivations, challenges, and future directions. arXiv preprint arXiv:2306.15546, 2023.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.