Watch Your Head: Assembling Projection Heads to Save the Reliability of Federated Models
Abstract: Federated learning encounters substantial challenges with heterogeneous data, leading to performance degradation and convergence issues. While considerable progress has been achieved in mitigating such an impact, the reliability aspect of federated models has been largely disregarded. In this study, we conduct extensive experiments to investigate the reliability of both generic and personalized federated models. Our exploration uncovers a significant finding: \textbf{federated models exhibit unreliability when faced with heterogeneous data}, demonstrating poor calibration on in-distribution test data and low uncertainty levels on out-of-distribution data. This unreliability is primarily attributed to the presence of biased projection heads, which introduce miscalibration into the federated models. Inspired by this observation, we propose the "Assembled Projection Heads" (APH) method for enhancing the reliability of federated models. By treating the existing projection head parameters as priors, APH randomly samples multiple initialized parameters of projection heads from the prior and further performs targeted fine-tuning on locally available data under varying learning rates. Such a head ensemble introduces parameter diversity into the deterministic model, eliminating the bias and producing reliable predictions via head averaging. We evaluate the effectiveness of the proposed APH method across three prominent federated benchmarks. Experimental results validate the efficacy of APH in model calibration and uncertainty estimation. Notably, APH can be seamlessly integrated into various federated approaches but only requires less than 30\% additional computation cost for 100$\times$ inferences within large models.
- Federated learning based on dynamic regularization. arXiv preprint arXiv:2111.04263.
- Federated learning with personalization layers. arXiv preprint arXiv:1912.00818.
- On Bridging Generic and Personalized Federated Learning for Image Classification. In International Conference on Learning Representations.
- Exploiting shared representations for personalized federated learning. In International conference on machine learning, 2089–2099. PMLR.
- Personalized federated learning with theoretical guarantees: A model-agnostic meta-learning approach. Advances in Neural Information Processing Systems, 33: 3557–3568.
- Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning, 1050–1059. PMLR.
- Feddc: Federated learning with non-iid data via local drift decoupling and correction. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 10112–10121.
- Graves, A. 2011. Practical variational inference for neural networks. Advances in neural information processing systems, 24.
- On calibration of modern neural networks. In International conference on machine learning, 1321–1330. PMLR.
- Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261.
- Natural adversarial examples. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 15262–15271.
- Measuring the effects of non-identical data distribution for federated visual classification. arXiv preprint arXiv:1909.06335.
- Personalized cross-silo federated learning on non-iid data. In Proceedings of the AAAI conference on artificial intelligence, volume 35, 7865–7873.
- Neural tangent kernel: Convergence and generalization in neural networks. Advances in neural information processing systems, 31.
- Advances and open problems in federated learning. Foundations and Trends® in Machine Learning, 14(1–2): 1–210.
- Scaffold: Stochastic controlled averaging for federated learning. In International Conference on Machine Learning, 5132–5143. PMLR.
- Learning multiple layers of features from tiny images.
- Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems, 30.
- Tiny imagenet visual recognition challenge. CS 231N, 7(7): 3.
- Preservation of the global knowledge by not-true distillation in federated learning. Advances in Neural Information Processing Systems, 35: 38461–38474.
- Towards fully autonomous driving: Systems and algorithms. In 2011 IEEE intelligent vehicles symposium (IV), 163–168. IEEE.
- Federated learning on non-iid data silos: An experimental study. In 2022 IEEE 38th International Conference on Data Engineering (ICDE), 965–978. IEEE.
- Model-contrastive federated learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10713–10722.
- Ditto: Fair and robust federated learning through personalization. In International Conference on Machine Learning, 6357–6368. PMLR.
- Federated learning: Challenges, methods, and future directions. IEEE signal processing magazine, 37(3): 50–60.
- Federated optimization in heterogeneous networks. Proceedings of Machine Learning and Systems, 2: 429–450.
- Federated optimization in heterogeneous networks. Proceedings of Machine learning and systems, 2: 429–450.
- Understanding the disharmony between dropout and batch normalization by variance shift. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2682–2690.
- Ensemble distillation for robust model fusion in federated learning. Advances in Neural Information Processing Systems, 33: 2351–2363.
- Structured and efficient variational deep learning with matrix gaussian posteriors. In International conference on machine learning, 1708–1716. PMLR.
- No fear of heterogeneity: Classifier calibration for federated learning with non-iid data. Advances in Neural Information Processing Systems, 34: 5972–5984.
- A simple baseline for bayesian uncertainty in deep learning. Advances in neural information processing systems, 32.
- Communication-efficient learning of deep networks from decentralized data. In Artificial intelligence and statistics, 1273–1282. PMLR.
- Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Scientific reports, 6(1): 1–10.
- Obtaining well calibrated probabilities using bayesian binning. In Proceedings of the AAAI conference on artificial intelligence, volume 29.
- Reading digits in natural images with unsupervised feature learning.
- Can you trust your model’s uncertainty? evaluating predictive uncertainty under dataset shift. Advances in neural information processing systems, 32.
- A Consistent and Differentiable Lpsubscript𝐿𝑝L_{p}italic_L start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT Canonical Calibration Error Estimator. In Advances in Neural Information Processing Systems.
- Deep bayesian bandits showdown: An empirical comparison of bayesian deep networks for thompson sampling. arXiv preprint arXiv:1802.09127.
- Evidential deep learning to quantify classification uncertainty. Advances in neural information processing systems, 31.
- Federated multi-task learning. Advances in neural information processing systems, 30.
- Personalized federated learning with moreau envelopes. Advances in Neural Information Processing Systems, 33: 21394–21405.
- Single model uncertainty estimation via stochastic data centering. Advances in Neural Information Processing Systems, 35: 8662–8674.
- Uncertainty estimation using a single deep deterministic neural network. In International conference on machine learning, 9690–9700. PMLR.
- Federated learning with differential privacy: Algorithms and performance analysis. IEEE Transactions on Information Forensics and Security, 15: 3454–3469.
- Bayesian learning via stochastic gradient Langevin dynamics. In Proceedings of the 28th international conference on machine learning (ICML-11), 681–688.
- Personalized Federated Learning with Feature Alignment and Classifier Collaboration. In The Eleventh International Conference on Learning Representations.
- Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2): 1–19.
- Fedala: Adaptive local aggregation for personalized federated learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, 11237–11244.
- Personalized Federated Learning with First Order Model Optimization. In International Conference on Learning Representations.
- Federated learning on non-IID data: A survey. Neurocomputing, 465: 371–390.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.