Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Convergence in Federated Learning: A Contribution-Aware Asynchronous Approach (2402.10991v4)

Published 16 Feb 2024 in cs.LG and cs.AI

Abstract: Federated Learning (FL) is a distributed machine learning paradigm that allows clients to train models on their data while preserving their privacy. FL algorithms, such as Federated Averaging (FedAvg) and its variants, have been shown to converge well in many scenarios. However, these methods require clients to upload their local updates to the server in a synchronous manner, which can be slow and unreliable in realistic FL settings. To address this issue, researchers have developed asynchronous FL methods that allow clients to continue training on their local data using a stale global model. However, most of these methods simply aggregate all of the received updates without considering their relative contributions, which can slow down convergence. In this paper, we propose a contribution-aware asynchronous FL method that takes into account the staleness and statistical heterogeneity of the received updates. Our method dynamically adjusts the contribution of each update based on these factors, which can speed up convergence compared to existing methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. Cafe: Carbon-aware federated learning in geographically distributed data centers. arXiv preprint arXiv:2311.03615, 2023a.
  2. Client clustering for energy-efficient clustered federated learning in wireless networks. In Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing, pages 718–723, 2023.
  3. Mobility improves the convergence of asynchronous federated learning. arXiv preprint arXiv:2206.04742, 2022.
  4. Federated learning with instance-dependent noisy labels. arXiv preprint arXiv:2312.10324, 2023a.
  5. Accelerating hybrid federated learning convergence under partial participation. arXiv preprint arXiv:2304.05397, 2023b.
  6. Accelerating hybrid federated learning convergence under partial participation. arXiv preprint arXiv:2304.05397, 2023c.
  7. Particle filter slam for vehicle localization. arXiv preprint arXiv:2402.07429, 2024a.
  8. News recommendation with attention mechanism. arXiv preprint arXiv:2402.07422, 2024b.
  9. Financial time-series forecasting: Towards synergizing performance and interpretability within a hybrid machine learning approach. arXiv preprint arXiv:2401.00534, 2023.
  10. Decoding sentiments: Enhancing covid-19 tweet analysis through bert-rcnn fusion. Journal of Theory and Practice of Engineering Science, 4(01):86–93, 2024.
  11. Et-dm: Text to image via diffusion model with efficient transformer. Displays, 80:102568, 2023.
  12. Deep learning in photovoltaic power generation forecasting: Cnn-lstm hybrid neural network exploration and research. In The 3rd International scientific and practical conference "Technologies in education in schools and universities"(January 23-26, 2024) Athens, Greece. International Science Group. 2024. 363 p., page 295, 2024.
  13. Unveiling the future navigating next-generation ai frontiers and innovations in application. International Journal of Computer Science and Information Technology, 1(1):147–156, 2023b.
  14. Optimizing science question ranking through model and retrieval-augmented generation. International Journal of Computer Science and Information Technology, 1(1):124–130, 2023.
  15. Deepgi: An automated approach for gastrointestinal tract segmentation in mri scans. arXiv preprint arXiv:2401.15354, 2024.
  16. Will_go at semeval-2020 task 3: An accurate model for predicting the (graded) effect of context in word similarity based on bert. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 301–306, 2020.
  17. Illumicore: Optimization modeling and implementation for efficient vnf placement. In 2021 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), pages 1–7. IEEE, 2021.
  18. Edgegym: A reinforcement learning environment for constraint-aware nfv resource allocation. In 2023 IEEE 2nd International Conference on AI in Cybersecurity (ICAIC), pages 1–7. IEEE, 2023.
  19. Optimal resource allocation in sdn/nfv-enabled networks via deep reinforcement learning. In 2022 IEEE Ninth International Conference on Communications and Networking (ComNet), pages 1–7. IEEE, 2022.
  20. Application of machine learning in financial risk early warning and regional prevention and control: A systematic analysis based on shap. WORLD TRENDS, REALITIES AND ACCOMPANYING PROBLEMS OF DEVELOPMENT, 331, 2023.
  21. Smartfix: Leveraging machine learning for proactive equipment maintenance in industry 4.0. In The 2nd International scientific and practical conference "Innovations in education: prospects and challenges of today"(January 16-19, 2024) Sofia, Bulgaria. International Science Group. 2024. 389 p., page 313, 2024.
  22. Automatic recognition of static phenomena in retouched images: A novel approach. In The 1st International scientific and practical conference "Advanced technologies for the implementation of new ideas"(January 09-12, 2024) Brussels, Belgium. International Science Group. 2024. 349 p., page 287, 2024.
  23. Graph convolutional network with sample and feature weights for alzheimer’s disease diagnosis. Information Processing & Management, 59(4):102952, 2022.
  24. Dual-graph learning convolutional networks for interpretable alzheimer’s disease diagnosis. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 406–415. Springer, 2022.
  25. Mrmrp: multi-source review-based model for rating prediction. In Database Systems for Advanced Applications: 25th International Conference, DASFAA 2020, Jeju, South Korea, September 24–27, 2020, Proceedings, Part II 25, pages 20–35. Springer, 2020.
  26. Binary code summarization: Benchmarking chatgpt/gpt-4 and other large language models. arXiv preprint arXiv:2312.09601, 2023a.
  27. Prometheus: Infrastructure security posture analysis with ai-generated attack graphs. arXiv preprint arXiv:2312.13119, 2023b.
  28. Hengyi Zang. Precision calibration of industrial 3d scanners: An ai-enhanced approach for improved measurement accuracy. Global Academic Frontiers, 2(1):27–37, 2024.
  29. Transfer-learning-based network traffic automatic generation framework. In 2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP), pages 851–854. IEEE, 2021.
  30. Implementation of computer vision technology based on artificial intelligence for medical image analysis. International Journal of Computer Science and Information Technology, 1(1):69–76, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Changxin Xu (6 papers)
  2. Yuxin Qiao (10 papers)
  3. Zhanxin Zhou (1 paper)
  4. Fanghao Ni (4 papers)
  5. Jize Xiong (3 papers)
Citations (2)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets