Extend Shadowheart SGD to statistical heterogeneity
Extend the Shadowheart SGD framework and analysis to the setting with statistical heterogeneity across workers (non-identically distributed data), while preserving the model of arbitrary computation and communication heterogeneity and compressed, asynchronous centralized training.
References
Due to our in-depth focus on device heterogeneity and the challenges that need to be overcome, we do not consider statistical heterogeneity, and leave an extension to this setup to future work.
                — Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity
                
                (2402.04785 - Tyurin et al., 7 Feb 2024) in Section 1 (Introduction)