Offloading Fine-Grained Execution Units (Threads) Across CPUs and DPUs
Develop a general mechanism to offload fine-grained execution units—specifically threads—across CPUs and DPUs within the HeteroPod architecture and its OS Overlay, enabling cross-PU execution at thread-level granularity while preserving existing cloud-native application semantics and requiring no application modifications.
References
Offloading more fine-grained granularity, e.g., threads, is still an open challenge for future work.
— HeteroPod: XPU-Accelerated Infrastructure Offloading for Commodity Cloud-Native Applications
(2503.23952 - Yang et al., 31 Mar 2025) in Subsection "Compatibility", Limitations paragraph