Deployment choice: densely populated accelerator servers vs. homogeneous per-server devices
Determine whether cloud providers should deploy densely populated accelerator servers or attach homogeneous accelerator devices per server to optimize total cost of ownership and sustainability while maintaining performance isolation and manageability for accelerator-as-a-service at cloud scale.
References
For example, using densely populated accelerator servers or attaching homogeneous accelerator devices per server is an open question.
— Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild
(2407.10098 - Zhao et al., 2024) in Section 6, Accelerator cost and cloud-scale management