Scaling SCR multipathing to hundreds of paths on NVIDIA BlueField-3 DPA
Determine whether SCR (the "White-Boxing RDMA" approach implemented on NVIDIA BlueField-3 DPA) can scale its multipath transport to hundreds of distinct paths given the limited L1/L2 cache available on the DPA cores.
References
For multipathing, SCR only demonstrates two paths; given the limited L1/L2 cache in DPA, it is not clear if SCR could scale to hundreds of paths.
— An Extensible Software Transport Layer for GPU Networking
(2504.17307 - Zhou et al., 24 Apr 2025) in Section 7, Other Related Work