Impact of limited FP64 performance on observed GPU results
Determine the extent to which limited double-precision (FP64) support on the GPU contributed to the observed performance outcomes of AoS-to-SoA annotated SPH kernels.
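One possible way to approach this question is to instantiate the same kernel once in single precision and once in double precision and compare the runtimes. The sketch below illustrates that idea on the host with a toy 1D density summation over an SoA container. It is not the kernel or data layout from the paper, and all names (ParticlesSoA, compute_density, make_particles, time_density_seconds) are hypothetical. On a GPU, the same templated kernel would need to be compiled for offloading and timed on the device to obtain a meaningful FP32/FP64 ratio.

```cpp
#include <chrono>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical SoA particle container, templated on the scalar type so the
// same kernel can be instantiated for float (FP32) and double (FP64).
template <typename Real>
struct ParticlesSoA {
    std::vector<Real> x, mass, density;
};

// Toy 1D density summation with a simple compact-support weight.
// A stand-in for an SPH kernel, not the kernels used in the paper.
template <typename Real>
void compute_density(ParticlesSoA<Real>& p, Real h) {
    const std::size_t n = p.x.size();
    for (std::size_t i = 0; i < n; ++i) {
        Real rho = Real(0);
        for (std::size_t j = 0; j < n; ++j) {
            const Real q = std::abs(p.x[i] - p.x[j]) / h;
            if (q < Real(1)) {
                const Real w = Real(1) - q;
                rho += p.mass[j] * w * w / h;
            }
        }
        p.density[i] = rho;
    }
}

// Fill n equally spaced particles on [0, 1) with total mass 1.
template <typename Real>
ParticlesSoA<Real> make_particles(std::size_t n) {
    ParticlesSoA<Real> p;
    p.x.resize(n);
    p.mass.assign(n, Real(1) / Real(n));
    p.density.assign(n, Real(0));
    for (std::size_t i = 0; i < n; ++i) {
        p.x[i] = Real(i) / Real(n);
    }
    return p;
}

// Wall-clock time of one density pass, in seconds.
template <typename Real>
double time_density_seconds(ParticlesSoA<Real>& p, Real h) {
    const auto start = std::chrono::steady_clock::now();
    compute_density(p, h);
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(stop - start).count();
}
```

Calling time_density_seconds on make_particles<float>(n) and make_particles<double>(n) with the same n and h gives two timings whose ratio indicates how much the precision choice alone costs on a given device. Comparing the resulting density arrays shows whether FP32 would be accurate enough for the kernel in question.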
References
It is also not clear what role the inferior support for FP64 on the GPU plays for our experiments.
— Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++
(2502.16517 - Radtke et al., 23 Feb 2025) in Section 6.4 Results — GPU offloading