Fusion of FFT and GEMM kernels
Investigate and develop effective kernel fusion techniques that integrate the Fast Fourier Transform (FFT) and General Matrix Multiplication (GEMM) operations despite their mismatched data access patterns and memory layouts, enabling efficient end-to-end execution in workflows such as Fourier Neural Operators.
Sponsor
References
In contrast, fusing FFT and GEMM presents unique challenges due to their mismatched data access patterns and memory layouts, and remains an open area of research.
— TurboFNO: High-Performance Fourier Neural Operator with Fused FFT-GEMM-iFFT on GPU
(2504.11681 - Wu et al., 16 Apr 2025) in Section 1, Introduction