Full automation of the tensor core model refinement algorithm
Develop a fully automated version of the approximation-and-refinement procedure (Algorithm 1 in Section 3.3) for determining accurate models of NVIDIA GPU tensor core matrix multipliers. Automation would eliminate the current manual step, in which mismatches between hardware and model outputs are inspected and model features are modified by hand, so that the procedure iterates autonomously until the model agrees bit for bit with the hardware.
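To make the shape of such a loop concrete, here is a minimal Python sketch of a compare-and-refine iteration. It is not Algorithm 1 from the paper: the fixed-point dot product, the two features (`extra_bits`, `rounding`), the feature space, and the simulated hardware are all hypothetical stand-ins chosen for illustration. The sketch also sidesteps the step the source identifies as manual, because it searches a fixed, pre-enumerated feature space rather than deriving new features from observed mismatches.

```python
from itertools import product
import random

FRAC_BITS = 8  # fractional bits of the toy fixed-point format (assumption)

# Hypothetical feature space for the toy model; not the paper's feature set.
FEATURE_SPACE = {
    "extra_bits": [0, 1, 2, 3],           # extra accumulator fraction bits
    "rounding": ["truncate", "nearest"],  # how the final sum is reduced
}

def toy_dot(a, b, extra_bits, rounding):
    """Toy fixed-point dot product controlled by two model features."""
    shift = FRAC_BITS - extra_bits
    # Each exact product is truncated to FRAC_BITS + extra_bits fraction bits.
    acc = sum((x * y) >> shift for x, y in zip(a, b))
    # The accumulated sum is then reduced back to FRAC_BITS fraction bits.
    half = (1 << extra_bits) >> 1 if rounding == "nearest" else 0
    return (acc + half) >> extra_bits

def candidates():
    """Enumerate every feature combination in a fixed order."""
    names = list(FEATURE_SPACE)
    for values in product(*(FEATURE_SPACE[n] for n in names)):
        yield dict(zip(names, values))

def first_mismatch(hardware, features, tests):
    """Return the first input where model and hardware disagree, else None."""
    for a, b in tests:
        if toy_dot(a, b, **features) != hardware(a, b):
            return a, b
    return None

def refine(hardware, tests):
    """Try candidate models until one agrees with the hardware on all tests."""
    for features in candidates():
        if first_mismatch(hardware, features, tests) is None:
            return features
    return None  # feature space exhausted: it would need to be extended

def simulated_hardware(a, b):
    """Stand-in for the device under test, with hidden feature values."""
    return toy_dot(a, b, extra_bits=2, rounding="nearest")

rng = random.Random(0)  # fixed seed so the test vectors are reproducible
tests = [([rng.randrange(256) for _ in range(4)],
          [rng.randrange(256) for _ in range(4)]) for _ in range(200)]
model = refine(simulated_hardware, tests)
```

In this toy setting, `refine` stops at the first candidate whose outputs match the simulated hardware on every test vector. A `None` result corresponds to the case where no known feature combination explains the hardware, which is where a human would currently have to inspect the mismatches and propose a new feature.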
References
However, the full automation of Algorithm 1 is an open problem which we leave for future research.
— Accurate Models of NVIDIA Tensor Cores
(2512.07004 - Khattak et al., 7 Dec 2025) in Section 3.3 (Matrix Multiplier Model Approximation and Refinement), after Algorithm 1