Exact Instruction Fetch Scheduler Policy in Modern NVIDIA GPUs
Determine the exact instruction fetch scheduler policy used by NVIDIA Ampere Streaming Multiprocessor sub-cores, including the precise rules for warp selection and switching, how Instruction Buffer occupancy constrains fetch, and how the fetch scheduler coordinates with the issue scheduler.
References
We could not confirm the exact instruction fetch policy with our experiments, but it has to be similar to the issue policy; otherwise, the condition of not finding a valid instruction in the Instruction Buffer would happen relatively often, and we have not observed this in our experiments.
— Analyzing Modern NVIDIA GPU cores
(2503.20481 - Huerta et al., 26 Mar 2025) in Section 5.2 (Front-end)