Effect of alignment on non-numeric LLM-as-a-judge evaluations
Determine the effect of alignment (e.g., instruction tuning and preference tuning) on LLM-as-a-judge evaluations that use natural-language labels or ranking outputs rather than numerical scores.
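One possible way to start quantifying this, given here only as a minimal sketch and not as the method of the cited paper: collect the natural-language labels a judge emits before and after alignment, then compare the two label distributions. The label set, the sample outputs, and the helper names (`label_distribution`, `total_variation`) are hypothetical. Total variation distance is an assumed choice of metric.

```python
from collections import Counter

def label_distribution(labels, label_set):
    """Return the relative frequency of each label in label_set."""
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    total = len(labels)
    return {lab: counts.get(lab, 0) / total for lab in label_set}

def total_variation(p, q):
    """Total variation distance between two distributions over the same keys."""
    return 0.5 * sum(abs(p[k] - q[k]) for k in p)

# Hypothetical label set and judge outputs, for illustration only.
LABELS = ["poor", "fair", "good"]
base_outputs = ["poor", "fair", "good", "fair"]      # judge before alignment
aligned_outputs = ["good", "good", "good", "fair"]   # judge after alignment

base_dist = label_distribution(base_outputs, LABELS)
aligned_dist = label_distribution(aligned_outputs, LABELS)
shift = total_variation(base_dist, aligned_dist)
print(f"Label distribution shift (TV distance): {shift:.2f}")
```

A large shift on identical inputs would suggest that alignment changes which labels the judge favors. Whether that change is a bias would still need to be checked against reference labels.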
References
This study is limited to the target task using only numerical scores, and the effect of alignment on evaluations with natural language labels and rankings remains unresolved.
— Exploring the Effects of Alignment on Numerical Bias in Large Language Models
(2601.16444 - Sato et al., 23 Jan 2026) in Limitations, item (ii)