Generalization of the Alignment Tax to Closed-Source GPT-Class Models and Other Domains
Ascertain whether the alignment tax—response homogenization that degrades sampling-based uncertainty estimation—generalizes to closed-source GPT-class language models and to additional domains such as code and dialogue.
References
Generalization to closed-source GPT-class models and other domains (code, dialogue) unconfirmed.
— The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
(2603.24124 - Liu, 25 Mar 2026) in Limitations (4)