Head-to-Head Comparison of B1 Token Entropy and Logit-Based Uncertainty Methods
Develop and run a formal head-to-head comparison between B1 token-entropy–based uncertainty detection and logit-based alternatives such as Semantic Energy, LogTokU, and PRO on matched datasets, with special attention to multi-cluster regimes, to quantify relative discrimination performance.
References
Formal head-to-head comparison on matched data remains future work.
— The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
(2603.24124 - Liu, 25 Mar 2026) in Section 6 — Exp 12: Connection to logit-based remedies