Origin of LLM probability judgment incoherence in autoregression
Establish whether the mechanism by which GPT-4, GPT-3.5-turbo, LLaMA-2-70b, and LLaMA-2-7b form probability judgments originates from the autoregressive training objective implemented in these models.
References
These structures offer insights into the underlying mechanisms employed by LLMs in the formation of probability judgments. We conjecture that this process originates from the implementation of autoregression for the four LLMs.
— Incoherent Probability Judgments in Large Language Models
(2401.16646 - Zhu et al., 30 Jan 2024) in Section 6 (Discussion)