Identify useful inductive biases for natural language and reasoning in LLMs
Determine which inductive biases in large language models trained on natural language are useful, particularly for complex reasoning tasks, so that these models exhibit desirable properties beyond low training and test loss.
References
As opposed to computer vision (Klindt et al., 2021; Geirhos et al., 2018; 2020; 2022; Török et al., 2022; Offert & Bell, 2021; Goyal & Bengio, 2022; Papa et al., 2022), it is unclear what kind of inductive biases are useful for natural languages, especially for more complex tasks such as reasoning.
— Position: Understanding LLMs Requires More Than Statistical Generalization
(2405.01964 - Reizinger et al., 3 May 2024) in Section 4, item (iii) Inductive biases