Large language models predict human sensory judgments across six modalities (2302.01308v2)
Abstract: Determining the extent to which the perceptual world can be recovered from language is a longstanding problem in philosophy and cognitive science. We show that state-of-the-art LLMs can unlock new insights into this problem by providing a lower bound on the amount of perceptual information that can be extracted from language. Specifically, we elicit pairwise similarity judgments from GPT models across six psychophysical datasets. We show that the judgments are significantly correlated with human data across all domains, recovering well-known representations like the color wheel and pitch spiral. Surprisingly, we find that co-training a model (GPT-4) on vision and language does not necessarily lead to improvements specific to the visual modality. To study the influence of specific languages on perception, we also apply the models to a multilingual color-naming task. We find that GPT-4 replicates cross-linguistic variation in English and Russian, illuminating the interaction of language and perception.
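As a rough illustration of what "correlated with human data" can mean for pairwise similarity judgments, the sketch below compares two similarity matrices over their unique stimulus pairs. It is a minimal sketch under stated assumptions, not the authors' analysis pipeline: the function names, the choice of Pearson correlation, and the toy numbers are all hypothetical and are not taken from the paper.

```python
import numpy as np


def upper_triangle(sim):
    """Return the off-diagonal upper-triangle entries of a square similarity matrix."""
    sim = np.asarray(sim, dtype=float)
    if sim.ndim != 2 or sim.shape[0] != sim.shape[1]:
        raise ValueError("similarity matrix must be square")
    rows, cols = np.triu_indices(sim.shape[0], k=1)
    return sim[rows, cols]


def similarity_correlation(model_sim, human_sim):
    """Pearson correlation between two similarity matrices over unique stimulus pairs.

    Assumption: both matrices are symmetric and index the same stimuli in the same order.
    """
    a = upper_triangle(model_sim)
    b = upper_triangle(human_sim)
    if a.shape != b.shape:
        raise ValueError("matrices must have the same shape")
    return float(np.corrcoef(a, b)[0, 1])


# Toy, made-up similarity ratings for three stimuli (illustrative only, not real data).
human = np.array([[1.0, 0.8, 0.1],
                  [0.8, 1.0, 0.3],
                  [0.1, 0.3, 1.0]])
model = np.array([[1.0, 0.7, 0.2],
                  [0.7, 1.0, 0.4],
                  [0.2, 0.4, 1.0]])

r = similarity_correlation(model, human)
print(f"correlation over unique pairs: {r:.3f}")
```

Only the upper triangle is used so that each stimulus pair is counted once and the trivial self-similarity diagonal does not inflate the correlation.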