Delving into the Reversal Curse: How Far Can Large Language Models Generalize? (2410.18808v2)
Abstract: While LLMs showcase unprecedented capabilities, they also exhibit certain inherent limitations when facing seemingly trivial tasks. A prime example is the recently debated "reversal curse", which surfaces when models, having been trained on the fact "A is B", struggle to generalize this knowledge to infer that "B is A". In this paper, we examine the manifestation of the reversal curse across various tasks and delve into both the generalization abilities and the problem-solving mechanisms of LLMs. This investigation leads to a series of significant insights: (1) LLMs are able to generalize to "B is A" when both A and B are presented in the context, as in a multiple-choice question. (2) This generalization ability is highly correlated with the structure of the fact "A is B" in the training documents. For example, the generalization only holds for biographies structured as "[Name] is [Description]" but not for "[Description] is [Name]". (3) We propose and verify the hypothesis that LLMs possess an inherent bias in fact recall during knowledge application, which explains and underscores the importance of document structure to successful learning. (4) The negative impact of this bias on the downstream performance of LLMs can hardly be mitigated through training alone. These findings offer a novel perspective on interpreting LLMs' generalization through their intrinsic mechanisms and provide insights for developing more effective learning methods. Our code and data are available at https://github.com/alibaba/thinking_bias.git.
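To make the experimental contrast in the abstract concrete, below is a minimal sketch (not the authors' released code) of how synthetic biographies in the two document structures and their reverse-direction probes might be constructed; all names, descriptions, and prompt templates are hypothetical illustrations.

```python
# Minimal sketch of the data construction the abstract describes:
# biographies written as "[Name] is [Description]" (name-to-description)
# or "[Description] is [Name]" (description-to-name), each probed in the
# reverse direction ("B is A") either as open QA or as a multiple-choice
# question. Names, descriptions, and templates below are hypothetical.

import random


def make_training_doc(name: str, description: str, order: str) -> str:
    """Build one synthetic biography in the requested surface order."""
    if order == "name_to_description":    # "[Name] is [Description]"
        return f"{name} is {description}."
    if order == "description_to_name":    # "[Description] is [Name]"
        return f"{description} is {name}."
    raise ValueError(f"unknown order: {order}")


def make_reverse_probes(name: str, description: str, distractors: list[str]) -> dict:
    """Probe the reverse direction in two formats: open QA and multiple choice."""
    open_qa = f"Question: Who is {description}?\nAnswer:"

    options = distractors + [name]
    random.shuffle(options)
    letters = "ABCD"[: len(options)]
    multiple_choice = (
        f"Question: Who is {description}?\n"
        + "\n".join(f"{letter}. {option}" for letter, option in zip(letters, options))
        + "\nAnswer:"
    )
    return {"open_qa": open_qa, "multiple_choice": multiple_choice, "gold": name}


if __name__ == "__main__":
    name, desc = "Alice Zephyr", "the first violinist to perform on the Moon"
    print(make_training_doc(name, desc, "name_to_description"))
    probes = make_reverse_probes(name, desc, ["Bob Quill", "Carol Vane", "Dan Moss"])
    print(probes["multiple_choice"])
```

In this setup, the multiple-choice probe places both A and B in the context, which is the condition under which the paper reports successful reverse generalization, whereas the open-QA probe requires the model to recall the name from parameters alone.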
Authors: Zhengkai Lin, Zhihang Fu, Kai Liu, Liang Xie, Binbin Lin, Wenxiao Wang, Deng Cai, Yue Wu, Jieping Ye