
Word Order and World Knowledge (2403.00876v1)

Published 1 Mar 2024 in cs.CL and cs.AI

Abstract: Word order is an important concept in natural language, and in this work we study how word order affects the induction of world knowledge from raw text using LLMs. We probe for such knowledge with word analogies. Specifically, in addition to the natural word order, we extract texts in six fixed word orders from each of five languages and then pretrain LLMs on these texts. Finally, we analyze the word-analogy results across the fixed word orders and show that i) certain fixed word orders consistently outperform or underperform others, though the specifics vary across languages, and ii) the Wov2Lex hypothesis does not hold in pre-trained LLMs, and the natural word order typically yields mediocre results. The source code will be made publicly available at https://github.com/lshowway/probing_by_analogy.
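The abstract describes probing world knowledge with word analogies of the form "a is to b as c is to ?". A minimal sketch of how such a probe is commonly scored over embeddings, using the standard vector-offset (3CosAdd) method, is shown below. The function names and the embedding-loading step are illustrative assumptions, not the paper's released implementation.

```python
# Sketch of a word-analogy probe (3CosAdd): given a : b :: c : ?,
# return the vocabulary word closest to b - a + c in cosine space.
# Names and the loading step are illustrative, not from the paper's code.
import numpy as np

def normalize(vectors: np.ndarray) -> np.ndarray:
    """L2-normalize each row so dot products equal cosine similarities."""
    return vectors / np.linalg.norm(vectors, axis=1, keepdims=True)

def solve_analogy(a: str, b: str, c: str, vocab: list[str], vectors: np.ndarray) -> str:
    """Return the word d maximizing cos(d, b - a + c), excluding a, b, c."""
    idx = {w: i for i, w in enumerate(vocab)}
    vecs = normalize(vectors)
    query = vecs[idx[b]] - vecs[idx[a]] + vecs[idx[c]]
    query /= np.linalg.norm(query)
    scores = vecs @ query
    for w in (a, b, c):  # the query words themselves are not valid answers
        scores[idx[w]] = -np.inf
    return vocab[int(np.argmax(scores))]

# Example usage (embeddings would come from a model pretrained on reordered text):
# vocab, vectors = load_embeddings(...)   # hypothetical loader
# print(solve_analogy("man", "king", "woman", vocab, vectors))  # expected: "queen"
```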

