Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning (2408.16482v1)

Published 29 Aug 2024 in cs.CL

Abstract: Improving the alignment of LLMs with respect to the cultural values that they encode has become an increasingly important topic. In this work, we study whether we can exploit existing knowledge about cultural values at inference time to adjust model responses to cultural value probes. We present a simple and inexpensive method that uses a combination of in-context learning (ICL) and human survey data, and show that we can improve the alignment to cultural values across 5 models that include both English-centric and multilingual LLMs. Importantly, we show that our method could prove useful in test languages other than English and can improve alignment to the cultural values that correspond to a range of culturally diverse countries.

Authors: Rochelle Choenni, Ekaterina Shutova
