Knowledge of cultural moral norms in large language models

Published 2 Jun 2023 in cs.CL | (2306.01857v1)

Abstract: Moral norms vary across cultures. A recent line of work suggests that English LLMs contain human-like moral biases, but these studies typically do not examine moral variation in a diverse cultural setting. We investigate the extent to which monolingual English LLMs contain knowledge about moral norms in different countries. We consider two levels of analysis: 1) whether LLMs capture fine-grained moral variation across countries over a variety of topics such as homosexuality'' anddivorce''; 2) whether LLMs capture cultural diversity and shared tendencies in which topics people around the globe tend to diverge or agree on in their moral judgment. We perform our analyses with two public datasets from the World Values Survey (across 55 countries) and PEW global surveys (across 40 countries) on morality. We find that pre-trained English LLMs predict empirical moral norms across countries worse than the English moral norms reported previously. However, fine-tuning LLMs on the survey data improves inference across countries at the expense of a less accurate estimate of the English moral norms. We discuss the relevance and challenges of incorporating cultural knowledge into the automated inference of moral norms.