
Measuring Social Norms of Large Language Models (2404.02491v4)

Published 3 Apr 2024 in cs.CL, cs.AI, and cs.LG

Abstract: We present a new challenge to examine whether LLMs understand social norms. In contrast to existing datasets, our dataset requires a fundamental understanding of social norms to solve. It features the largest set of social norm skills, consisting of 402 skills and 12,383 questions covering a wide range of social norms, from opinions and arguments to culture and laws. We design the dataset according to the K-12 curriculum, which enables a direct comparison of the social understanding of LLMs to that of humans, specifically elementary students. While prior models achieve nearly random accuracy on our benchmark, recent LLMs such as GPT-3.5-Turbo and LLaMA2-Chat improve performance significantly, to only slightly below human performance. We then propose a multi-agent framework based on LLMs to further improve the models' ability to understand social norms, bringing them on par with humans. Given the increasing adoption of LLMs in real-world applications, our findings are particularly important and present a unique direction for future improvements.
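The abstract describes the multi-agent framework only at a high level. Below is a minimal, illustrative sketch of how multiple LLM "agents" might be aggregated on a single multiple-choice social-norm question; the `query_llm` stub, the personas, and the majority-vote rule are assumptions for illustration, not the authors' actual design.

```python
# Minimal sketch of a multi-agent answering loop for a multiple-choice
# social-norm question. The exact prompts, agent roles, and aggregation
# here are illustrative assumptions, not the paper's implementation.
from collections import Counter


def query_llm(prompt: str) -> str:
    """Hypothetical placeholder for an LLM call; should return an option letter."""
    raise NotImplementedError("Wire this to an actual LLM API.")


def multi_agent_answer(question: str, options: dict[str, str], n_agents: int = 3) -> str:
    """Query several persona-conditioned 'agents' and return the majority-vote answer."""
    personas = [
        "a teacher explaining social norms to elementary students",
        "a careful student reasoning step by step",
        "a reviewer double-checking the most socially appropriate choice",
    ]
    votes = []
    for persona in personas[:n_agents]:
        prompt = (
            f"You are {persona}.\n"
            f"Question: {question}\n"
            + "\n".join(f"{key}. {text}" for key, text in options.items())
            + "\nAnswer with the single letter of the best option."
        )
        # Keep only the first character of the reply as the chosen option.
        votes.append(query_llm(prompt).strip().upper()[:1])
    # Majority vote over the agents' answers.
    return Counter(votes).most_common(1)[0][0]
```

In practice the paper's framework likely involves richer agent interaction (for example, multiple discussion rounds); this sketch only shows the general shape of aggregating several LLM calls on one benchmark question.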

Authors (5)
  1. Ye Yuan (274 papers)
  2. Kexin Tang (3 papers)
  3. Jianhao Shen (18 papers)
  4. Ming Zhang (313 papers)
  5. Chenguang Wang (59 papers)
Citations (2)