Towards Measuring and Modeling "Culture" in LLMs: A Survey (2403.15412v5)

Published 5 Mar 2024 in cs.CY, cs.AI, and cs.CL

Abstract: We present a survey of more than 90 papers that aim to study cultural representation and inclusion in LLMs. We observe that none of the studies explicitly define "culture," which is a complex, multifaceted concept; instead, they probe the models on some specially designed datasets which represent certain aspects of "culture". We call these aspects the proxies of culture, and organize them across two dimensions of demographic and semantic proxies. We also categorize the probing methods employed. Our analysis indicates that only certain aspects of "culture," such as values and objectives, have been studied, leaving several other interesting and important facets, especially the multitude of semantic domains (Thompson et al., 2020) and aboutness (Hershcovich et al., 2022), unexplored. Two other crucial gaps are the lack of robustness of probing techniques and situated studies on the impact of cultural mis- and under-representation in LLM-based applications.

Measuring and Modeling Culture in LLMs: A Survey Overview

The paper "Towards Measuring and Modeling 'Culture' in LLMs: A Survey" provides a comprehensive examination of the intersection between culture and LLMs, focusing on the evaluation of cultural representation, inclusion, and bias. It scrutinizes 39 papers dedicated to this purpose, highlighting the existing methodology, results, and gaps in the current body of literature. The survey underscores the complexity of defining "culture," noting that none of the reviewed papers provide a conclusive definition, instead relying on various cultural proxies within their datasets.

Cultural Proxies and Dimensions

The paper organizes the study of culture along three main dimensions: demographic proxies, semantic proxies, and axes of language-culture interaction.

  • Demographic Proxies: This dimension includes aspects such as region, language, gender, race, religion, and ethnicity. Region and language often serve as prevalent proxies for culture, but the paper notes that cultural studies involving other dimensions like gender and ethnicity are influenced significantly by Western-centric diversity narratives.
  • Semantic Proxies: While the majority of studies focus on semantic proxies like emotions and values, the survey identifies a lack of research across the full spectrum of semantic domains, such as kinship terms or physical world concepts.
  • Language-Culture Interaction: Based on the framework of Hershcovich et al. (2022), this dimension categorizes interactions into aboutness, common ground, and objectives/values. The authors find that many papers concentrate on objectives and values, while aboutness remains largely unexamined. (An illustrative sketch after this list shows how a single probing item can be tagged along these dimensions.)
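To make the taxonomy concrete, the sketch below shows how a single probing item might be tagged along these dimensions. This is an illustration only, not a schema from the paper; the class, field names, and example values are all assumptions.

```python
from dataclasses import dataclass

# Illustrative only: the survey defines these proxy dimensions conceptually,
# not as a data schema. All names and values here are assumptions.

@dataclass
class CultureProbeItem:
    text: str                # the prompt shown to the LLM
    demographic_proxy: str   # e.g. "region", "language", "gender", "religion"
    demographic_value: str   # e.g. "Indonesia", "Bengali"
    semantic_proxy: str      # e.g. "values", "emotions", "kinship terms"
    interaction_axis: str    # "aboutness", "common ground", or "objectives/values"

item = CultureProbeItem(
    text="Is it polite to address a teacher by their first name?",
    demographic_proxy="region",
    demographic_value="Indonesia",
    semantic_proxy="social norms",
    interaction_axis="common ground",
)
print(item.demographic_value, "/", item.interaction_axis)
```

Tagging items this way also makes the survey's coverage claims checkable: counting items per semantic proxy and interaction axis immediately reveals which cells, such as aboutness, are underpopulated.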

Methodologies for Probing Culture in LLMs

The survey categorizes the methodologies used to assess culture within LLMs into black-box and white-box approaches. The predominant method is black-box probing, where LLMs are queried with culture-specific prompts and their responses are analyzed. These techniques are sub-categorized into discriminative probing, where models select from given options, and generative probing, which involves free-text generation by the models. The authors critique the robustness of current probing methods, highlighting issues such as sensitivity to prompt wording and limited interpretability; both probing styles and a simple consistency check are sketched below.
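The following is a minimal sketch of the two black-box probing styles plus a crude paraphrase-consistency check. It assumes a hypothetical query_llm wrapper around whatever model API is in use; none of the prompts or helper names come from the surveyed papers.

```python
# Minimal sketch of black-box cultural probing. `query_llm` is a
# hypothetical stand-in for any chat/completions API client.

def query_llm(prompt: str) -> str:
    raise NotImplementedError("plug in your model API client here")

def discriminative_probe(statement: str, options: list[str]) -> str:
    """Discriminative probing: the model must pick one of the given
    options, so answers can be scored automatically against a reference."""
    prompt = (
        f"Statement: {statement}\n"
        f"Choose exactly one of: {', '.join(options)}\n"
        "Answer:"
    )
    return query_llm(prompt).strip()

def generative_probe(question: str) -> str:
    """Generative probing: the model answers in free text, which is then
    analyzed (e.g., by annotators or classifiers) for cultural content."""
    return query_llm(f"{question}\nAnswer in one short paragraph.")

def paraphrase_consistent(statement: str, options: list[str],
                          paraphrases: list[str]) -> bool:
    """Run the same item under several paraphrased wordings; disagreement
    is a symptom of the prompt sensitivity the survey critiques."""
    answers = {discriminative_probe(p, options)
               for p in [statement, *paraphrases]}
    return len(answers) == 1
```

A probe judged on a single prompt wording can flip its answer under a paraphrase, so checking consistency across several wordings is the simplest guard against over-reading one response.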

Identified Gaps and Recommendations

The paper identifies three critical gaps: (1) limited exploration and coverage of cultural facets, with most work focusing on values and norms; (2) limited robustness and reliability of probing methods; and (3) an absence of contextual, situated studies evaluating practical LLM applications. To address these gaps, the authors offer several recommendations:

  • Definitional Clarifications: Future research should clearly specify the cultural proxies and situate studies within a broader cultural context.
  • Diverse Cultural Domains: There is a need for wider exploration across various semantic domains and linguistic-cultural interactions.
  • Interdisciplinary Collaboration: Collaborating with anthropology, HCI, and ICTD could offer deeper insight into cultural nuances.
  • Increased Focus on Multilingual Datasets: More culturally nuanced datasets that are not mere translations should be developed to better reflect and study cultural interactions in LLMs.

Conclusion

This survey provides a critical assessment of the current state of research on culture in LLMs, offering a foundational taxonomy and identifying methodological and conceptual weaknesses in existing work. The paper makes crucial strides toward understanding how LLMs interact with multifaceted cultural aspects and offers a blueprint for future research aimed at achieving better cultural representation and inclusion in AI systems.

References (73)
  1. Peer-to-peer in the workplace: A view from the road. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, CHI ’16, page 5063–5075, New York, NY, USA. Association for Computing Machinery.
  2. SODAPOP: Open-ended discovery of social biases in social commonsense reasoning models. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 1573–1596, Dubrovnik, Croatia. Association for Computational Linguistics.
  3. Restoring and attributing ancient texts using deep neural networks. Nature, 603:280–283.
  4. Training a helpful and harmless assistant with reinforcement learning from human feedback.
  5. Constitutional AI: Harmlessness from AI feedback.
  6. Social commonsense for explanation and cultural bias discovery. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 3745–3760, Dubrovnik, Croatia. Association for Computational Linguistics.
  7. On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’21, page 610–623, New York, NY, USA. Association for Computing Machinery.
  8. Janet Blake. 2000. On defining the cultural heritage. The International and Comparative Law Quarterly, 49(1):61–85.
  9. Language (technology) is power: A critical survey of “bias” in NLP. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5454–5476, Online. Association for Computational Linguistics.
  10. Workflow from within and without: Technology and cooperative work on the print industry shopfloor. In European Conference on Computer Supported Cooperative Work.
  11. Cultural Adaptation of Recipes. Transactions of the Association for Computational Linguistics, 12:80–99.
  12. Assessing cross-cultural alignment between ChatGPT and human societies: An empirical study. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP), pages 53–67, Dubrovnik, Croatia. Association for Computational Linguistics.
  13. Michael Castelle. 2022. Sapir’s thought-grooves and Whorf’s tensors: Reconciling transformer architectures with cultural anthropology. In Cultures in AI/AI in Culture, A NeurIPS 2022 Workshop. University of Warwick, Centre for Interdisciplinary Methodologies.
  14. Sociocultural norm similarities and differences via situational alignment and explainable textual entailment. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 3548–3564, Singapore. Association for Computational Linguistics.
  15. Jan Cieciuch and Shalom Schwartz. 2012. The number of distinct basic values and their structure assessed by PVQ-40. Journal of Personality Assessment, 94:321–8.
  16. Toward cultural bias evaluation datasets: The case of Bengali gender, religious, and national identity. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP), pages 68–83, Dubrovnik, Croatia. Association for Computational Linguistics.
  17. Building socio-culturally inclusive stereotype resources with community engagement.
  18. Towards measuring the representation of subjective global opinions in language models.
  19. EtiCor: Corpus for analyzing LLMs for etiquettes. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 6921–6931, Singapore. Association for Computational Linguistics.
  20. Lance Eliot. 2022. AI ethics and the future of where large language models are heading. Forbes.
  21. EVS/WVS. 2022. Joint EVS/WVS 2017–2022 dataset (Joint EVS/WVS). GESIS, Cologne. ZA7505 Data file Version 4.0.0, https://doi.org/10.4232/1.14023.
  22. From pretraining data to language models to downstream tasks: Tracking the trails of political biases leading to unfair NLP models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11737–11762, Toronto, Canada. Association for Computational Linguistics.
  23. NORMSAGE: Multi-lingual multi-cultural norm discovery from conversations on-the-fly. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 15217–15230, Singapore. Association for Computational Linguistics.
  24. Improving alignment of dialogue agents via targeted human judgements.
  25. Greg Gondwe. 2023. ChatGPT and the Global South: How are journalists in sub-Saharan Africa engaging with generative AI? Online Media and Global Communication, 2.
  26. Self-assessment tests are unreliable measures of LLM personality.
  27. Challenges and strategies in cross-cultural NLP. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6997–7013, Dublin, Ireland. Association for Computational Linguistics.
  28. G. Hofstede. 1984. Culture’s Consequences: International Differences in Work-Related Values. Cross Cultural Research and Methodology. SAGE Publications.
  29. Jing Huang and Diyi Yang. 2023a. Culturally aware natural language inference. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 7591–7609, Singapore. Association for Computational Linguistics.
  30. Jing Huang and Diyi Yang. 2023b. Culturally aware natural language inference. In The 2023 Conference on Empirical Methods in Natural Language Processing.
  31. SeeGULL: A stereotype benchmark with broad geo-cultural coverage leveraging generative models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9851–9870, Toronto, Canada. Association for Computational Linguistics.
  32. Can machines learn morality? The Delphi experiment.
  33. The ghost in the machine has an American accent: Value conflict in GPT-3.
  34. Multi-lingual and multi-cultural figurative language understanding. In Findings of the Association for Computational Linguistics: ACL 2023, pages 8269–8284, Toronto, Canada. Association for Computational Linguistics.
  35. Making chat at home in the hospital: Exploring chat use by nurses. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems.
  36. Evaluating the diversity, equity, and inclusion of NLP technology: A case study for Indian languages. In Findings of the Association for Computational Linguistics: EACL 2023, pages 1763–1777, Dubrovnik, Croatia. Association for Computational Linguistics.
  37. Large language models only pass primary school exams in Indonesia: A comprehensive test on IndoMMLU. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 12359–12374, Singapore. Association for Computational Linguistics.
  38. Large language models as superpositions of cultural perspectives.
  39. A Cultural Approach to Interpersonal Communication: Essential Readings. Wiley.
  40. Cristina Mora. 2013. Cultures and organizations: Software of the mind: Intercultural cooperation and its importance for survival. Journal of Media Research, 6(1):65.
  41. Global Voices, local biases: Socio-cultural prejudices across languages. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 15828–15845, Singapore. Association for Computational Linguistics.
  42. Theory of Culture. New directions in cultural analysis. University of California Press.
  43. StereoSet: Measuring stereotypical bias in pretrained language models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 5356–5371, Online. Association for Computational Linguistics.
  44. CrowS-pairs: A challenge dataset for measuring social biases in masked language models. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1953–1967, Online. Association for Computational Linguistics.
  45. Having beer after prayer? Measuring cultural bias in large language models.
  46. Extracting cultural commonsense knowledge at scale. In Proceedings of the ACM Web Conference 2023, WWW ’23. ACM.
  47. At home with the technology: an ethnographic study of a set-top-box trial. ACM Trans. Comput. Hum. Interact., 6(3):282–308.
  48. Shramay Palta and Rachel Rudinger. 2023. FORK: A bite-sized test set for probing culinary cultural biases in commonsense reasoning models. In Findings of the Association for Computational Linguistics: ACL 2023, pages 9952–9962, Toronto, Canada. Association for Computational Linguistics.
  49. Talcott Parsons. 1972. Culture and social system revisited. Social Science Quarterly, pages 253–266.
  50. RiSAWOZ: A large-scale multi-domain Wizard-of-Oz dataset with rich semantic annotations for task-oriented dialogue modeling. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 930–940, Online. Association for Computational Linguistics.
  51. Aida Ramezani and Yang Xu. 2023. Knowledge of cultural moral norms in large language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 428–446, Toronto, Canada. Association for Computational Linguistics.
  52. Ethical reasoning over moral alignment: A case and framework for in-context ethical policies in LLMs. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 13370–13388, Singapore. Association for Computational Linguistics.
  53. Development in Judging Moral Issues. University of Minnesota Press.
  54. Re-imagining algorithmic fairness in india and beyond. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’21, page 315–328, New York, NY, USA. Association for Computing Machinery.
  55. NLPositionality: Characterizing design biases of datasets and models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9080–9102, Toronto, Canada. Association for Computational Linguistics.
  56. Large pre-trained language models contain human-like biases of what is right and wrong to do. Nature Machine Intelligence, 4:258–268.
  57. Quantifying language models’ sensitivity to spurious features in prompt design or: How I learned to start worrying about prompt formatting.
  58. Modeling cross-cultural pragmatic inference with codenames duet. In Findings of the Association for Computational Linguistics: ACL 2023, pages 6550–6569, Toronto, Canada. Association for Computational Linguistics.
  59. The Cultural Psychology of Development: One Mind, Many Mentalities, volume 1.
  60. Everything you need to know about multilingual LLMs: Towards fair, performant and reliable models for languages of the world. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 6: Tutorial Abstracts), pages 21–26, Toronto, Canada. Association for Computational Linguistics.
  61. Value kaleidoscope: Engaging AI with pluralistic human values, rights, and duties.
  62. A word on machine ethics: A response to Jiang et al. (2021).
  63. Probing the moral development of large language models through defining issues test.
  64. Bill Thompson, Seán G. Roberts, and Gary Lupyan. 2020. Cultural influences on word meanings revealed through large-scale semantic alignment. Nature Human Behaviour, 4(10):1029–1038.
  65. Silvia Vaccino-Salvadore. 2023. Exploring the ethical dimensions of using ChatGPT in language learning and beyond. Languages, 8(3).
  66. Are personalized stochastic parrots more dangerous? evaluating persona biases in dialogue systems. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 9677–9705, Singapore. Association for Computational Linguistics.
  67. SeaEval for multilingual foundation models: From cross-lingual alignment to cultural reasoning.
  68. COPAL-ID: Indonesian language reasoning with local culture and nuances.
  69. Gradient-based language model red teaming.
  70. Cross-cultural analysis of human values, morals, and biases in folk tales. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 5113–5125, Singapore. Association for Computational Linguistics.
  71. From instructions to intrinsic human values – a survey of alignment goals for big models.
  72. The skipped beat: A study of sociopragmatic understanding in LLMs for 64 languages. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 2630–2662, Singapore. Association for Computational Linguistics.
  73. Cultural compass: Predicting transfer learning success in offensive language detection with cultural features. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 12684–12702, Singapore. Association for Computational Linguistics.
Authors (8)
  1. Muhammad Farid Adilazuarda (14 papers)
  2. Sagnik Mukherjee (13 papers)
  3. Pradhyumna Lavania (2 papers)
  4. Siddhant Singh (7 papers)
  5. Alham Fikri Aji (94 papers)
  6. Jacki O'Neill (4 papers)
  7. Ashutosh Modi (60 papers)
  8. Monojit Choudhury (66 papers)