Investigating Human Values in Online Communities (2402.14177v3)
Abstract: Studying human values is instrumental for cross-cultural research, enabling a better understanding of preferences and behaviour of society at large and communities therein. To study the dynamics of communities online, we propose a method to computationally analyse values present on Reddit. Our method allows analysis at scale, complementing survey based approaches. We train a value relevance and a value polarity classifier, which we thoroughly evaluate using in-domain and out-of-domain human annotations. Using these, we automatically annotate over six million posts across 12k subreddits with Schwartz values. Our analysis unveils both previously recorded and novel insights into the values prevalent within various online communities. For instance, we discover a very negative stance towards conformity in the Vegan and AbolishTheMonarchy subreddits. Additionally, our study of geographically specific subreddits highlights the correlation between traditional values and conservative U.S. states. Through our work, we demonstrate how our dataset and method can be used as a complementary tool for qualitative study of online communication.
- Wallstreetbets beyond gamestop, yolos, and the moon: The unique traits of reddit’s finance communities. In AMCIS 2022.
- Probing pre-trained language models for cross-cultural differences in values. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP), pages 114–130, Dubrovnik, Croatia. Association for Computational Linguistics.
- Dimensions of culture: The case of slovakia as an outlier in hofstede’s research. Ceskoslovenska Psychologie, 60(1).
- A corpus of German Reddit exchanges (GeRedE). In Proceedings of the 12th Language Resources and Evaluation Conference, pages 6310–6316, Marseille, France. European Language Resources Association.
- Values in words: Using language to evaluate and understand personal values. Proceedings of the International AAAI Conference on Web and Social Media, 9(1):31–40.
- Pew Research Center. 2014. Political ideology by state.
- Quarantined! examining the effects of a community-wide moderation intervention on reddit. ACM Trans. Comput.-Hum. Interact., 29(4).
- Structural equivalence of the values domain across cultures: Distinguishing sampling fluctuations from meaningful variation. Journal of Cross-Cultural Psychology, 39(4):345–365.
- Philipp Gerlach and Kimmo Eriksson. 2021. Measuring cultural dimensions: external validity and internal consistency of hofstede’s vsm 2013 scales. Frontiers in Psychology, 12:662604.
- Matej Gjurković and Jan Šnajder. 2018. Reddit: A gold mine for personality prediction. In Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media, pages 87–97, New Orleans, Louisiana, USA. Association for Computational Linguistics.
- Chapter two - moral foundations theory: The pragmatic validity of moral pluralism. In Patricia Devine and Ashby Plant, editors, Advances in Experimental Social Psychology, volume 47 of Advances in Experimental Social Psychology, pages 55–130. Academic Press.
- Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing.
- Geert Hofstede. 1984. Culture’s consequences: International differences in work-related values, volume 5. sage.
- Differences between omnivores and vegetarians in personality profiles, values, and empathy: a systematic review. Frontiers in psychology, 12:579700.
- Automatic generation of large-scale multi-turn dialogues from Reddit. In Proceedings of the 29th International Conference on Computational Linguistics, pages 3360–3373, Gyeongju, Republic of Korea. International Committee on Computational Linguistics.
- Terence Jackson. 2020. The legacy of geert hofstede.
- Alexander Jaffe. 2009. Stance: Sociolinguistic perspectives. In Stance: Sociolinguistic Perspectives.
- Co-writing with opinionated language models affects users’ views. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, CHI ’23, New York, NY, USA. Association for Computing Machinery.
- Mistral 7b.
- Identifying the human values behind arguments. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4459–4471, Dublin, Ireland. Association for Computational Linguistics.
- An empirical assessment of generational differences in basic human values. Psychological reports, 101(2):339–352.
- Methodological problems in cross-cultural research: An updated review. MIR: Management International Review, pages 79–91.
- User migration in online social networks: A case study on reddit during a period of community unrest. In Proceedings of the International AAAI Conference on Web and Social Media, volume 10, pages 279–288.
- Bonny Norton. 1997. Language, identity, and the ownership of english. TESOL Quarterly, 31(3):409–429.
- mRedditSum: A multimodal abstractive summarization dataset of Reddit threads with images. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 4117–4132, Singapore. Association for Computational Linguistics.
- Development and validation of the personal values dictionary: A theory-driven tool for investigating references to basic human values in text. European Journal of Personality, 34(5):885–902.
- Valuenet: A new dataset for human value driven dialogue system. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 11183–11191.
- Milton Rokeach. 1973. The Nature of Human Values. The Nature of Human Values. Free Press, New York, NY, US.
- Identifying morality frames in political tweets using relational learning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 9939–9958, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Values and religiosity: A meta-analysis of studies using schwartz’s model. Personality and individual differences, 37(4):721–734.
- Shalom H. Schwartz. 1994. Are there universal aspects in the structure and contents of human values? Journal of Social Issues, 50(4):19–45.
- Shalom H. Schwartz. 2012. An overview of the schwartz theory of basic values. Online Readings in Psychology and Culture, 2:11.
- Shalom H Schwartz and Anat Bardi. 1997. Influences of adaptation to communist rule on value priorities in eastern europe. Political psychology, 18(2):385–410.
- Shalom H Schwartz and Jan Cieciuch. 2022. Measuring the refined theory of individual values in 49 cultural groups: psychometrics of the revised portrait value questionnaire. Assessment, 29(5):1005–1019.
- A characterization of political communities on reddit. In Proceedings of the 30th ACM Conference on Hypertext and Social Media, HT ’19, page 259–263, New York, NY, USA. Association for Computing Machinery.
- Ethical challenges in online research: Public/private perceptions. Research Ethics, 13(3-4):184–199.
- The moral foundations reddit corpus.
- Elsbeth Turcan and Kathy McKeown. 2019. Dreaddit: A Reddit dataset for stress analysis in social media. In Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), pages 97–107, Hong Kong. Association for Computational Linguistics.
- Do differences in values influence disagreements in online discussions? In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 15986–16008, Singapore. Association for Computational Linguistics.
- Making online communities ’better’: A taxonomy of community values on reddit.
- Linguistic analysis of schizophrenia in Reddit posts. In Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, pages 74–83, Minneapolis, Minnesota. Association for Computational Linguistics.
- Alyssa N Zucker and Laina Y Bay-Cheng. 2010. Minding the gap between feminist identity and attitudes: The behavioral and ideological divide between feminists and non-labelers. Journal of personality, 78(6):1895–1924.