The Moral Foundations Reddit Corpus

Published 10 Aug 2022 in cs.CL, cs.CY, and cs.LG | (2208.05545v3)

Abstract: Moral framing and sentiment can affect a variety of online and offline behaviors, including donation, environmental action, political engagement, and protest. Various computational methods in NLP have been used to detect moral sentiment from textual data, but achieving strong performance in such subjective tasks requires large, hand-annotated datasets. Previous corpora annotated for moral sentiment have proven valuable, and have generated new insights both within NLP and across the social sciences, but have been limited to Twitter. To facilitate improving our understanding of the role of moral rhetoric, we present the Moral Foundations Reddit Corpus, a collection of 16,123 English Reddit comments that have been curated from 12 distinct subreddits, hand-annotated by at least three trained annotators for 8 categories of moral sentiment (i.e., Care, Proportionality, Equality, Purity, Authority, Loyalty, Thin Morality, Implicit/Explicit Morality) based on the updated Moral Foundations Theory (MFT) framework. We evaluate baselines using LLMs (Llama3-8B, Ministral-8B) in zero-shot, few-shot, and PEFT settings, comparing their performance to fine-tuned encoder-only models like BERT. The results show that LLMs continue to lag behind fine-tuned encoders on this subjective task, underscoring the ongoing need for human-annotated moral corpora for AI alignment evaluation. Keywords: moral sentiment annotation, moral values, moral foundations theory, multi-label text classification, LLMs, benchmark dataset, evaluation and alignment resource

Abstract PDF Upgrade to Chat

Authors (17)

First 10 authors:

Citations (16)

View on Semantic Scholar

Summary

The paper presents a novel dataset of 16,123 Reddit comments annotated for 8 moral sentiment categories using the Moral Foundations Theory framework.
It employs rigorous annotation protocols with trained annotators, reporting medium to high interannotator agreement using metrics like Fleiss' kappa and PABAK.
Baseline evaluations show that fine-tuned models outperform large language models, underscoring the dataset's cross-domain applicability for nuanced moral sentiment analysis.

The Moral Foundations Reddit Corpus

The paper "The Moral Foundations Reddit Corpus" (2208.05545) introduces an annotated dataset focused on understanding moral sentiment within the context of Reddit comments. This corpus aims to facilitate improved insights into the dynamics of moral language across various social communities.

Introduction and Motivation

The introduction of the Moral Foundations Reddit Corpus (MFRC) addresses the need for annotations of moral rhetoric in diverse linguistic and social settings. Moral sentiment, as expressed through online platforms, impacts behaviors such as political engagement and social actions. Existing datasets have largely focused on Twitter, limiting the scope for analysis to short-form text from a single platform. To expand the analytical framework, MFRC compiles Reddit comments, enabling exploration of moral sentiment in a broader, community-driven context.

Dataset Composition

The MFRC consists of 16,123 comments from 12 morally-relevant subreddits, providing annotations for 8 categories of moral sentiment. This includes Care, Proportionality, Equality, Purity, Authority, Loyalty, Thin Morality, and Implicit/Explicit Morality based on the updated Moral Foundations Theory (MFT) framework. The corpus is structured to allow for in-depth analyses of moral dynamics within different subreddit communities.

Figure 1: The heatmaps show Interannotator Agreement (PABAK and Kappa) scores for all subreddits and foundations. Higher agreement corresponds with darker colors in both heatmaps.

Annotation Process

Annotations were conducted by trained annotators, with demographic and psychological metadata collected to investigate potential bias and response patterns. This level of detail aims to ensure robust analysis across subjective dimensions of morality and addresses inherent subjectivity in sentiment classification. Interannotator agreements were assessed using Fleiss’s kappa and PABAK, indicating medium to high agreements once adjusted for prevalence issues.

Model Evaluation

Baseline evaluations were conducted using LLMs such as Llama3-8B and Ministral-8B, as well as fine-tuned models like BERT. These evaluations included zero-shot, few-shot, and parameter-efficient fine-tuning (PEFT) settings to assess the comparative performance of these models. Results indicated that fine-tuned models outperformed LLMs, underscoring the necessity of domain-specific annotated datasets for accurate sentiment classification.

Cross-Domain Applications

Preliminary results demonstrate transferability between MFRC and the Moral Foundations Twitter Corpus (MFTC), highlighting cross-platform and cross-linguistic analysis potential. Training models on MFRC and testing on MFTC—and vice versa—yielded comparable performance, suggesting that further research could capitalize on these datasets for generalized moral sentiment classification.

Conclusion

The MFRC builds upon prior work to provide a rich dataset for analyzing moral language within varied social contexts. By including community-specific and longer-form discourse from Reddit, this corpus facilitates new lines of inquiry into moral sentiment analysis. While the dataset provides substantial detail and methodological updates, researchers are advised to consider inherent biases and data-specific limitations. The corpus stands as a benchmark for evaluating moral sentiment classifications and can aid in developing comprehensive AI alignment strategies.

In summary, the MFRC not only bridges existing gaps in moral language datasets but also paves the way for extended applications in both NLP and social science domains. The dataset’s structure and annotation detail foster nuanced analyses of moral rhetoric, integral to understanding contemporary social dynamics online.

Markdown Report Issue