Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions (2403.07678v2)

Published 12 Mar 2024 in cs.CL and cs.CY

Abstract: Moral values play a fundamental role in how we evaluate information, make decisions, and form judgements around important social issues. Controversial topics, including vaccination, abortion, racism, and sexual orientation, often elicit opinions and attitudes that are not solely based on evidence but rather reflect moral worldviews. Recent advances in NLP show that moral values can be gauged in human-generated textual content. Building on the Moral Foundations Theory (MFT), this paper introduces MoralBERT, a range of language representation models fine-tuned to capture moral sentiment in social discourse. We describe a framework for both aggregated and domain-adversarial training on multiple heterogeneous MFT human-annotated datasets sourced from Twitter (now X), Reddit, and Facebook that broaden textual content diversity in terms of social media audience interests, content presentation and style, and spreading patterns. We show that the proposed framework achieves an average F1 score that is between 11% and 32% higher than lexicon-based approaches, Word2Vec embeddings, and zero-shot classification with LLMs such as GPT-4 for in-domain inference. Domain-adversarial training yields better out-of domain predictions than aggregate training while achieving comparable performance to zero-shot learning. Our approach contributes to annotation-free and effective morality learning, and provides useful insights towards a more comprehensive understanding of moral narratives in controversial social debates using NLP.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (107)
  1. Alfred V. Aho and Jeffrey D. Ullman. 1972. The Theory of Parsing, Translation and Compiling, volume 1. Prentice-Hall, Englewood Cliffs, NJ.
  2. Approaches to Cross-Domain Sentiment Analysis: A Systematic Literature Review. IEEE Access, 5:16173–16192.
  3. The Moral Debater: A Study on the Computational Generation of Morally Framed Arguments. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, ACL ’22, pages 8782–8797, Dublin, Ireland. Association for Computational Linguistics.
  4. American Psychological Association. 1983. Publications Manual. American Psychological Association, Washington, DC.
  5. Aligning to Social Norms and Values in Interactive Narratives. In Proceedings of the 2022 Conference of the North American Chapter ofthe Association for Computational Linguistics: Human Language Technologies, NAACL ’22, pages 5994–6017, Seattle, USA. Association for Computational Linguistics.
  6. Rie Kubota Ando and Tong Zhang. 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research, 6:1817–1853.
  7. Galen Andrew and Jianfeng Gao. 2007. Scalable training of L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-regularized log-linear models. In Proceedings of the 24th International Conference on Machine Learning, pages 33–40.
  8. Moralstrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction. Knowledge-based systems, 191:105184.
  9. MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction. Knowledge-Based Systems, 191:1–11.
  10. LibertyMFD: A Lexicon to Assess the Moral Foundation of Liberty. In Proceedings of the 2022 ACM Conference on Information Technology for Social Good, GoodIT ’22, page 154–160, New York, NY, USA. Association for Computing Machinery.
  11. Uncovering values: Detecting latent moral content from natural language with explainable and non-trained methods. In Proceedings of Deep Learning Inside Out: The 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, DeeLIO ’22, pages 33–41, Dublin, Ireland and Online. Association for Computational Linguistics.
  12. Towards Using Word Embedding Vector Space for Better Cohort Analysis. In Proceedings of the International AAAI Conference on Web and Social Media, ICWSM ’20, pages 919–923, Atlanta, Georgia. AAAI Press.
  13. Fine-tuning language models to find agreement among humans with diverse preferences. In Advances in Neural Information Processing Systems, NeurIPS ’22, pages 38176–38189. Curran Associates, Inc.
  14. Natural language processing with Python: analyzing text with the natural language toolkit. O’Reilly Media, Inc.
  15. Understanding underlying moral values and language use of covid-19 vaccine attitudes on twitter. PNAS nexus, 2(3):pgad013.
  16. Johan Brännmark. 2015. Moral disunitarianism. The Philosophical Quarterly, 66(264):481–499.
  17. Case Report: Utilizing AI and NLP to Assist with Healthcare and Rehabilitation During the COVID-19 Pandemic. Frontiers in Artificial Intelligence, 4(2):1–7.
  18. Alternation. Journal of the Association for Computing Machinery, 28(1):114–133.
  19. Addressing imbalance in multilabel classification: Measures and random resampling algorithms. Neurocomputing, 163:3–16.
  20. Moral foundations vignettes: A standardized stimulus database of scenarios based on moral foundations theory. Behavior research methods, 47(4):1178–1198.
  21. James W. Cooley and John W. Tukey. 1965. An algorithm for the machine calculation of complex Fourier series. Mathematics of Computation, 19(90):297–301.
  22. An evaluation of document clustering and topic modelling in two online social networks: Twitter and reddit. Information Processing & Management, 57(2):102034.
  23. A Survey of the State of Explainable AI for Natural Language Processing. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, AACL ’20, page 447–459, Suzhou, China.
  24. Automated Hate Speech Detection and the Problem of Offensive Language. In Proceedings of the 11th International Conference on Web and Social Media, ICWSM ’17, pages 512–515.
  25. Automated hate speech detection and the problem of offensive language. In Proceedings of the international AAAI conference on web and social media, volume 11, pages 512–515.
  26. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL ’19, page 4171–4186.
  27. Which moral foundations predict willingness to make lifestyle changes to avert climate change in the USA? PLoS ONE, 11(10):1–11.
  28. Multi-Dimensional Gender Bias Classification. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP ’20, pages 314–331.
  29. Closed-and open-vocabulary approaches to text analysis: A review, quantitative comparison, and recommendations. Psychological Methods, 26(4):398.
  30. Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP ’21, pages 698–718, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
  31. Social Chemistry 101: Learning to Reason about Social and Moral Norms. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP ’20, pages 653–670, Online. Association for Computational Linguistics.
  32. An empirical exploration of moral foundations theory in partisan news sources. In Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC ’16, pages 3730–3736.
  33. Iason Gabriel. 2020. Artificial Intelligence, Values, and Alignment. Minds and Machines, 30(3):411–437.
  34. Yaroslav Ganin and Victor Lempitsky. 2015. Unsupervised domain adaptation by backpropagation. In International conference on machine learning, pages 1180–1189. PMLR.
  35. Automatic assignment of moral foundations to movies by word embedding. Knowledge-Based Systems, 270:110539.
  36. Moral Foundations Theory: The Pragmatic Validity of Moral Pluralism. In Advances in Experimental Social Psychology, volume 47, pages 55–130. Elsevier, Amsterdam, the Netherlands.
  37. Liberals and Conservatives Rely on Different Sets of Moral Foundations. Journal of Personality and Social Psychology, 96(5):1029–1046.
  38. Mapping the moral domain. Journal of personality and social psychology, 101(2):366–85.
  39. A data fusion framework for multi-domain morality learning. In Proceedings of the International AAAI Conference on Web and Social Media, volume 17, pages 281–291.
  40. Dan Gusfield. 1997. Algorithms on Strings, Trees and Sequences. Cambridge University Press, Cambridge, UK.
  41. Jonathan Haidt. 2012. The righteous mind: Why good people are divided by politics and religion. Vintage.
  42. Jonathan Haidt and Jesse Graham. 2007. When morality opposes justice: Conservatives have moral intuitions that liberals may not recognize. Social justice research, 20(1):98–116.
  43. Kevin A. Hallgren. 2012. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial. Tutor Quant Methods Psychol, 8(1):23–34.
  44. Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP ’16, pages 595–605, Austin, Texas, USA.
  45. Aligning AI With Shared Human Values. In Proceedings of the 2021 International Conference on Learning Representations, ICLR ’21, pages 1–29.
  46. Patrick L. Hill and Daniel K. Lapsley. 2009. Persons and situations in the moral domain. Journal of Research in Personality, 43(2):245–246.
  47. The extended moral foundations dictionary (emfd): Development and applications of a crowd-sourced approach to extracting moral intuitions from text. Behavior research methods, 53:232–246.
  48. Learning to Adapt Domain Shifts of Moral Values via Instance Weighting. In Proceedings of the 33rd ACM Conference on Hypertext and Social Media, HT ’22, pages 121–131. Association for Computing Machinery.
  49. Knowledge Graphs meet Moral Values. In Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, *SEM ’20, pages 71–80, Barcelona, Spain (Online). Association for Computational Linguistics.
  50. Can machines learn morality? the delphi experiment. arXiv preprint arXiv:2110.07574.
  51. Predicting demographics, moral foundations, and human values from digital behaviours. Computers in Human Behavior, 92:428–445.
  52. Human values and attitudes towards vaccination in social media. In Companion Proceedings of The 2019 World Wide Web Conference, WWW ’19, pages 248–254.
  53. Moral Concerns are Differentially Observable in Language. Cognition, 212:104696.
  54. Identifying the Human Values behind Arguments. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, ACL ’22, pages 4459–4471, Dublin, Ireland. Association for Computational Linguistics.
  55. Gary King and Langche Zeng. 2001. Logistic regression in rare events data. Political analysis, 9(2):137–163.
  56. Does Personalization Help? Predicting How Social Situations Affect Personal Values. In HHAI2022: Augmenting Human Intellect, pages 157–169.
  57. Alex Gwo Jen Lan and Ivandré Paraboni. 2022. Text- and author-dependent moral foundations classification. New Review of Hypermedia and Multimedia, 0(0):1–21.
  58. What does a text classifier learn about morality? an explainable method for cross-domain comparison of moral rhetoric. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14113–14132.
  59. Cross-Domain Classification of Moral Values. In Findings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL ’22, pages 2727–2745, Seattle, USA. Association for Computational Linguistics.
  60. Value inference in sociotechnical systems: Blue sky ideas track. In Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems, AAMAS ’23, pages 1–7, London, United Kingdom. IFAAMAS.
  61. Axies: Identifying and Evaluating Context-Specific Values. In Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS ’21, pages 799–808, Online. IFAAMAS.
  62. What Values Should an Agent Align With? Autonomous Agents and Multi-Agent Systems, 36(23):32.
  63. On interpretation of network embedding via taxonomy induction. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’18, pages 1812–1820. ACM.
  64. Aligning Generative Language Models with Human Values. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 242–253, Seattle, USA. Association for Computational Linguistics.
  65. Politics: pretraining with same-story article comparison for ideology prediction and stance detection. arXiv preprint arXiv:2205.00619.
  66. UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI ’21, pages 13480–13488.
  67. Unicorn on rainbow: A universal commonsense reasoning model on a new multitask benchmark. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 13480–13488.
  68. Scott M. Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In booktitle = Advances in Neural Information Processing Systems,, NeurIPS ’17, pages 1208–1217, Long Beach, CA, USA.
  69. A Survey on Bias and Fairness in Machine Learning. ACM Computing Surveys, 54(6).
  70. Authority without care: Moral values behind the mask mandate response. In Proceedings of the International AAAI Conference on Web and Social Media, volume 17, pages 614–625.
  71. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, 26.
  72. Omid Mohamad Beigi and Mohammad H. Moattar. 2021. Automatic construction of domain-specific sentiment lexicon for unsupervised domain adaptation and sentiment classification. Knowledge-Based Systems, 213:106423.
  73. Moral framing and ideological bias of news. In Social Informatics: 12th International Conference, SocInfo 2020, Pisa, Italy, October 6–9, 2020, Proceedings 12, pages 206–219. Springer.
  74. Moralization in social networks and the emergence of violence during protests. Nature Human Behaviour, 2(6):389–396.
  75. Public Participation in Crisis Policymaking. How 30,000 Dutch Citizens Advised Their Government on Relaxing COVID-19 Lockdown Measures. PLoS ONE, 16(5):1–42.
  76. StereoSet: Measuring stereotypical bias in pretrained language models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, ACL ’21, pages 5356–5371, Online. Association for Computational Linguistics.
  77. Pansy Nandwani and Rupali Verma. 2021. A review on sentiment analysis and emotion detection from text. Social Network Analysis and Mining, 11(1):81.
  78. Morality Classification in Natural Language Text. IEEE Transactions on Affective Computing, 3045(c):1–8.
  79. Linguistic inquiry and word count: Liwc 2001. Mahway: Lawrence Erlbaum Associates, 71(2001):2001.
  80. Elicitation of Situated Values: Need for Tools to Help Stakeholders and Designers to Reflect and Communicate. Ethics and Information Technology, 14(4):285–303.
  81. Development and Validation of the Personal Values Dictionary: A Theory-Driven Tool for Investigating References to Basic Human Values in Text. European Journal of Personality, 34(5):885–902.
  82. " more than words": Linking music preferences and moral values through lyrics. ISMIR.
  83. Deconfounded Lexicon Induction for Interpretable Social Science. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL ’18, pages 1615–1625, New Orleans, Louisiana, USA.
  84. ValueNet: A New Dataset for Human Value Driven Dialogue System. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI ’22, pages 11183–11191.
  85. Mohammad Sadegh Rasooli and Joel R. Tetreault. 2015. Yara parser: A fast and accurate dependency parser. Computing Research Repository, arXiv:1503.06733. Version 2.
  86. Enhancing the Measurement of Social Effects by Capturing Morality. In Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 35–45, Minneapolis, Minnesota, USA.
  87. "Why Should I Trust You?": Explaining the Predictions of Any Classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, pages 1135–1144.
  88. Research Priorities for Robust and Beneficial Artificial Intelligence. AI Magazine, 36(4):105–114.
  89. Eyal Sagi and Morteza Dehghani. 2014. Measuring moral rhetoric in text. Social science computer review, 32(2):132–144.
  90. Chelsea Schein. 2020. The Importance of Context in Moral Judgments. Perspectives on Psychological Science, 15(2):207–215.
  91. Personality, gender, and age in the language of social media: The open-vocabulary approach. PloS one, 8(9):e73791.
  92. Shalom H. Schwartz. 2012. An Overview of the Schwartz Theory of Basic Values. Online readings in Psychology and Culture, 2(1):1–20.
  93. Estimating Value Preferences in a Hybrid Participatory System. In HHAI2022: Augmenting Human Intellect, pages 114–127, Amsterdam, the Netherlands. IOS Press.
  94. Moral rhetoric in discrete choice models: a natural language processing approach. Quality & Quantity, pages 1–28.
  95. On the Machine Learning of Ethical Judgments from Natural Language. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL ’22, pages 769–779, Seattle, USA.
  96. Mario Triola. 2017. Elementary Statistics, 13th edition. Pearsons.
  97. COVID-19 and Changing Values. In Values for a Post-Pandemic Future, pages 23–58. Springer International Publishing.
  98. Francisco Vargas and Ryan Cotterell. 2020. Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP ’20, pages 2902–2913.
  99. Covfee: an extensible web framework for continuous-time annotation of human behavior. In Understanding Social Behavior in Dyadic and Small Group Interactions, Proceedings of Machine Learning Research, pages 265–293. PMLR.
  100. Desiderata for delivering NLP to accelerate healthcare AI advancement and a Mayo Clinic NLP-as-a-service implementation. npj Digital Medicine, 2(130):1–7.
  101. Garrett Wilson and Diane J. Cook. 2020. A Survey of Unsupervised Deep Domain Adaptation. ACM Transactions on Intelligent Systems and Technology, 11(5).
  102. Building and Validating Hierarchical Lexicons with a Case Study on Personal Values. In Proceedings of the 10th International Conference on Social Informatics, SocInfo ’18, pages 455–470, St. Petersburg, Russia. Springer.
  103. Fangzhao Wu and Yongfeng Huang. 2016. Sentiment domain adaptation with multiple sources. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL ’16, pages 301–310, Berlin, Germany. Association for Computational Linguistics.
  104. Exsum: From local explanations to model understanding. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL ’22, pages 5359–5378, Seattle, USA. Association for Computational Linguistics.
  105. Moral Narratives Around the Vaccination Debate on Facebook. Proceedings of the ACM Web Conference 2023.
  106. Moral Foundations Twitter Corpus: A Collection of 35k Tweets Annotated for Moral Sentiment. Sage Journals.
  107. The Moral Foundations Reddit Corpus. arXiv preprint arXiv:2208.05545.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Vjosa Preniqi (4 papers)
  2. Iacopo Ghinassi (5 papers)
  3. Kyriaki Kalimeri (32 papers)
  4. Charalampos Saitis (27 papers)
  5. Julia Ive (25 papers)
Citations (2)