From Stability to Inconsistency: A Study of Moral Preferences in LLMs (2504.06324v1)

Published 8 Apr 2025 in cs.CY and cs.AI

Abstract: As LLMs increasingly integrate into our daily lives, it becomes crucial to understand their implicit biases and moral tendencies. To address this, we introduce a Moral Foundations LLM dataset (MFD-LLM) grounded in Moral Foundations Theory, which conceptualizes human morality through six core foundations. We propose a novel evaluation method that captures the full spectrum of LLMs' revealed moral preferences by answering a range of real-world moral dilemmas. Our findings reveal that state-of-the-art models have remarkably homogeneous value preferences, yet demonstrate a lack of consistency.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Authors (4)

Tweets

https://twitter.com/WGOV/status/1910285280214397313

From Stability to Inconsistency: A Study of Moral Preferences in LLMs (2504.06324v1)

Summary

Follow-up Questions

Related Papers

Authors (4)

Tweets