Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test (2402.02135v1)

Published 3 Feb 2024 in cs.CL and cs.AI

Abstract: This paper explores the moral judgment and moral reasoning abilities exhibited by LLMs across languages through the Defining Issues Test. It is a well known fact that moral judgment depends on the language in which the question is asked. We extend the work of beyond English, to 5 new languages (Chinese, Hindi, Russian, Spanish and Swahili), and probe three LLMs -- ChatGPT, GPT-4 and Llama2Chat-70B -- that shows substantial multilingual text processing and generation abilities. Our study shows that the moral reasoning ability for all models, as indicated by the post-conventional score, is substantially inferior for Hindi and Swahili, compared to Spanish, Russian, Chinese and English, while there is no clear trend for the performance of the latter four languages. The moral judgments too vary considerably by the language.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Aditi Khandelwal (8 papers)
  2. Utkarsh Agarwal (5 papers)
  3. Kumar Tanmay (10 papers)
  4. Monojit Choudhury (66 papers)
Citations (2)
X Twitter Logo Streamline Icon: https://streamlinehq.com