ValueDCG: Measuring Comprehensive Human Value Understanding Ability of Language Models (2310.00378v4)

Published 30 Sep 2023 in cs.CL, cs.AI, and cs.CY

Abstract: Personal values are a crucial factor behind human decision-making. Because LLMs have been shown to significantly influence human decisions, it is essential that they accurately understand human values to ensure their safety. However, evaluating their grasp of these values is difficult because values are intricate and adaptable. We argue that truly understanding values requires an LLM to grasp both "know what" and "know why". To this end, we present a comprehensive evaluation metric, ValueDCG (Value Discriminator-Critique Gap), which quantitatively assesses both aspects through an engineering implementation. We assess four representative LLMs and provide compelling evidence that the growth rates of LLMs' "know what" and "know why" capabilities do not align with increases in parameter count, resulting in a decline in the models' capacity to understand human values as the number of parameters grows. This may further suggest that LLMs might craft plausible explanations based on the provided context without truly understanding their inherent value, indicating potential risks.

Authors (4)
  1. Zhaowei Zhang (25 papers)
  2. Fengshuo Bai (11 papers)
  3. Jun Gao (267 papers)
  4. Yaodong Yang (169 papers)
Citations (1)