Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models (2408.11261v1)

Published 21 Aug 2024 in cs.AI and cs.CL

Abstract: Large Vision-LLMs (LVLMs) have shown significant capability in vision-language understanding. However, one critical issue that persists in these models is sycophancy, which means models are unduly influenced by leading or deceptive prompts, resulting in biased outputs and hallucinations. Despite the progress in LVLMs, evaluating and mitigating sycophancy is yet much under-explored. In this work, we fill this gap by systematically analyzing sycophancy on various VL benchmarks with curated leading queries and further proposing a text contrastive decoding method for mitigation. While the specific sycophantic behavior varies significantly among models, our analysis reveals the severe deficiency of all LVLMs in resilience of sycophancy across various tasks. For improvement, we propose Leading Query Contrastive Decoding (LQCD), a model-agnostic method focusing on calibrating the LVLMs' over-reliance on leading cues by identifying and suppressing the probabilities of sycophancy tokens at the decoding stage. Extensive experiments show that LQCD effectively mitigate sycophancy, outperforming both prompt engineering methods and common methods for hallucination mitigation. We further demonstrate that LQCD does not hurt but even slightly improves LVLMs' responses to neutral queries, suggesting it being a more effective strategy for general-purpose decoding but not limited to sycophancy.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Yunpu Zhao (4 papers)
  2. Rui Zhang (1140 papers)
  3. Junbin Xiao (24 papers)
  4. Changxin Ke (4 papers)
  5. Ruibo Hou (6 papers)
  6. Yifan Hao (28 papers)
  7. Qi Guo (237 papers)
  8. Yunji Chen (51 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.