Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CDConv: A Benchmark for Contradiction Detection in Chinese Conversations (2210.08511v1)

Published 16 Oct 2022 in cs.CL

Abstract: Dialogue contradiction is a critical issue in open-domain dialogue systems. The contextualization nature of conversations makes dialogue contradiction detection rather challenging. In this work, we propose a benchmark for Contradiction Detection in Chinese Conversations, namely CDConv. It contains 12K multi-turn conversations annotated with three typical contradiction categories: Intra-sentence Contradiction, Role Confusion, and History Contradiction. To efficiently construct the CDConv conversations, we devise a series of methods for automatic conversation generation, which simulate common user behaviors that trigger chatbots to make contradictions. We conduct careful manual quality screening of the constructed conversations and show that state-of-the-art Chinese chatbots can be easily goaded into making contradictions. Experiments on CDConv show that properly modeling contextual information is critical for dialogue contradiction detection, but there are still unresolved challenges that require future research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Chujie Zheng (35 papers)
  2. Jinfeng Zhou (15 papers)
  3. Yinhe Zheng (30 papers)
  4. Libiao Peng (6 papers)
  5. Zhen Guo (76 papers)
  6. Wenquan Wu (12 papers)
  7. Zhengyu Niu (4 papers)
  8. Hua Wu (191 papers)
  9. Minlie Huang (226 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.