Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation (1906.00363v2)

Published 2 Jun 2019 in cs.AI and cs.CL

Abstract: Introducing common sense to natural language understanding systems has received increasing research attention. It remains a fundamental question on how to evaluate whether a system has a sense making capability. Existing benchmarks measures commonsense knowledge indirectly and without explanation. In this paper, we release a benchmark to directly test whether a system can differentiate natural language statements that make sense from those that do not make sense. In addition, a system is asked to identify the most crucial reason why a statement does not make sense. We evaluate models trained over large-scale LLMing tasks as well as human performance, showing that there are different challenges for system sense making.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Cunxiang Wang (30 papers)
  2. Shuailong Liang (6 papers)
  3. Yue Zhang (618 papers)
  4. Xiaonan Li (48 papers)
  5. Tian Gao (57 papers)
Citations (109)