Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation (1804.08207v2)

Published 23 Apr 2018 in cs.CL

Abstract: We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our collection as the DNC: Diverse Natural Language Inference Collection. The DNC is available online at https://www.decomp.net, and will grow over time as additional resources are recast and added from novel sources.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Adam Poliak (17 papers)
  2. Aparajita Haldar (8 papers)
  3. Rachel Rudinger (46 papers)
  4. J. Edward Hu (5 papers)
  5. Ellie Pavlick (66 papers)
  6. Aaron Steven White (29 papers)
  7. Benjamin Van Durme (173 papers)