Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL (2010.12634v1)

Published 23 Oct 2020 in cs.CL

Abstract: Neural models have achieved significant results on the text-to-SQL task, in which most current work assumes all the input questions are legal and generates a SQL query for any input. However, in the real scenario, users can input any text that may not be able to be answered by a SQL query. In this work, we propose TriageSQL, the first cross-domain text-to-SQL question intention classification benchmark that requires models to distinguish four types of unanswerable questions from answerable questions. The baseline RoBERTa model achieves a 60% F1 score on the test set, demonstrating the need for further improvement on this task. Our dataset is available at https://github.com/chatc/TriageSQL.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yusen Zhang (30 papers)
  2. Xiangyu Dong (17 papers)
  3. Shuaichen Chang (12 papers)
  4. Tao Yu (282 papers)
  5. Peng Shi (80 papers)
  6. Rui Zhang (1138 papers)
Citations (14)

Summary

We haven't generated a summary for this paper yet.