Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness (2301.08881v2)

Published 21 Jan 2023 in cs.CL

Abstract: Neural text-to-SQL models have achieved remarkable performance in translating natural language questions into SQL queries. However, recent studies reveal that text-to-SQL models are vulnerable to task-specific perturbations. Previous curated robustness test sets usually focus on individual phenomena. In this paper, we propose a comprehensive robustness benchmark based on Spider, a cross-domain text-to-SQL benchmark, to diagnose the model robustness. We design 17 perturbations on databases, natural language questions, and SQL queries to measure the robustness from different angles. In order to collect more diversified natural question perturbations, we utilize large pretrained language models (PLMs) to simulate human behaviors in creating natural questions. We conduct a diagnostic study of the state-of-the-art models on the robustness set. Experimental results reveal that even the most robust model suffers from a 14.0% performance drop overall and a 50.7% performance drop on the most challenging perturbation. We also present a breakdown analysis regarding text-to-SQL model designs and provide insights for improving model robustness.

Authors (16)
  1. Shuaichen Chang (12 papers)
  2. Jun Wang (991 papers)
  3. Mingwen Dong (6 papers)
  4. Lin Pan (23 papers)
  5. Henghui Zhu (24 papers)
  6. Alexander Hanbo Li (17 papers)
  7. Wuwei Lan (12 papers)
  8. Sheng Zhang (212 papers)
  9. Jiarong Jiang (8 papers)
  10. Joseph Lilien (2 papers)
  11. Steve Ash (1 paper)
  12. William Yang Wang (254 papers)
  13. Zhiguo Wang (100 papers)
  14. Vittorio Castelli (24 papers)
  15. Patrick Ng (29 papers)
  16. Bing Xiang (74 papers)
Citations (28)