Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL (2406.10593v2)

Published 15 Jun 2024 in cs.AI, cs.DB, and cs.IR

Abstract: Fine-tuning LLMs for specific domain tasks has achieved great success in Text-to-SQL tasks. However, these fine-tuned models often face challenges with multi-turn Text-to-SQL tasks caused by ambiguous or unanswerable questions. It is desired to enhance LLMs to handle multiple types of questions in multi-turn Text-to-SQL tasks. To address this, we propose a novel data augmentation method, called QDA-SQL, which generates multiple types of multi-turn Q&A pairs using LLMs. In QDA-SQL, we introduce a method incorporating validation and correction mechanisms to handle complex multi-turn Text-to-SQL tasks. Experimental results demonstrate that QDA-SQL enables fine-tuned models to exhibit higher performance on SQL statement accuracy and enhances their ability to handle complex, unanswerable questions in multi-turn Text-to-SQL tasks. The generation script and test set are released at https://github.com/mcxiaoxiao/QDA-SQL

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Yinggang Sun (4 papers)
  2. Ziming Guo (3 papers)
  3. Haining Yu (7 papers)
  4. Chuanyi Liu (12 papers)
  5. Xiang Li (1002 papers)
  6. Bingxuan Wang (10 papers)
  7. Xiangzhan Yu (7 papers)
  8. Tiancheng Zhao (48 papers)
Citations (1)
X Twitter Logo Streamline Icon: https://streamlinehq.com