
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models (2402.01117v1)

Published 2 Feb 2024 in cs.CL, cs.DB, and cs.HC

Abstract: Leading models for the text-to-SQL task heavily rely on proprietary LLMs, posing concerns over data privacy. Closing the performance gap between small open-source models and large proprietary models is crucial to mitigate this reliance. To this end, we introduce a novel two-stage fine-tuning approach that decomposes the task into two simpler tasks. Through comprehensive evaluation on two large cross-domain datasets and two small LLMs, we show that this approach improves execution accuracy by 3 to 7 percent, effectively aligning the performance of open-source models with their proprietary counterparts.

Authors (2)
  1. Mohammadreza Pourreza
  2. Davood Rafiei
Citations (16)