Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL (2407.08273v2)

Published 11 Jul 2024 in cs.CL

Abstract: LLMs with in-context learning have significantly improved the performance of text-to-SQL task. Previous works generally focus on using exclusive SQL generation prompt to improve the LLMs' reasoning ability. However, they are mostly hard to handle large databases with numerous tables and columns, and usually ignore the significance of pre-processing database and extracting valuable information for more efficient prompt engineering. Based on above analysis, we propose RB-SQL, a novel retrieval-based LLM framework for in-context prompt engineering, which consists of three modules that retrieve concise tables and columns as schema, and targeted examples for in-context learning. Experiment results demonstrate that our model achieves better performance than several competitive baselines on public datasets BIRD and Spider.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Zhenhe Wu (4 papers)
  2. Zhongqiu Li (2 papers)
  3. Jie Zhang (846 papers)
  4. Mengxiang Li (3 papers)
  5. Yu Zhao (207 papers)
  6. Ruiyu Fang (2 papers)
  7. Zhongjiang He (11 papers)
  8. Xuelong Li (268 papers)
  9. Zhoujun Li (122 papers)
  10. Shuangyong Song (18 papers)
X Twitter Logo Streamline Icon: https://streamlinehq.com