RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL (2407.08273v2)

Published 11 Jul 2024 in cs.CL

Abstract: LLMs with in-context learning have significantly improved the performance of text-to-SQL task. Previous works generally focus on using exclusive SQL generation prompt to improve the LLMs' reasoning ability. However, they are mostly hard to handle large databases with numerous tables and columns, and usually ignore the significance of pre-processing database and extracting valuable information for more efficient prompt engineering. Based on above analysis, we propose RB-SQL, a novel retrieval-based LLM framework for in-context prompt engineering, which consists of three modules that retrieve concise tables and columns as schema, and targeted examples for in-context learning. Experiment results demonstrate that our model achieves better performance than several competitive baselines on public datasets BIRD and Spider.

PDF HTML Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (10)

Zhenhe Wu (4 papers)
Zhongqiu Li (2 papers)
Jie Zhang (846 papers)
Mengxiang Li (3 papers)
Yu Zhao (207 papers)
Ruiyu Fang (2 papers)
Zhongjiang He (11 papers)
Xuelong Li (268 papers)
Zhoujun Li (122 papers)
Shuangyong Song (18 papers)

Tweets

https://twitter.com/gm8xx8/status/1811595737433563312

RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL (2407.08273v2)

Related Papers

Tweets