2000 character limit reached
A Survey on Employing Large Language Models for Text-to-SQL Tasks (2407.15186v4)
Published 21 Jul 2024 in cs.CL
Abstract: The increasing volume of data in relational databases and the expertise needed for writing SQL queries pose challenges for users to access and analyze data. Text-to-SQL (Text2SQL) solves the issues by utilizing NLP techniques to convert natural language into SQL queries. With the development of LLMs, a range of LLM-based Text2SQL methods have emerged. This survey provides a comprehensive review of LLMs in Text2SQL tasks. We review benchmark datasets, prompt engineering methods, fine-tuning methods, and base models in LLM-based Text2SQL methods. We provide insights in each part and discuss future directions in this field.
- Liang Shi (45 papers)
- Zhengju Tang (2 papers)
- Zhi Yang (188 papers)
- Nan Zhang (144 papers)
- Xiaotong Zhang (28 papers)