Large Language Model Enhanced Text-to-SQL Generation: A Survey (2410.06011v1)
Abstract: Text-to-SQL translates natural language queries into Structured Query Language (SQL) commands, enabling users to interact with databases using natural language. Essentially, the text-to-SQL task is a text generation task, and its development is primarily dependent on changes in LLMs. Especially with the rapid development of LLMs, the pattern of text-to-SQL has undergone significant changes. Existing survey work mainly focuses on rule-based and neural-based approaches, but it still lacks a survey of Text-to-SQL with LLMs. In this paper, we survey the LLM enhanced text-to-SQL generations, classifying them into prompt engineering, fine-tuning, pre-trained, and Agent groups according to training strategies. We also summarize datasets and evaluation metrics comprehensively. This survey could help people better understand the pattern, research status, and challenges of LLM-based text-to-SQL generations.
- Xiaohu Zhu (2 papers)
- Qian Li (236 papers)
- Lizhen Cui (66 papers)
- Yongkang Liu (35 papers)