Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm (2402.10671v3)

Published 16 Feb 2024 in cs.CL

Abstract: In-context learning with large language models (LLMs) has achieved remarkable success in natural language processing, yet extensive case studies reveal that single-step chain-of-thought prompting suffers from attention diffusion and inadequate performance on complex tasks such as text-to-SQL. To improve the in-context learning capabilities of LLMs for text-to-SQL, a workflow paradigm is proposed that enhances the attention and problem-solving scope of LLMs through decomposition. Specifically, an information determination module that eliminates redundant information and a new prompt structure based on problem classification significantly sharpen the model's attention. In addition, self-correction and active learning modules expand the problem-solving scope of LLMs, raising the upper limit of LLM-based approaches. Extensive experiments on three datasets demonstrate that the approach outperforms other methods by a significant margin, yielding roughly 2-3 percentage point improvements over the existing baseline on the Spider Dev, Spider-Realistic, and Bird Dev datasets and new SOTA results on the Spider Test set. The code is available on GitHub: https://github.com/FlyingFeather/DEA-SQL
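
The abstract outlines a decomposed workflow: an information determination step that prunes redundant schema items, a classification-driven prompt structure, SQL generation, and self-correction (with active learning used to broaden the demonstrations). Below is a minimal sketch of such a pipeline; all function names, prompt wordings, and the ask_llm helper are illustrative assumptions rather than the authors' actual DEA-SQL implementation, which lives in the linked repository.

    # Hypothetical sketch of a workflow-paradigm text-to-SQL pipeline.
    # None of these names come from the DEA-SQL codebase; ask_llm stands
    # in for any chat-completion API the reader prefers.

    from typing import Callable, Dict, List

    AskLLM = Callable[[str], str]  # prompt -> model completion

    def filter_schema(ask_llm: AskLLM, question: str,
                      schema: Dict[str, List[str]]) -> str:
        """Information determination: keep only the tables and columns the
        question needs, so redundant schema items do not diffuse attention."""
        return ask_llm(
            f"Schema: {schema}\nQuestion: {question}\n"
            "Return only the tables and columns required to answer it."
        )

    def classify_question(ask_llm: AskLLM, question: str) -> str:
        """Problem classification: route the question to a class-specific
        prompt (the class labels here are assumptions for illustration)."""
        return ask_llm(
            f"Question: {question}\nClassify as one of: EASY, JOIN, NESTED."
        ).strip()

    # Class-specific few-shot demonstrations; in the paper, an active
    # learning module helps extend coverage of hard cases.
    FEW_SHOT: Dict[str, str] = {"EASY": "...", "JOIN": "...", "NESTED": "..."}

    def generate_sql(ask_llm: AskLLM, question: str,
                     pruned_schema: str, q_class: str) -> str:
        """Generate SQL using the prompt structure chosen by the class."""
        return ask_llm(
            f"{FEW_SHOT.get(q_class, '')}\nSchema: {pruned_schema}\n"
            f"Question: {question}\nWrite the SQL query:"
        )

    def self_correct(ask_llm: AskLLM, question: str,
                     pruned_schema: str, sql: str) -> str:
        """Self-correction: ask the model to review its own SQL for common
        errors (wrong columns, missing joins) and return a fixed version."""
        return ask_llm(
            f"Schema: {pruned_schema}\nQuestion: {question}\nSQL: {sql}\n"
            "Check this query for mistakes and return a corrected version."
        )

    def text_to_sql(ask_llm: AskLLM, question: str,
                    schema: Dict[str, List[str]]) -> str:
        """Run the full decomposed workflow for one question."""
        pruned = filter_schema(ask_llm, question, schema)
        q_class = classify_question(ask_llm, question)
        sql = generate_sql(ask_llm, question, pruned, q_class)
        return self_correct(ask_llm, question, pruned, sql)

The point of the decomposition is that each LLM call sees only the information relevant to its sub-task, which is the attention-sharpening effect the abstract attributes to the workflow paradigm.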

Authors (10)
  1. Yuanzhen Xie
  2. Xinzhou Jin
  3. Tao Xie
  4. MingXiong Lin
  5. Liang Chen
  6. Chenyun Yu
  7. Lei Cheng
  8. Bo Hu
  9. Zang Li
  10. Chengxiang Zhuo
Citations (11)