SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation (2310.18376v4)
Abstract: In recent years, the task of text-to-SQL translation, which converts natural language questions into executable SQL queries, has gained significant attention for its potential to democratize data access. Despite its promise, challenges such as adapting to unseen databases and aligning natural language with SQL syntax have hindered widespread adoption. To overcome these issues, we introduce SQLformer, a novel Transformer architecture specifically crafted to perform text-to-SQL translation tasks. Our model predicts SQL queries as abstract syntax trees (ASTs) in an autoregressive way, incorporating structural inductive bias in the encoder and decoder layers. This bias, guided by database table and column selection, aids the decoder in generating SQL query ASTs represented as graphs in a Breadth-First Search canonical order. Our experiments demonstrate that SQLformer achieves state-of-the-art performance across six prominent text-to-SQL benchmarks.
- Translating synthetic natural language to database queries with a polyglot deep learning framework. Scientific Reports, 11(1).
- Global Reasoning over Database Structures for Text-to-SQL Parsing. ArXiv:1908.11214 [cs].
- An Encoder-Decoder Framework Translating Natural Language to Database Queries. ArXiv:1711.06061 [cs].
- SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL. ArXiv:2111.00653 [cs].
- Yitao Cai and Xiaojun Wan. 2020. IGSQL: Database schema interaction graph based neural model for context-dependent text-to-SQL generation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 6903–6912, Online. Association for Computational Linguistics.
- Star: Sql guided pre-training for context-dependent text-to-sql parsing.
- LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations. ArXiv:2106.01093 [cs].
- RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases. ArXiv:2004.03125 [cs].
- ELECTRA: Pre-training text encoders as discriminators rather than generators. In ICLR.
- Expanding the Scope of the ATIS Task: The ATIS-3 Corpus. In Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. ArXiv:1810.04805 [cs].
- Language to Logical Form with Neural Attention. ArXiv:1601.01280 [cs].
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ArXiv:2010.11929 [cs].
- Matthias Fey and Jan Eric Lenssen. 2019. Fast Graph Representation Learning with PyTorch Geometric. ArXiv:1903.02428 [cs, stat].
- Towards robustness of text-to-sql models against synonym substitution.
- Exploring underexplored limitations of cross-domain text-to-sql generalization.
- Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation. ArXiv:1905.08205 [cs].
- X-SQL: reinforce schema representation with context. ArXiv:1908.08113 [cs].
- The ATIS Spoken Language Systems Pilot Corpus. In Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24-27,1990.
- Dynamic hybrid relation network for cross-domain context-dependent semantic parsing.
- S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers. ArXiv:2203.06958 [cs].
- A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization. ArXiv:1902.01069 [cs].
- Graphix-t5: Mixing pre-trained transformers with graph-aware layers for text-to-sql parsing. arXiv preprint arXiv:2301.07507.
- RoBERTa: A Robustly Optimized BERT Pretraining Approach. ArXiv:1907.11692 [cs].
- PyTorch: An Imperative Style, High-Performance Deep Learning Library. ArXiv:1912.01703 [cs, stat].
- RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL. ArXiv:2205.06983 [cs].
- Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 101–108, Online. Association for Computational Linguistics.
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. ArXiv:1910.10683 [cs, stat].
- Ohad Rubin and Jonathan Berant. 2021. SmBoP: Semi-autoregressive bottom-up semantic parsing. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 311–324, Online. Association for Computational Linguistics.
- PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. ArXiv:2109.05093 [cs].
- Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? ArXiv:2010.12725 [cs].
- Learning contextual representations for semantic parsing with generation-augmented pre-training. Proceedings of the AAAI Conference on Artificial Intelligence, 35(15):13806–13814.
- Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 8372–8388, Online. Association for Computational Linguistics.
- Attention Is All You Need. ArXiv:1706.03762 [cs].
- Graph Attention Networks. ArXiv:1710.10903 [cs, stat].
- RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers. ArXiv:1911.04942 [cs].
- Relational Graph Attention Network for Aspect-based Sentiment Analysis. ArXiv:2004.12362 [cs].
- Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing. ArXiv:2206.14017 [cs].
- Transformers: State-of-the-Art Natural Language Processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 38–45, Online. Association for Computational Linguistics.
- Optimizing deeper transformers on small datasets. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 2089–2102, Online. Association for Computational Linguistics.
- SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning. ArXiv:1711.04436 [cs].
- SQLizer: query synthesis from natural language. Proceedings of the ACM on Programming Languages, 1(OOPSLA):63:1–63:26.
- Pengcheng Yin and Graham Neubig. 2017a. A Syntactic Neural Model for General-Purpose Code Generation. ArXiv:1704.01696 [cs].
- Pengcheng Yin and Graham Neubig. 2017b. A syntactic neural model for general-purpose code generation. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 440–450, Vancouver, Canada. Association for Computational Linguistics.
- TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data. ArXiv:2005.08314 [cs].
- Do Transformers Really Perform Bad for Graph Representation? ArXiv:2106.05234 [cs].
- GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models. ArXiv:1802.08773 [cs].
- TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation. ArXiv:1804.09769 [cs].
- GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. ArXiv:2009.13845 [cs].
- Cosql: A conversational text-to-sql challenge towards cross-domain natural language interfaces to databases.
- {SC}ore: Pre-training for context representation in conversational semantic parsing. In International Conference on Learning Representations.
- Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task. ArXiv:1809.08887 [cs].
- Sparc: Cross-domain semantic parsing in context.
- John M. Zelle and Raymond J. Mooney. 1996. Learning to parse database queries using inductive logic programming. In Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2, AAAI’96, pages 1050–1055, Portland, Oregon. AAAI Press.
- Luke S. Zettlemoyer and Michael Collins. 2012. Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars. ArXiv:1207.1420 [cs].
- Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions. ArXiv:1909.00786 [cs].
- Hie-sql: History information enhanced network for context-dependent text-to-sql semantic parsing.
- Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. ArXiv:1709.00103 [cs].
- Adrián Bazaga (10 papers)
- Pietro Liò (270 papers)
- Gos Micklem (7 papers)