On Repairing Natural Language to SQL Queries (2310.03866v1)

Published 5 Oct 2023 in cs.DB and cs.SE

Abstract: Data analysts use SQL queries to access and manipulate data on their databases. However, these queries are often challenging to write, and small mistakes can lead to unexpected data output. Recent work has explored several ways to automatically synthesize queries based on a user-provided specification. One promising technique called text-to-SQL consists of the user providing a natural language description of the intended behavior and the database's schema. Even though text-to-SQL tools are becoming more accurate, there are still many instances where they fail to produce the correct query. In this paper, we analyze when text-to-SQL tools fail to return the correct query and show that it is often the case that the returned query is close to a correct query. We propose to repair these failing queries using a mutation-based approach that is agnostic to the text-to-SQL tool being used. We evaluate our approach on two recent text-to-SQL tools, RAT-SQL and SmBoP, and show that our approach can repair a significant number of failing queries.
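To make the idea of mutation-based repair concrete, here is a minimal sketch in Python. It is not the paper's implementation. The abstract does not say which mutations are applied or how a candidate is judged correct, so everything here is an assumption made for illustration: the mutation groups in `GROUPS`, the helper names `mutants`, `run`, and `repair`, and the use of expected output rows as the correctness check. The sketch takes the query returned by any text-to-SQL tool, swaps one token at a time for a near-equivalent, and returns the first candidate whose result matches the expected rows.

```python
import re
import sqlite3

# Hypothetical mutation groups: a token may be swapped for any other member
# of its group. The paper's actual mutation operators are not given here.
GROUPS = [
    ["<", "<=", ">", ">=", "=", "!="],
    ["MIN", "MAX", "AVG", "SUM", "COUNT"],
    ["ASC", "DESC"],
]

# Lossless tokenizer: joining the tokens reproduces the original query.
TOKEN_RE = re.compile(r"[A-Za-z_][A-Za-z_0-9]*|<=|>=|!=|[<>=]|\s+|.")


def mutants(query):
    """Yield every query that differs from `query` by one token swap."""
    tokens = TOKEN_RE.findall(query)
    for i, tok in enumerate(tokens):
        for group in GROUPS:
            if tok.upper() in group:
                for alt in group:
                    if alt != tok.upper():
                        yield "".join(tokens[:i] + [alt] + tokens[i + 1:])


def run(conn, query):
    """Execute a query; return its rows in a stable order, or None on error."""
    try:
        return sorted(conn.execute(query).fetchall(), key=repr)
    except sqlite3.Error:
        return None


def repair(conn, query, expected_rows):
    """Return the query, or the first single-token mutant, whose output
    matches `expected_rows` (an assumed oracle); None if nothing matches."""
    target = sorted(expected_rows, key=repr)
    if run(conn, query) == target:
        return query
    for candidate in mutants(query):
        if run(conn, candidate) == target:
            return candidate
    return None


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE emp (name TEXT, age INTEGER)")
    conn.executemany(
        "INSERT INTO emp VALUES (?, ?)",
        [("ann", 25), ("bob", 30), ("cy", 41)],
    )
    # A near-miss query: the intended comparison was >=, not >.
    print(repair(conn, "SELECT name FROM emp WHERE age > 30",
                 [("bob",), ("cy",)]))
```

Because the search only reads the tool's final query string, it does not depend on which text-to-SQL tool produced that query, which is the sense in which such an approach can be tool-agnostic.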

Authors (8)

Aidan Z. H. Yang
Ricardo Brancas
Pedro Esteves
Sofia Aparicio
Joao Pedro Nadkarni
Miguel Terra-Neves
Vasco Manquinho
Ruben Martins
