Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers

10/29/2022
by Abhijeet Awasthi, et al.

Text-to-SQL parsers typically struggle with databases unseen during training. Adapting parsers to new databases is challenging because natural language queries are lacking for the new schemas. We present ReFill, a framework for synthesizing high-quality and textually diverse parallel datasets for adapting a Text-to-SQL parser to a target schema. ReFill learns to retrieve and edit text queries from existing schemas and transfer them to the target schema. We show that retrieving diverse existing text, masking its schema-specific tokens, and refilling with tokens relevant to the target schema leads to significantly more diverse text queries than standard SQL-to-Text generation methods achieve. Through experiments spanning multiple databases, we demonstrate that fine-tuning parsers on datasets synthesized using ReFill consistently outperforms prior data-augmentation methods.
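
The mask-and-refill idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: ReFill learns both the retrieval and the refilling step, whereas the hypothetical helpers below (`mask_schema_tokens`, `refill`) use plain string matching and a hand-supplied list of target-schema terms, purely to show the shape of the transformation.

```python
import re


def mask_schema_tokens(text, schema_terms, mask="[MASK]"):
    """Replace whole-word, case-insensitive matches of schema terms with a mask.

    Assumption: schema-specific tokens can be found by exact string match
    against table/column names. A learned model would do this in practice.
    """
    if not schema_terms:
        return text
    # Try longer terms first so multi-word terms win over their substrings.
    terms = sorted(schema_terms, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(mask, text)


def refill(masked_text, fillers, mask="[MASK]"):
    """Fill masks left to right with terms from the target schema.

    Assumption: the fillers are given; choosing them is the learned part
    of the real method and is not modeled here.
    """
    out = masked_text
    for filler in fillers:
        out = out.replace(mask, filler, 1)
    return out


# A text query written against a hypothetical source schema about singers.
source_text = "List the names of singers older than 30"
masked = mask_schema_tokens(source_text, ["singers", "names"])
# Transfer the masked template to a hypothetical target schema about movies.
transferred = refill(masked, ["titles", "movies"])
print(masked)
print(transferred)
```

The point of the sketch is only that the sentence structure of the retrieved text survives the transfer while its schema-specific vocabulary is swapped out, which is where the claimed diversity gain over direct SQL-to-Text generation comes from.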

