Byte-Pair Encoding for Text-to-SQL Generation

10/20/2019
by   Samuel Müller, et al.
0

Neural sequence-to-sequence models provide a competitive approach to the task of mapping a question in natural language to an SQL query, also referred to as text-to-SQL generation. The Byte-Pair Encoding algorithm (BPE) has previously been used to improve machine translation (MT) between natural languages. In this work, we adapt BPE for text-to-SQL generation. As the datasets for this task are rather small compared to MT, we present a novel stopping criterion that prevents overfitting the BPE encoding to the training set. Additionally, we present AST BPE, which is a version of BPE that uses the Abstract Syntax Tree (AST) of the SQL statement to guide BPE merges and therefore produce BPE encodings that generalize better. We improved the accuracy of a strong attentive seq2seq baseline on five out of six English text-to-SQL tasks while reducing training time by more than 50 targets. Finally, on two of these tasks we exceeded previously reported accuracies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2023

Divide and Prompt: Chain of Thought Prompting for Text-to-SQL

Chain-of-thought (CoT) prompting combined with large language models (LL...
research
12/19/2022

MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL

Conversational text-to-SQL is designed to translate multi-turn natural l...
research
03/14/2022

S^2SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers

The task of converting a natural language question into an executable SQ...
research
09/14/2018

SQL-to-Text Generation with Graph-to-Sequence Model

Previous work approaches the SQL-to-text generation task using vanilla S...
research
12/21/2020

MT-Teql: Evaluating and Augmenting Consistency of Text-to-SQL Models with Metamorphic Testing

Text-to-SQL is a task to generate SQL queries from human utterances. How...
research
01/12/2023

On the Structural Generalization in Text-to-SQL

Exploring the generalization of a text-to-SQL parser is essential for a ...
research
04/26/2019

One-Shot Learning for Text-to-SQL Generation

Most deep learning approaches for text-to-SQL generation are limited to ...

Please sign up or login with your details

Forgot password? Click here to reset