Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation

05/20/2019
by   Jiaqi Guo, et al.
0

We present a neural approach called IRNet for complex and cross-domain Text-to-SQL. IRNet aims to address two challenges: 1) the mismatch between intents expressed in natural language (NL) and the implementation details in SQL; 2) the challenge in predicting columns caused by the large number of out-of-domain words. Instead of end-to-end synthesizing a SQL query, IRNet decomposes the synthesis process into three phases. In the first phase, IRNet performs a schema linking over a question and a database schema. Then, IRNet adopts a grammar-based neural model to synthesize a SemQL query which is an intermediate representation that we design to bridge NL and SQL. Finally, IRNet deterministically infers a SQL query from the synthesized SemQL query with domain knowledge. On the challenging Text-to-SQL benchmark Spider, IRNet achieves 46.7 state-of-the-art approaches. At the time of writing, IRNet achieves the first position on the Spider leaderboard.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2020

RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

Text-to-SQL is the problem of converting a user question into an SQL que...
research
06/02/2021

LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations

This work aims to tackle the challenging heterogeneous graph encoding pr...
research
09/11/2021

Natural SQL: Making SQL Easier to Infer from Natural Language Specifications

Addressing the mismatch between natural language descriptions and the co...
research
05/29/2020

ValueNet: A Neural Text-to-SQL Architecture Incorporating Values

Building natural language interfaces for databases has been a long-stand...
research
06/17/2021

End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task

In this work, we focus on two crucial components in the cross-domain tex...
research
08/01/2023

Adapt and Decompose: Efficient Generalization of Text-to-SQL via Domain Adapted Least-To-Most Prompting

Cross-domain and cross-compositional generalization of Text-to-SQL seman...
research
04/26/2023

A Case-Based Reasoning Framework for Adaptive Prompting in Cross-Domain Text-to-SQL

Recent advancements in Large Language Models (LLMs), such as Codex, Chat...

Please sign up or login with your details

Forgot password? Click here to reset