STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing

10/21/2022
by   Zefeng Cai, et al.
0

In this paper, we propose a novel SQL guided pre-training framework STAR for context-dependent text-to-SQL parsing, which leverages contextual information to enrich natural language (NL) utterance and table schema representations for text-to-SQL conversations. Concretely, we propose two novel pre-training objectives which respectively explore the context-dependent interactions of NL utterances and SQL queries within each text-to-SQL conversation: (i) schema state tracking (SST) objective that tracks and explores the schema states of context-dependent SQL queries in the form of schema-states by predicting and updating the value of each schema slot during interaction; (ii) utterance dependency tracking (UDT) objective that employs weighted contrastive learning to pull together two semantically similar NL utterances and push away the representations of semantically dissimilar NL utterances within each conversation. In addition, we construct a high-quality large-scale context-dependent text-to-SQL conversation corpus to pre-train STAR. Extensive experiments show that STAR achieves new state-of-the-art performance on two downstream benchmarks (SParC and CoSQL), significantly outperforming previous pre-training methods and ranking first on the leaderboard. We believe the release of the constructed corpus, codebase and pre-trained STAR checkpoints would push forward the research in this area. For reproducibility, we release our code and data at https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/star.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2019

X-SQL: reinforce schema representation with context

In this work, we present X-SQL, a new network architecture for the probl...
research
12/18/2020

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Most recently, there has been significant interest in learning contextua...
research
12/09/2020

Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing

The task of multi-turn text-to-SQL semantic parsing aims to translate na...
research
09/14/2022

SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers

This paper aims to improve the performance of text-to-SQL parsing by exp...
research
12/17/2022

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

Recently, there has been increasing interest in synthesizing data to imp...
research
11/18/2021

SDCUP: Schema Dependency-Enhanced Curriculum Pre-Training for Table Semantic Parsing

Recently pre-training models have significantly improved the performance...
research
09/19/2019

A Split-and-Recombine Approach for Follow-up Query Analysis

Context-dependent semantic parsing has proven to be an important yet cha...

Please sign up or login with your details

Forgot password? Click here to reset