Self-supervised Text-to-SQL Learning with Header Alignment Training

03/11/2021
by   Donggyu Kim, et al.
0

Since we can leverage a large amount of unlabeled data without any human supervision to train a model and transfer the knowledge to target tasks, self-supervised learning is a de-facto component for the recent success of deep learning in various fields. However, in many cases, there is a discrepancy between a self-supervised learning objective and a task-specific objective. In order to tackle such discrepancy in Text-to-SQL task, we propose a novel self-supervised learning framework. We utilize the task-specific properties of Text-to-SQL task and the underlying structures of table contents to train the models to learn useful knowledge of the header-column alignment task from unlabeled table data. We are able to transfer the knowledge to the supervised Text-to-SQL training with annotated samples, so that the model can leverage the knowledge to better perform the header-span alignment task to predict SQL statements. Experimental results show that our self-supervised learning framework significantly improves the performance of the existing strong BERT based models without using large external corpora. In particular, our method is effective for training the model with scarce labeled data. The source code of this work is available in GitHub.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2018

Boosting Self-Supervised Learning via Knowledge Transfer

In self-supervised learning, one trains a model to solve a so-called pre...
research
10/11/2022

CSS: Combining Self-training and Self-supervised Learning for Few-shot Dialogue State Tracking

Few-shot dialogue state tracking (DST) is a realistic problem that train...
research
03/17/2023

Data-Centric Learning from Unlabeled Graphs with Diffusion Model

Graph property prediction tasks are important and numerous. While each t...
research
10/29/2020

Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection

Most existing approaches to disfluency detection heavily rely on human-a...
research
07/27/2021

Combining Probabilistic Logic and Deep Learning for Self-Supervised Learning

Deep learning has proven effective for various application tasks, but it...
research
10/24/2020

Structure-Grounded Pretraining for Text-to-SQL

Learning to capture text-table alignment is essential for table related ...
research
12/18/2020

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Most recently, there has been significant interest in learning contextua...

Please sign up or login with your details

Forgot password? Click here to reset