A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese

10/05/2020
by Anh Tuan Nguyen, et al.

Semantic parsing is an important NLP task. However, Vietnamese is a low-resource language in this research area. In this paper, we present the first public large-scale Text-to-SQL semantic parsing dataset for Vietnamese. We extend and evaluate two strong semantic parsing baselines, EditSQL (Zhang et al., 2019) and IRNet (Guo et al., 2019), on our dataset. We compare the two baselines under key configurations and find that: automatic Vietnamese word segmentation improves the parsing results of both baselines; the normalized pointwise mutual information (NPMI) score (Bouma, 2009) is useful for schema linking; latent syntactic features extracted from a neural dependency parser for Vietnamese also improve the results; and the monolingual language model PhoBERT for Vietnamese (Nguyen and Nguyen, 2020) yields higher performance than the recent best multilingual language model XLM-R (Conneau et al., 2020).
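
For readers unfamiliar with the NPMI score mentioned above, the following is a minimal sketch of how it could be computed between question tokens and schema items. It uses the standard definition, NPMI(x, y) = PMI(x, y) / (-log p(x, y)), with probabilities estimated from co-occurrence counts over training examples. The function name `npmi_scores`, the toy data, and the idea of estimating counts per example are assumptions made for illustration; this is not the authors' implementation, and the abstract does not describe how the score is plugged into EditSQL or IRNet.

```python
import math
from collections import Counter
from itertools import product


def npmi_scores(examples):
    """Compute NPMI for every (token, schema item) pair that co-occurs.

    `examples` is a list of (question_tokens, schema_items) pairs.
    Probabilities are estimated as the fraction of examples in which
    a token, a schema item, or both appear (an illustrative assumption).
    """
    n = len(examples)
    token_count = Counter()
    item_count = Counter()
    joint_count = Counter()
    for tokens, items in examples:
        token_set, item_set = set(tokens), set(items)
        token_count.update(token_set)
        item_count.update(item_set)
        joint_count.update(product(token_set, item_set))

    scores = {}
    for (token, item), n_joint in joint_count.items():
        p_joint = n_joint / n
        if p_joint == 1.0:
            # Both appear in every example: NPMI is 0/0 here, so this
            # sketch sets it to 1.0 by convention.
            scores[(token, item)] = 1.0
            continue
        p_token = token_count[token] / n
        p_item = item_count[item] / n
        pmi = math.log(p_joint / (p_token * p_item))
        scores[(token, item)] = pmi / -math.log(p_joint)
    return scores


# Hypothetical toy data: word-segmented Vietnamese question tokens paired
# with the schema items used in the corresponding SQL query.
examples = [
    (["tên", "ca_sĩ"], ["singer.name"]),
    (["tuổi", "ca_sĩ"], ["singer.age"]),
    (["tên", "sân_vận_động"], ["stadium.name"]),
    (["sức_chứa", "sân_vận_động"], ["stadium.capacity"]),
]
scores = npmi_scores(examples)
```

NPMI lies in [-1, 1]: values near 1 indicate that a token and a schema item almost always appear together, 0 indicates independence, and pairs that never co-occur (limit value -1) are simply absent from the returned dictionary in this sketch. A schema-linking step could then, for example, prefer the schema item with the highest score for each question token.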


research
09/29/2019

A Pilot Study for Chinese SQL Semantic Parsing

The task of semantic parsing is highly useful for dialogue and question ...
research
12/27/2022

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

Text-to-SQL semantic parsing is an important NLP task, which greatly fac...
research
05/12/2018

Backpropagating through Structured Argmax using a SPIGOT

We introduce the structured projection of intermediate gradients optimiz...
research
06/03/2021

The Limitations of Limited Context for Constituency Parsing

Incorporating syntax into neural approaches in NLP has a multitude of pr...
research
08/12/2021

Kicktionary-LOME: A Domain-Specific Multilingual Frame Semantic Parsing Model for Football Language

This technical report introduces an adapted version of the LOME frame se...
research
10/07/2020

Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing

Task-oriented semantic parsing is a critical component of virtual assist...
research
03/17/2019

Technical notes: Syntax-aware Representation Learning With Pointer Networks

This is a work-in-progress report, which aims to share preliminary resul...
