A Pilot Study for Chinese SQL Semantic Parsing

09/29/2019
by   Qingkai Min, et al.
0

The task of semantic parsing is highly useful for dialogue and question answering systems. Many datasets have been proposed to map natural language text into SQL, among which the recent Spider dataset provides cross-domain samples with multiple tables and complex queries. We build a Spider dataset for Chinese, which is currently a low-resource language in this task area. Interesting research questions arise from the uniqueness of the language, which requires word segmentation, and also from the fact that SQL keywords and columns of DB tables are typically written in English. We compare character- and word-based encoders for a semantic parser, and different embedding schemes. Results show that word-based semantic parser is subject to segmentation errors and cross-lingual word embeddings are useful for text-to-SQL.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2018

Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task

We present Spider, a large-scale, complex and cross-domain semantic pars...
research
10/05/2020

A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese

Semantic parsing is an important NLP task. However, Vietnamese is a low-...
research
10/23/2019

AnnaParser: Semantic Parsing for Tabular Data Analysis

This paper presents a novel approach to translating natural language que...
research
10/23/2019

A Hybrid Semantic Parsing Approach for Tabular Data Analysis

This paper presents a novel approach to translating natural language que...
research
08/26/2022

SeSQL: Yet Another Large-scale Session-level Chinese Text-to-SQL Dataset

As the first session-level Chinese dataset, CHASE contains two separate ...
research
06/08/2021

Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface

A natural language database interface (NLDB) can democratize data-driven...
research
01/03/2023

Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge

In this paper, we study the problem of knowledge-intensive text-to-SQL, ...

Please sign up or login with your details

Forgot password? Click here to reset