Towards Enhancing Database Education: Natural Language Generation Meets Query Execution Plans

03/01/2021
by Weiguo Wang, et al.

The database systems course is offered as part of the undergraduate computer science degree program at many major universities. A key goal for learners taking such a course is to understand how SQL queries are processed in an RDBMS in practice. Since a query execution plan (QEP) describes the execution steps of a query, learners can acquire this understanding by perusing the QEPs generated by an RDBMS. Unfortunately, in practice, it is often daunting for a learner to comprehend these QEPs, which contain vendor-specific implementation details, and this hinders her learning process. In this paper, we present a novel, end-to-end, generic system called LANTERN that generates a natural language description of a QEP to facilitate understanding of the query execution steps. It takes as input an SQL query and its QEP, and generates a natural language description of the execution strategy deployed by the underlying RDBMS. Specifically, it deploys a declarative framework called POOL that enables subject matter experts to efficiently create and maintain natural language descriptions of the physical operators used in QEPs. A rule-based framework called RULE-LANTERN is proposed that exploits POOL to generate natural language descriptions of QEPs. Despite the high accuracy of RULE-LANTERN, our engagement with learners reveals that, consistent with existing psychology theories, perusing such rule-based descriptions leads to boredom due to repetitive statements across different QEPs. To address this issue, we present a novel deep learning-based language generation framework called NEURAL-LANTERN that infuses language variability into the generated description by exploiting a set of paraphrasing tools and word embeddings. Our experimental study with real learners shows the effectiveness of LANTERN in facilitating comprehension of QEPs.
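To make the rule-based idea concrete, the following is a minimal sketch of turning a plan tree into numbered natural language steps. It is not the RULE-LANTERN implementation and does not use POOL syntax, neither of which is described in this abstract. The `TEMPLATES` table, the `describe` and `narrate` functions, and the example plan are all hypothetical. The plan shape loosely follows the keys of PostgreSQL's `EXPLAIN (FORMAT JSON)` output, which is an assumption, since the abstract does not name a specific RDBMS.

```python
# Hypothetical per-operator templates (an assumption for illustration;
# this is neither POOL syntax nor LANTERN's actual rule set).
TEMPLATES = {
    "Seq Scan": "perform a sequential scan on table {relation}",
    "Index Scan": "perform an index scan on table {relation} using index {index}",
    "Hash": "build a hash table on the result",
    "Hash Join": "perform a hash join on condition {condition}",
    "Sort": "sort the result by {keys}",
}


def describe(node):
    """Return one description per operator, visiting children before parents."""
    steps = []
    for child in node.get("Plans", []):
        steps.extend(describe(child))
    # Fall back to a generic sentence for operators without a template.
    template = TEMPLATES.get(node["Node Type"], "apply the {node_type} operator")
    fields = {
        "relation": node.get("Relation Name", "?"),
        "index": node.get("Index Name", "?"),
        "condition": node.get("Hash Cond", "?"),
        "keys": ", ".join(node.get("Sort Key", [])) or "?",
        "node_type": node["Node Type"],
    }
    steps.append(template.format(**fields))
    return steps


def narrate(plan):
    """Number the steps in execution order."""
    return [f"Step {i}: {text}." for i, text in enumerate(describe(plan), 1)]


# Hypothetical plan for a join between two made-up tables.
EXAMPLE_PLAN = {
    "Node Type": "Hash Join",
    "Hash Cond": "(o.customer_id = c.id)",
    "Plans": [
        {"Node Type": "Seq Scan", "Relation Name": "orders"},
        {
            "Node Type": "Hash",
            "Plans": [{"Node Type": "Seq Scan", "Relation Name": "customers"}],
        },
    ],
}

if __name__ == "__main__":
    for line in narrate(EXAMPLE_PLAN):
        print(line)
```

Because every operator always maps to the same sentence, descriptions of different plans repeat the same phrasing, which is the kind of repetitiveness the abstract says motivates NEURAL-LANTERN.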


