Natural language to SQL in low-code platforms

08/29/2023
by   Sofia Aparicio, et al.
0

One of the developers' biggest challenges in low-code platforms is retrieving data from a database using SQL queries. Here, we propose a pipeline allowing developers to write natural language (NL) to retrieve data. In this study, we collect, label, and validate data covering the SQL queries most often performed by OutSystems users. We use that data to train a NL model that generates SQL. Alongside this, we describe the entire pipeline, which comprises a feedback loop that allows us to quickly collect production data and use it to retrain our SQL generation model. Using crowd-sourcing, we collect 26k NL and SQL pairs and obtain an additional 1k pairs from production data. Finally, we develop a UI that allows developers to input a NL query in a prompt and receive a user-friendly representation of the resulting SQL query. We use A/B testing to compare four different models in production and observe a 240 terms of adoption of the feature, 220 decrease in failure rate when compared against the first model that we put into production, showcasing the effectiveness of our pipeline in continuously improving our feature.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2022

CodexDB: Generating Code for Processing SQL Queries using GPT-3 Codex

CodexDB is an SQL processing engine whose internals can be customized vi...
research
08/31/2017

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning

A significant amount of the world's knowledge is stored in relational da...
research
09/13/2021

SPARQLing Database Queries from Intermediate Question Decompositions

To translate natural language questions into executable database queries...
research
12/12/2021

Weakly Supervised Mapping of Natural Language to SQL through Question Decomposition

Natural Language Interfaces to Databases (NLIDBs), where users pose quer...
research
01/24/2023

Studying the Characteristics of SQL-related Development Tasks: An Empirical Study

A key function of a software system is its ability to facilitate the man...
research
03/15/2023

Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization

We present Mirror, an open-source platform for data exploration and anal...
research
01/06/2022

On the Prevalence, Impact, and Evolution of SQL Code Smells in Data-Intensive Systems

Code smells indicate software design problems that harm software quality...

Please sign up or login with your details

Forgot password? Click here to reset