Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

08/30/2023
by   Michael Staniek, et al.
0

We present Text-to-OverpassQL, a task designed to facilitate a natural language interface for querying geodata from OpenStreetMap (OSM). The Overpass Query Language (OverpassQL) allows users to formulate complex database queries and is widely adopted in the OSM ecosystem. Generating Overpass queries from natural language input serves multiple use-cases. It enables novice users to utilize OverpassQL without prior knowledge, assists experienced users with crafting advanced queries, and enables tool-augmented large language models to access information stored in the OSM database. In order to assess the performance of current sequence generation models on this task, we propose OverpassNL, a dataset of 8,352 queries with corresponding natural language inputs. We further introduce task specific evaluation metrics and ground the evaluation of the Text-to-OverpassQL task by executing the queries against the OSM database. We establish strong baselines by finetuning sequence-to-sequence models and adapting large language models with in-context examples. The detailed evaluation reveals strengths and weaknesses of the considered learning strategies, laying the foundations for further research into the Text-to-OverpassQL task.

READ FULL TEXT
research
06/15/2023

From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management

Large language models have recently advanced the state of the art on man...
research
12/15/2020

Generation of complex database queries and API calls from natural language utterances

Generating queries corresponding to natural language questions is a long...
research
05/12/2023

Text2Cohort: Democratizing the NCI Imaging Data Commons with Natural Language Cohort Discovery

The Imaging Data Commons (IDC) is a cloud-based database that provides r...
research
05/09/2023

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

There is a rapidly growing number of large language models (LLMs) that u...
research
04/06/2022

Sub-Task Decomposition Enables Learning in Sequence to Sequence Tasks

The field of Natural Language Processing (NLP) has experienced a dramati...
research
03/15/2023

Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization

We present Mirror, an open-source platform for data exploration and anal...
research
02/16/2016

Contextual Media Retrieval Using Natural Language Queries

The widespread integration of cameras in hand-held and head-worn devices...

Please sign up or login with your details

Forgot password? Click here to reset