The Query Translation Landscape: a Survey

10/07/2019
by   Mohamed Nadjib Mami, et al.
0

Whereas the availability of data has seen a manyfold increase in past years, its value can be only shown if the data variety is effectively tackled —one of the prominent Big Data challenges. The lack of data interoperability limits the potential of its collective use for novel applications. Achieving interoperability through the full transformation and integration of diverse data structures remains an ideal that is hard, if not impossible, to achieve. Instead, methods that can simultaneously interpret different types of data available in different data structures and formats have been explored. On the other hand, many query languages have been designed to enable users to interact with the data, from relational, to object-oriented, to hierarchical, to the multitude emerging NoSQL languages. Therefore, the interoperability issue could be solved not by enforcing physical data transformation, but by looking at techniques that are able to query heterogeneous sources using one uniform language. Both industry and research communities have been keen to develop such techniques, which require the translation of a chosen 'universal' query language to the various data model specific query languages that make the underlying data accessible. In this article, we survey more than forty query translation methods and tools for popular query languages, and classify them according to eight criteria. In particular, we study which query language is a most suitable candidate for that 'universal' query language. Further, the results enable us to discover the weakly addressed and unexplored translation paths, to discover gaps and to learn lessons that can benefit future research in the area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2018

Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources

Apache Calcite is a foundational software framework that provides query ...
research
09/25/2020

Towards a Natural Language Query Processing System

Tackling the information retrieval gap between non-technical database en...
research
03/29/2019

Data structures to represent sets of k-long DNA sequences

The analysis of biological sequencing data has been one of the biggest a...
research
07/11/2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges

We introduce our efforts towards building a universal neural machine tra...
research
06/01/2017

Using of heterogeneous corpora for training of an ASR system

The paper summarizes the development of the LVCSR system built as a part...
research
01/25/2018

A Comprehensive Survey of Watermarking Relational Databases Research

Watermarking and fingerprinting of relational databases are quite profic...
research
09/14/2022

The Impact of Model Transformation Language Features on Quality Properties of MTLs: A Study Protocol

Background: Dedicated model transformation languages are claimed to prov...

Please sign up or login with your details

Forgot password? Click here to reset