Integration of Skyline Queries into Spark SQL

10/07/2022
by Lukas Grasmann, et al.

Skyline queries are frequently used in data analytics and multi-criteria decision support applications to filter relevant information from large amounts of data. Apache Spark is a popular framework for processing big, distributed data. The framework also provides a convenient SQL-like interface via the Spark SQL module. However, skyline queries are not natively supported and require tedious rewriting to fit the SQL standard or Spark's SQL-like language. The goal of our work is to fill this gap. We thus provide a full-fledged integration of the skyline operator into Spark SQL. This allows skyline queries to be expressed in a simple, easy-to-use syntax. Moreover, our empirical results show that this integrated solution by far outperforms a solution based on rewriting into standard SQL.
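
To illustrate the kind of rewriting the abstract refers to, the sketch below expresses a two-dimensional skyline in plain SQL with a `NOT EXISTS` subquery. This is one common formulation and not necessarily the rewriting evaluated by the authors. It runs against SQLite rather than Spark so that it stays self-contained; the `hotels` table, its columns, and the sample rows are hypothetical, and lower values are assumed to be better in both dimensions.

```python
import sqlite3

# Hypothetical sample data: (name, price, distance).
# Assumption: lower is better in both dimensions.
HOTELS = [
    ("A", 50, 8),
    ("B", 80, 2),
    ("C", 60, 9),   # dominated by A (more expensive and farther away)
    ("D", 100, 1),
    ("E", 80, 3),   # dominated by B (same price, farther away)
]

# Plain-SQL skyline: keep a row if no other row is at least as good in
# every dimension and strictly better in at least one dimension.
SKYLINE_SQL = """
SELECT h.name
FROM hotels AS h
WHERE NOT EXISTS (
    SELECT 1
    FROM hotels AS o
    WHERE o.price <= h.price
      AND o.distance <= h.distance
      AND (o.price < h.price OR o.distance < h.distance)
)
ORDER BY h.name
"""


def skyline_names(rows):
    """Load rows into an in-memory table and return the skyline hotel names."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE hotels (name TEXT, price INTEGER, distance INTEGER)"
        )
        conn.executemany("INSERT INTO hotels VALUES (?, ?, ?)", rows)
        return [name for (name,) in conn.execute(SKYLINE_SQL)]
    finally:
        conn.close()


if __name__ == "__main__":
    print(skyline_names(HOTELS))
```

Even with only two dimensions, the dominance condition has to be written out by hand, and it grows with every additional dimension. This is the overhead that a dedicated skyline syntax is meant to remove.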
