Evaluating Query Languages and Systems for High-Energy Physics Data

04/26/2021
by Dan Graur, et al.

In the domain of high-energy physics (HEP), query languages in general and SQL in particular have found limited acceptance. This is surprising since HEP data analysis matches the SQL model well: the data is fully structured and queried using mostly standard operators. To gain insight into why this is the case, we perform a comprehensive analysis of six diverse, general-purpose data processing platforms using an HEP benchmark. The result of the evaluation is an interesting and rather complex picture of existing solutions: their query languages vary greatly in how naturally and concisely HEP query patterns can be expressed. Furthermore, most of them are also between one and two orders of magnitude slower than the domain-specific system used by particle physicists today. These observations suggest that, while database systems and their query languages are in principle viable tools for HEP, significant work remains to make them relevant to HEP researchers.
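
To make the claim that HEP analysis fits the SQL model more concrete, here is a minimal sketch of one typical query pattern: select events that contain at least two jets above a transverse-momentum threshold and return their missing transverse energy. It is an illustration only and is not taken from the paper or its benchmark. The table names (`events`, `jets`), column names (`event_id`, `met`, `pt`), the threshold, and the toy data are all assumptions, and the flat two-table layout is a simplification, since real HEP data usually stores particles as nested per-event collections.

```python
import sqlite3


def build_toy_database():
    """Create an in-memory database with hypothetical event and jet tables."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (event_id INTEGER PRIMARY KEY, met REAL)")
    conn.execute("CREATE TABLE jets (event_id INTEGER, pt REAL)")
    # Toy values, invented for illustration only.
    conn.executemany(
        "INSERT INTO events VALUES (?, ?)",
        [(1, 12.5), (2, 30.0), (3, 7.0)],
    )
    conn.executemany(
        "INSERT INTO jets VALUES (?, ?)",
        [
            (1, 55.0), (1, 42.0), (1, 10.0),  # event 1: two jets above 40
            (2, 80.0), (2, 20.0),             # event 2: one jet above 40
            (3, 45.0), (3, 41.0), (3, 60.0),  # event 3: three jets above 40
        ],
    )
    return conn


def met_of_multijet_events(conn, min_pt=40.0, min_jets=2):
    """Return (event_id, met) for events with at least min_jets jets above min_pt."""
    query = """
        SELECT e.event_id, e.met
        FROM events AS e
        JOIN jets AS j ON j.event_id = e.event_id
        WHERE j.pt > ?
        GROUP BY e.event_id, e.met
        HAVING COUNT(*) >= ?
        ORDER BY e.event_id
    """
    return conn.execute(query, (min_pt, min_jets)).fetchall()


if __name__ == "__main__":
    conn = build_toy_database()
    print(met_of_multijet_events(conn))
```

The whole selection is expressed with a join, a filter, a grouping, and an aggregate, all standard relational operators. The join is needed here only because the sketch flattens jets into their own table; how a language handles the nested form of the same data is one place where conciseness can differ between systems.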
