An improved method of delta summation for faster current value selection across filtered subsets of interval and temporal relational data

11/10/2022
by   Derek Colley, et al.
0

Aggregation in relational databases is accomplished through hashing and sorting interval data, which is computationally expensive and scales poorly as the data volumes grow. In this paper, we show how quantitative interval and time-series data in relational attributes can be represented using delta summary values rather than absolute values. The need for sorting to determine the row corresponding to some maximum timestamp is negated, reducing the time complexity from at least O(n log(n)) towards O(n) and improving query execution times. We illustrate this new method in the relational algebra, present the implementation algorithmically, and test an implementation in two leading RDBMS products against the use of normal equivalents. We found this delta summation technique to be most effective for use cases with additive, numerical data upon which it is necessary to frequently obtain the latest values, and where the row cardinalities are in the order of 10^5. Our experiments found the proposed new delta summation technique could execute faster than the equivalent standard selection method by up to 22.4 the overall query cost in some circumstances by up to 24.0 by up to 60.6 increase in CPU time and memory allocation, uncertain performance with very low or very high cardinalities and inconsistent results across different RDBMS platforms.

READ FULL TEXT
research
09/29/2021

Relational Memory: Native In-Memory Accesses on Rows and Columns

Analytical database systems are typically designed to use a column-first...
research
01/24/2019

HRDBMS: Combining the Best of Modern and Traditional Relational Databases

HRDBMS is a novel distributed relational database that uses a hybrid mod...
research
03/09/2023

Belnap-Dunn logic and query answering in inconsistent databases with null values

This paper concerns an expansion of first-order Belnap-Dunn logic, named...
research
03/15/2021

Interpretable Feature Construction for Time Series Extrinsic Regression

Supervised learning of time series data has been extensively studied for...
research
07/03/2021

Recombinant Sort: N-Dimensional Cartesian Spaced Algorithm Designed from Synergetic Combination of Hashing, Bucket, Counting and Radix Sort

Sorting is an essential operation which is widely used and is fundamenta...
research
06/26/2017

Relational Algebra for In-Database Process Mining

The execution logs that are used for process mining in practice are ofte...
research
07/19/2023

VAPI: Vectorization of Algorithm for Performance Improvement

This study presents the vectorization of metaheuristic algorithms as the...

Please sign up or login with your details

Forgot password? Click here to reset