Implementing Window Functions in a Column-Store with Late Materialization (Extended Version)

08/06/2022
by   Nadezhda Mukhaleva, et al.
0

A window function is a generalization of the aggregation operation. Unlike aggregation, the cardinality of its output is always the same as the cardinality of input. That is, the semantics of this operator imply computing values for extra attributes for each row, depending on its context, either expressed by a sliding window or a previously evaluated row. Window functions are a very powerful tool, which is also popular among data analysts and supported by the majority of industrial DBMSes. It allows to gracefully express quite complex use-cases, such as running sums and averages, local maximum and minimum, and different types of ranking. Since they can be expressed without self-joins and correlated subqueries, their evaluation can be performed much more efficiently. In this paper we discuss an implementation of window functions inside a disk-based column-store with late materialization. Late materialization is a technique that aims to keep tuple reconstruction back from individual columns as long as possible. Initially popular in the late 00's, it is rarely considered nowadays. However, in case of window functions it allows to substantially lower memory footprint. Another contribution of this paper is the application of a segment tree to computing RANGE-based window functions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

Support Aggregate Analytic Window Function over Large Data by Spilling

Analytic function, also called window function, is to query the aggregat...
research
03/15/2021

Smoothness of Schatten Norms and Sliding-Window Matrix Streams

Large matrices are often accessed as a row-order stream. We consider the...
research
09/29/2020

In-Order Sliding-Window Aggregation in Worst-Case Constant Time

Sliding-window aggregation is a widely-used approach for extracting insi...
research
12/01/2018

VATE: a trade-off between memory and preserving time for high accuracy cardinalities estimation under sliding time window

Host cardinality is one of the important attributes in the field of netw...
research
10/26/2018

Sub-O(log n) Out-of-Order Sliding-Window Aggregation

Sliding-window aggregation summarizes the most recent information in a d...
research
10/14/2020

Continuous window functions for NFFT

In this paper, we study the error behavior of the nonequispaced fast Fou...
research
04/08/2020

A Comparative Analysis of Knowledge Graph Query Performance

As Knowledge Graphs (KGs) continue to gain widespread momentum for use i...

Please sign up or login with your details

Forgot password? Click here to reset