F-IVM: Analytics over Relational Databases under Updates

03/15/2023
by Ahmet Kara, et al.

This article describes F-IVM, a unified approach for maintaining analytics over changing relational data. We exemplify its versatility in four disciplines: processing queries with group-by aggregates and joins; learning linear regression models using the covariance matrix of the input features; building Chow-Liu trees using pairwise mutual information of the input features; and matrix chain multiplication. F-IVM has three main ingredients: higher-order incremental view maintenance, factorized computation, and ring abstraction. F-IVM reduces the maintenance of a task to that of a hierarchy of simple views. Such views are functions mapping keys, which are tuples of input values, to payloads, which are elements from a ring. F-IVM also supports efficient factorized computation over keys, payloads, and updates. Finally, F-IVM treats seemingly disparate tasks uniformly. In the key space, all tasks require joins and variable marginalization. In the payload space, tasks differ in the definition of the sum and product ring operations. We implemented F-IVM on top of DBToaster and show that it can outperform classical first-order and fully recursive higher-order incremental view maintenance by orders of magnitude while using less memory.
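
To make the key/payload split concrete, here is a minimal sketch in Python. It is not the DBToaster-based implementation; every name in it (`Ring`, `COUNT_RING`, `COUNT_SUM_RING`, `marginalize_last`, `join_then_sum_b`, `apply_delta`) is hypothetical and chosen only for illustration. It assumes two relations R(A, B) and S(B, C), represents each view as a dictionary from key tuples to ring payloads, and maintains the query Q(A) = SUM over B, C of R(A, B) * S(B, C) through one intermediate view V_S(B) that marginalizes C out of S.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, NamedTuple, Tuple


class Ring(NamedTuple):
    """Hypothetical ring description: additive identity, sum, product."""
    zero: Any
    add: Callable[[Any, Any], Any]
    mul: Callable[[Any, Any], Any]


# Integers with + and *: payloads act as tuple multiplicities (COUNT).
COUNT_RING = Ring(0, lambda a, b: a + b, lambda a, b: a * b)

# Pairs (count, sum): an illustrative ring for a SUM aggregate.
# The product mixes counts and sums so that sums scale with multiplicities.
COUNT_SUM_RING = Ring(
    (0, 0),
    lambda a, b: (a[0] + b[0], a[1] + b[1]),
    lambda a, b: (a[0] * b[0], a[0] * b[1] + b[0] * a[1]),
)

View = Dict[Tuple, Any]


def marginalize_last(view: View, ring: Ring) -> View:
    """Sum out the last key component, adding payloads with ring.add."""
    out = defaultdict(lambda: ring.zero)
    for key, payload in view.items():
        out[key[:-1]] = ring.add(out[key[:-1]], payload)
    return dict(out)


def join_then_sum_b(r: View, v_s: View, ring: Ring) -> View:
    """Join R(A, B) with V_S(B) on B, then marginalize B: result keyed by (A,)."""
    out: View = {}
    for (a, b), payload in r.items():
        if (b,) in v_s:
            term = ring.mul(payload, v_s[(b,)])
            out[(a,)] = ring.add(out.get((a,), ring.zero), term)
    return out


def apply_delta(view: View, delta: View, ring: Ring) -> None:
    """Add a delta view into a materialized view, in place."""
    for key, payload in delta.items():
        view[key] = ring.add(view.get(key, ring.zero), payload)
```

Under these assumptions, an update `d_r` to R is propagated as `apply_delta(q, join_then_sum_b(d_r, v_s, ring), ring)`, and an update `d_s` to S is first marginalized with `marginalize_last(d_s, ring)`, then joined with R. Neither path recomputes the join from scratch. Deletes are payloads with negative multiplicity. Swapping `COUNT_RING` for `COUNT_SUM_RING` changes only the payload arithmetic; the key-space code stays the same.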

research
07/05/2018

Incremental Relational Lenses

Lenses are a popular approach to bidirectional transformations, a genera...
research
12/12/2017

Incremental View Maintenance for Property Graph Queries

This paper discusses the challenges of incremental view maintenance for ...
research
06/19/2018

Reducing Property Graph Queries to Relational Algebra for Incremental View Maintenance

The property graph data model of modern graph database systems is increa...
research
05/23/2019

Conjunctive Queries with Theta Joins Under Updates

Modern application domains such as Composite Event Recognition (CER) and...
research
04/09/2018

Counting Triangles under Updates in Worst-Case Optimal Time

We consider the problem of incrementally maintaining the triangle count ...
research
06/01/2020

F-IVM: Learning over Fast-Evolving Relational Data

F-IVM is a system for real-time analytics such as machine learning appli...
research
06/20/2019

A Layered Aggregate Engine for Analytics Workloads

This paper introduces LMFAO (Layered Multiple Functional Aggregate Optim...
