LMFAO: An Engine for Batches of Group-By Aggregates

08/19/2020
by   Maximilian Schleich, et al.
0

LMFAO is an in-memory optimization and execution engine for large batches of group-by aggregates over joins. Such database workloads capture the data-intensive computation of a variety of data science applications. We demonstrate LMFAO for three popular models: ridge linear regression with batch gradient descent, decision trees with CART, and clustering with Rk-means.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2019

A Layered Aggregate Engine for Analytics Workloads

This paper introduces LMFAO (Layered Multiple Functional Aggregate Optim...
research
04/15/2022

An interpretable machine learning approach for ferroalloys consumptions

This paper is devoted to a practical method for ferroalloys consumption ...
research
10/29/2020

CoroBase: Coroutine-Oriented Main-Memory Database Engine

Data stalls are a major overhead in main-memory database engines due to ...
research
01/20/2022

Scalable k-d trees for distributed data

Data structures known as k-d trees have numerous applications in scienti...
research
10/10/2018

Equality Constrained Decision Trees: For the Algorithmic Enforcement of Group Fairness

Fairness, through its many forms and definitions, has become an importan...
research
09/12/2018

Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms

Gradient boosted decision trees (GBDTs) have seen widespread adoption in...
research
06/14/2020

Application of Data Science to Discover Violence-Related Issues in Iraq

Data science has been satisfactorily used to discover social issues in s...

Please sign up or login with your details

Forgot password? Click here to reset