Approximate Query Processing using Deep Generative Models

Data is generated at an unprecedented rate surpassing our ability to analyze them. One viable solution that was pioneered by the database community is Approximate Query Processing (AQP). AQP seeks to provide approximate answers to queries in a fraction of time needed for computing exact answers. This is often achieved by running the query on a pre-computed or on-demand derived sample and generating estimates for the entire dataset based on the result. In this work, we explore a novel approach for AQP utilizing deep learning (DL). We use deep generative models, an unsupervised learning based approach, to learn the data distribution faithfully in a compact manner (typically few hundred KBs). Queries could be answered approximately by generating samples from the learned model. This approach eliminates the dependency of AQP to a sample of fixed size and allows us to satisfy arbitrary accuracy requirements by generating as many samples as needed very fast. While we specifically focus on variational autoencoders (VAE), we demonstrate how our approach could also be used for other popular DL models such as generative adversarial networks (GAN) and deep Bayesian networks (DBN). Our other contributions include (a) identifying model bias and minimizing it through a rejection sampling based approach (b) An algorithm to build model ensembles for AQP for improved accuracy and (c) an analysis of VAE latent space to understand its suitability to AQP. Our extensive experiments show that deep learning is a very promising approach for AQP.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2017

On Unifying Deep Generative Models

Deep generative models have achieved impressive success in recent years....
research
03/17/2020

Characterizing and Avoiding Problematic Global Optima of Variational Autoencoders

Variational Auto-encoders (VAEs) are deep generative latent variable mod...
research
11/14/2017

Conditional Autoencoders with Adversarial Information Factorization

Generative models, such as variational auto-encoders (VAE) and generativ...
research
12/08/2018

Counterfactuals uncover the modular structure of deep generative models

Deep generative models such as Generative Adversarial Networks (GANs) an...
research
06/14/2019

DeepSPACE: Approximate Geospatial Query Processing with Deep Learning

The amount of the available geospatial data grows at an ever faster pace...
research
02/06/2023

Variational Information Pursuit for Interpretable Predictions

There is a growing interest in the machine learning community in develop...
research
12/20/2022

Approximate Query Processing via Tuple Bubbles

We propose a versatile approach to lightweight, approximate query proces...

Please sign up or login with your details

Forgot password? Click here to reset