NeuroDB: A Neural Network Framework for Answering Range Aggregate Queries and Beyond

07/10/2021
by   Sepanta Zeighami, et al.
0

Range aggregate queries (RAQs) are an integral part of many real-world applications, where, often, fast and approximate answers for the queries are desired. Recent work has studied answering RAQs using machine learning models, where a model of the data is learned to answer the queries. However, such modelling choices fail to utilize any query specific information. To capture such information, we observe that RAQs can be represented by query functions, which are functions that take a query instance (i.e., a specific RAQ) as an input and output its corresponding answer. Using this representation, we formulate the problem of learning to approximate the query function, and propose NeuroDB, a query specialized neural network framework, that answers RAQs efficiently. NeuroDB is query-type agnostic (i.e., it does not make any assumption about the underlying query type) and our observation that queries can be represented by functions is not specific to RAQs. Thus, we investigate whether NeuroDB can be used for other query types, by applying it to distance to nearest neighbour queries. We experimentally show that NeuroDB outperforms the state-of-the-art for this query type, often by orders of magnitude. Moreover, the same neural network architecture as for RAQs is used, bringing to light the possibility of using a generic framework to answer any query type efficiently.

READ FULL TEXT

page 1

page 4

page 9

page 10

page 11

page 16

research
11/20/2022

NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks

Range aggregate queries (RAQs) are an integral part of many real-world a...
research
03/08/2022

Aggregate Queries on Knowledge Graphs: Fast Approximation with Semantic-aware Sampling

A knowledge graph (KG) manages large-scale and real-world facts as a big...
research
06/06/2022

On Efficient Approximate Queries over Machine Learning Models

The question of answering queries over ML predictions has been gaining a...
research
06/15/2022

In-Range Farthest Point Queries and Related Problem in High Dimensions

Range-aggregate query is an important type of queries with numerous appl...
research
08/16/2018

PUG: A Framework and Practical Implementation for Why & Why-Not Provenance (extended version)

Explaining why an answer is (or is not) returned by a query is important...
research
09/05/2019

Random Sampling for Group-By Queries

Random sampling has been widely used in approximate query processing on ...
research
01/22/2018

Smoke: Fine-grained Lineage at Interactive Speed

Data lineage describes the relationship between individual input and out...

Please sign up or login with your details

Forgot password? Click here to reset