External sorting is at the core of many operations in large-scale databa...
Cardinality estimation is one of the most fundamental and challenging
pr...
Learned index structures have been shown to achieve favorable lookup
per...
We introduce the RadixStringSpline (RSS) learned index structure for
eff...
In-memory join is an essential operator in any database engine. It has b...
Latest research proposes to replace existing index structures with learn...
LearnedSort is a novel sorting algorithm that, unlike traditional method...
In this work, we aim to study when learned models are better hash functi...
Data warehouses organize data in a columnar format to enable faster scan...
Video-based sensing from aerial drones, especially small multirotor dron...
Training high-accuracy object detection models requires large and divers...
Previous approaches to learned cardinality estimation have focused on
im...
Databases employ indexes to filter out irrelevant records, which reduces...
Current operating systems are complex systems that were designed before
...
Filtering data based on predicates is one of the most fundamental operat...
Recent advancements in learned index structures propose replacing existi...
Code similarity systems are integral to a range of applications from cod...
Bloom filters are space-efficient probabilistic data structures that are...
Capturing and processing video is increasingly common as cameras become
...
Pandemic measures such as social distancing and contact tracing can be
e...
Recent research has shown that learned models can outperform state-of-th...
Several programming languages use garbage collectors (GCs) to automatica...
Query optimization remains one of the most challenging problems in data
...
The simplified parse tree (SPT) presented in Aroma, a state-of-the-art c...
Automatic machine learning () is a family of techniques to automate the
...
Scanning and filtering over multi-dimensional tables are key operations ...
A groundswell of recent work has focused on improving data management sy...
Next-generation sequencing (NGS) technologies have enabled affordable
se...
Correctly detecting the semantic type of data columns is crucial for dat...
Researchers currently rely on ad hoc datasets to train automated
visuali...
Query optimization is one of the most challenging problems in database
s...
Machine learning (ML) techniques are enjoying rapidly increasing adoptio...
Cost-based query optimizers remain one of the most important components ...
We present Custodes: a new approach to solving the complex issue of
prev...
Distributed transactions on high-overhead TCP/IP-based networks were
con...
Visual representations of data (visualizations) are tools of great impor...
Over the past decades, researchers and ML practitioners have come up wit...
Data visualization should be accessible for all analysts with data, not ...
As machine learning systems become democratized, it becomes increasingly...
As machine learning (ML) systems become democratized, it becomes increas...
As machine learning (ML) systems become democratized, it becomes increas...
As neural networks become widely deployed in different applications and ...
Existing benchmarks for analytical database systems such as TPC-DS and T...
Index structures are one of the most important tools that DBAs leverage ...
Going deeper and wider in neural architectures improves the accuracy, wh...
Indexes are models: a B-Tree-Index can be seen as a model to map a key t...