Revisiting Query Performance in GPU Database Systems

02/01/2023
by   Jiashen Cao, et al.
0

GPUs offer massive compute parallelism and high-bandwidth memory accesses. GPU database systems seek to exploit those capabilities to accelerate data analytics. Although modern GPUs have more resources (e.g., higher DRAM bandwidth) than ever before, judicious choices for query processing that avoid wasteful resource allocations are still advantageous. Database systems can save GPU runtime costs through just-enough resource allocation or improve query throughput with concurrent query processing by leveraging new GPU capabilities, such as Multi-Instance GPU (MIG). In this paper we do a cross-stack performance and resource utilization analysis of five GPU database systems. We study both database-level and micro-architectural aspects, and offer recommendations to database developers. We also demonstrate how to use and extend the traditional roofline model to identify GPU resource bottlenecks. This enables users to conduct what-if analysis to forecast performance impact for different resource allocation or the degree of concurrency. Our methodology addresses a key user pain point in selecting optimal configurations by removing the need to do exhaustive testing for a multitude of resource configurations.

READ FULL TEXT
research
06/27/2019

State-of-the-Art on Query Transaction Processing Acceleration

The vast amount of processing power and memory bandwidth provided by mod...
research
03/02/2020

A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics (Extended Version)

There has been significant amount of excitement and recent work on GPU-b...
research
05/05/2020

Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters

While prior researches focus on CPU-based microservices, they are not ap...
research
08/19/2022

Resource Allocation in Serverless Query Processing

Data lakes hold a growing amount of cold data that is infrequently acces...
research
08/13/2018

Database Operations in D4M.jl

Each step in the data analytics pipeline is important, including databas...
research
02/21/2018

Analyzing Real-Time Multimedia Content From Network Cameras: Using CPUs and GPUs in the Cloud

Millions of network cameras are streaming real-time multimedia content (...
research
12/16/2021

Predictive Price-Performance Optimization for Serverless Query Processing

We present an efficient, parametric modeling framework for predictive re...

Please sign up or login with your details

Forgot password? Click here to reset