GPGPU Performance Estimation with Core and Memory Frequency Scaling

01/19/2017
by Qiang Wang, et al.

Graphics Processing Units (GPUs) support dynamic voltage and frequency scaling (DVFS) to balance computational performance and energy consumption. However, a simple and accurate method for estimating the performance of a given GPU kernel under different frequency settings on real hardware is still lacking, even though such estimates are important for choosing the best frequency configuration for energy saving. This paper presents a fine-grained model that estimates the execution time of GPU kernels under both core and memory frequency scaling. Across 12 GPU kernels and a 2.5x range of both core and memory frequencies, our model achieves accurate results (within 3.5%) on real hardware. Unlike cycle-level simulators, our model needs only some simple micro-benchmarks to extract a set of hardware parameters, together with the performance counters of the kernels, to achieve this accuracy.
