Analytical Model of Memory-Bound Applications Compiled with High Level Synthesis

03/29/2020
by   Maria A. Dávila-Guzmán, et al.
0

The increasing demand of dedicated accelerators to improve energy efficiency and performance has highlighted FPGAs as a promising option to deliver both. However, programming FPGAs in hardware description languages requires long time and effort to achieve optimal results, which discourages many programmers from adopting this technology. High Level Synthesis tools improve the accessibility to FPGAs, but the optimization process is still time expensive due to the large compilation time, between minutes and days, required to generate a single bitstream. Whereas placing and routing take most of this time, the RTL pipeline and memory organization are known in seconds. This early information about the organization of the upcoming bitstream is enough to provide an accurate and fast performance model. This paper presents a performance analytical model for HLS designs focused on memory bound applications. With a careful analysis of the generated memory architecture and DRAM organization, the model predicts the execution time with a maximum error of 9.2 previous works, our predictions reduce on average at least 2× the estimation error.

READ FULL TEXT

page 1

page 7

research
03/24/2019

An Analytical Model for Performance and Lifetime Estimation of Hybrid DRAM-NVM Main Memories

NVMs have promising advantages (e.g., lower idle power, higher density) ...
research
08/04/2023

Automatic multi-dimensional pipelining for high-level synthesis of dataflow accelerators

In recent years, there has been a surging demand for edge computing of i...
research
08/29/2023

Best Memory Architecture Exploration under Parameters Variations accelerated with Machine Learning

The design of effective memory architecture is of utmost importance in m...
research
10/15/2019

Refresh Triggered Computation: Improving the Energy Efficiency of Convolutional Neural Network Accelerators

Recently, many studies proposed CNN accelerator architectures with custo...
research
02/03/2023

Enabling Relational Database Analytical Processing in Bulk-Bitwise Processing-In-Memory

Bulk-bitwise processing-in-memory (PIM), an emerging computational parad...
research
10/18/2020

Optimizing Memory Performance of Xilinx FPGAs under Vitis

Plenty of research efforts have been devoted to FPGA-based acceleration,...
research
12/22/2022

FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

Multi-die FPGAs are widely adopted to deploy large hardware accelerators...

Please sign up or login with your details

Forgot password? Click here to reset