TAPA: A Scalable Task-Parallel Dataflow Programming Framework for Modern FPGAs with Co-Optimization of HLS and Physical Design

09/06/2022
by   Licheng Guo, et al.

In this paper, we propose TAPA, an end-to-end framework that compiles a C++ task-parallel dataflow program into a high-frequency FPGA accelerator. Compared to existing solutions, TAPA has two major advantages. First, TAPA provides a set of convenient APIs that allow users to easily express flexible and complex inter-task communication structures. Second, TAPA adopts a coarse-grained floorplanning step during HLS compilation for accurate pipelining of potential critical paths. In addition, TAPA implements several optimization techniques specifically tailored for modern HBM-based FPGAs. In our experiments with a total of 43 designs, we improve the average frequency from 147 MHz to 297 MHz (a 102% improvement) with no loss of throughput and a negligible change in resource utilization. Notably, in 16 experiments, designs that were originally unroutable achieve 274 MHz on average. The framework is available at https://github.com/UCLA-VAST/tapa and the core floorplan module is available at https://github.com/UCLA-VAST/AutoBridge.
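To make the term "task-parallel dataflow program" concrete, the following is a minimal software-only sketch in standard C++. It is not TAPA's API: `Channel`, `Produce`, `Square`, `Consume`, and `RunPipeline` are hypothetical names chosen for illustration, and the threads and bounded queues only mimic, on a CPU, the idea of concurrent tasks connected by FIFO streams.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>

// Hypothetical bounded FIFO channel; a software stand-in for a stream
// between two tasks. Not part of TAPA.
template <typename T>
class Channel {
 public:
  explicit Channel(std::size_t capacity) : capacity_(capacity) {}

  // Blocks while the channel is full (back-pressure on the producer).
  void Write(T value) {
    std::unique_lock<std::mutex> lock(mu_);
    not_full_.wait(lock, [this] { return queue_.size() < capacity_; });
    queue_.push(std::move(value));
    not_empty_.notify_one();
  }

  // Blocks while the channel is empty.
  T Read() {
    std::unique_lock<std::mutex> lock(mu_);
    not_empty_.wait(lock, [this] { return !queue_.empty(); });
    T value = std::move(queue_.front());
    queue_.pop();
    not_full_.notify_one();
    return value;
  }

 private:
  std::size_t capacity_;
  std::queue<T> queue_;
  std::mutex mu_;
  std::condition_variable not_full_;
  std::condition_variable not_empty_;
};

// Task 1: emits the integers 0..n-1.
void Produce(Channel<int>& out, int n) {
  for (int i = 0; i < n; ++i) out.Write(i);
}

// Task 2: squares each incoming value and forwards it.
void Square(Channel<int>& in, Channel<int>& out, int n) {
  for (int i = 0; i < n; ++i) {
    int v = in.Read();
    out.Write(v * v);
  }
}

// Task 3: accumulates the n incoming values.
long long Consume(Channel<int>& in, int n) {
  long long sum = 0;
  for (int i = 0; i < n; ++i) sum += in.Read();
  return sum;
}

// Top level: instantiates the channels and runs the three tasks concurrently.
long long RunPipeline(int n) {
  assert(n >= 0);
  Channel<int> a(2), b(2);  // small depth, so tasks must overlap in time
  long long sum = 0;
  std::thread t1(Produce, std::ref(a), n);
  std::thread t2(Square, std::ref(a), std::ref(b), n);
  std::thread t3([&b, &sum, n] { sum = Consume(b, n); });
  t1.join();
  t2.join();
  t3.join();
  return sum;
}
```

Calling `RunPipeline(4)` streams 0, 1, 2, 3 through the squaring task and returns 14. The point of the sketch is the structure: each task is an ordinary function, communication happens only through explicit channels, and a top-level function wires tasks together. This is the general programming style the abstract refers to, shown here under the stated assumptions rather than as the framework's actual interface.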


