An Area-Efficient FPGA Overlay using DSP Block based Time-multiplexed Functional Units

06/21/2016
by   Xiangwei Li, et al.
0

Coarse grained overlay architectures improve FPGA design productivity by providing fast compilation and software-like programmability. Throughput oriented spatially configurable overlays typically suffer from area overheads due to the requirement of one functional unit for each compute kernel operation. Hence, these overlays have often been of limited size, supporting only relatively small compute kernels while consuming considerable FPGA resources. This paper examines the possibility of sharing the functional units among kernel operations for reducing area overheads. We propose a linear interconnected array of time-multiplexed FUs as an overlay architecture with reduced instruction storage and interconnect resource requirements, which uses a fully-pipelined, architecture-aware FU design supporting a fast context switching time. The results presented show a reduction of up to 85 resource requirements compared to existing throughput oriented overlay architectures, with an operating frequency which approaches the theoretical limit for the FPGA device.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2017

Resource-Aware Just-in-Time OpenCL Compiler for Coarse-Grained FPGA Overlays

FPGA vendors have recently started focusing on OpenCL for FPGAs because ...
research
06/21/2016

Reliability-Aware Overlay Architectures for FPGAs: Features and Design Challenges

The FPGA overlay architectures have been mainly proposed to improve desi...
research
03/18/2018

Towards an Area-Efficient Implementation of a High ILP EDGE Soft Processor

In-order scalar RISC architectures have been the dominant paradigm in FP...
research
01/31/2020

Automatic Generation of Application-Specific FPGA Overlays with RapidWright

Overlay architectures implemented on FPGA devices have been proposed as ...
research
11/11/2019

DRAB-LOCUS: An Area-Efficient AES Architecture for Hardware Accelerator Co-Location on FPGAs

Advanced Encryption Standard (AES) implementations on Field Programmable...
research
04/08/2023

BRAMAC: Compute-in-BRAM Architectures for Multiply-Accumulate on FPGAs

Deep neural network (DNN) inference using reduced integer precision has ...
research
12/06/2016

Memory Efficient Multi-Scale Line Detector Architecture for Retinal Blood Vessel Segmentation

This paper presents a memory efficient architecture that implements the ...

Please sign up or login with your details

Forgot password? Click here to reset