LUXOR: An FPGA Logic Cell Architecture for Efficient Compressor Tree Implementations

03/06/2020
by   Seyedramin Rasoulinezhad, et al.
0

We propose two tiers of modifications to FPGA logic cell architecture to deliver a variety of performance and utilization benefits with only minor area overheads. In the irst tier, we augment existing commercial logic cell datapaths with a 6-input XOR gate in order to improve the expressiveness of each element, while maintaining backward compatibility. This new architecture is vendor-agnostic, and we refer to it as LUXOR. We also consider a secondary tier of vendor-speciic modifications to both Xilinx and Intel FPGAs, which we refer to as X-LUXOR+ and I-LUXOR+ respectively. We demonstrate that compressor tree synthesis using generalized parallel counters (GPCs) is further improved with the proposed modifications. Using both the Intel adaptive logic module and the Xilinx slice at the 65nm technology node for a comparative study, it is shown that the silicon area overhead is less than 0.5 LUXOR+, while the delay increments are 1-6 demonstrate that LUXOR can deliver an average reduction of 13-19 utilization on micro-benchmarks from a variety of domains.BNN benchmarks benefit the most with an average reduction of 37-47 which is due to the highly-efficient mapping of the XnorPopcount operation on our proposed LUXOR+ logic cells.

READ FULL TEXT

page 2

page 9

page 10

research
04/17/2022

A Novel ASIC Design Flow using Weight-Tunable Binary Neurons as Standard Cells

In this paper, we describe a design of a mixed signal circuit for a bina...
research
07/16/2020

Area- Efficient VLSI Implementation of Serial-In Parallel-Out Multiplier Using Polynomial Representation in Finite Field GF(2m)

Finite field multiplier is mainly used in elliptic curve cryptography, e...
research
02/27/2020

MajorityNets: BNNs Utilising Approximate Popcount for Improved Efficiency

Binarized neural networks (BNNs) have shown exciting potential for utili...
research
12/20/2018

Efficient logic architecture in training gradient boosting decision tree for high-performance and edge computing

This study proposes a logic architecture for the high-speed and power ef...
research
09/02/2020

CONTRA: Area-Constrained Technology Mapping Framework For Memristive Memory Processing Unit

Data-intensive applications are poised to benefit directly from processi...
research
12/04/2021

Logic Shrinkage: Learned FPGA Netlist Sparsity for Efficient Neural Network Inference

FPGA-specific DNN architectures using the native LUTs as independently t...
research
08/16/2021

High-Resolution Waveform Capture Device on a Cyclone-V FPGA

We introduce the waveform capture device (WCD), a flexible measurement s...

Please sign up or login with your details

Forgot password? Click here to reset