qBSA: Logic Design of a 32-bit Block-Skewed RSFQ Arithmetic Logic Unit

01/29/2020
by Souvik Kundu, et al.

Single flux quantum (SFQ) circuits are an attractive beyond-CMOS technology because they promise two orders of magnitude lower power at clock frequencies exceeding 25 GHz. However, every SFQ gate is clocked, creating very deep gate-level pipelines that are difficult to keep full, particularly for sequences that include data-dependent operations. This paper proposes to increase the throughput of SFQ pipelines by redesigning the datapath to accept and operate on the least-significant bits (LSBs) several clock cycles earlier than the more significant bits. This skewed datapath approach reduces the latency of the LSB side, whose results can be fed back earlier for use in subsequent data-dependent operations, increasing their throughput. In particular, we propose to group the bits into 4-bit blocks that are operated on concurrently and to create block-skewed datapath units for 32-bit operation. This skewed approach allows a subsequent data-dependent operation to start evaluating as soon as the first 4-bit block completes. Using this general approach, we develop a block-skewed MIPS-compatible 32-bit ALU. Our gate-level Verilog design improves the throughput of 32-bit data-dependent operations by 2x and 1.5x compared to previously proposed 4-bit bit-slice and 32-bit Ladner-Fischer ALUs, respectively.
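
The block-skewed idea in the abstract can be illustrated with a small behavioral sketch in Python. This is not the authors' Verilog design and does not model SFQ gates or clocking. It only shows a 32-bit addition processed as eight 4-bit blocks, least-significant block first, with the carry passed from block to block. All names (`skewed_add`, `dependent_chain_steps`, the abstract "step" unit) are assumptions made for illustration, and the step counts are an idealized toy model that is not meant to reproduce the 2x and 1.5x figures reported above.

```python
BLOCK_BITS = 4                    # block width taken from the abstract
NUM_BLOCKS = 32 // BLOCK_BITS     # eight blocks for a 32-bit word
MASK = (1 << BLOCK_BITS) - 1


def split_blocks(x):
    """Split a 32-bit value into 4-bit blocks, least-significant block first."""
    return [(x >> (BLOCK_BITS * i)) & MASK for i in range(NUM_BLOCKS)]


def join_blocks(blocks):
    """Reassemble 4-bit blocks (least-significant first) into one integer."""
    out = 0
    for i, b in enumerate(blocks):
        out |= (b & MASK) << (BLOCK_BITS * i)
    return out


def skewed_add(a, b):
    """Add two 32-bit values block by block, starting at the LSB block.

    Returns (result, ready), where ready[i] is the hypothetical step at
    which block i of the result becomes available. The final carry out
    is dropped, so the result wraps modulo 2**32.
    """
    carry = 0
    out, ready = [], []
    for step, (x, y) in enumerate(zip(split_blocks(a), split_blocks(b))):
        s = x + y + carry
        out.append(s & MASK)      # low 4 bits are this block's result
        carry = s >> BLOCK_BITS   # carry feeds the next, later block
        ready.append(step)
    return join_blocks(out), ready


def dependent_chain_steps(num_ops, skewed):
    """Idealized step count for a chain of dependent additions.

    Skewed: each operation starts one step after its predecessor, as soon
    as the predecessor's first block is done. Non-skewed: each operation
    waits for all blocks of its predecessor.
    """
    if skewed:
        return NUM_BLOCKS + (num_ops - 1)
    return NUM_BLOCKS * num_ops


if __name__ == "__main__":
    total, ready = skewed_add(0x12345678, 0x11111111)
    print(hex(total), ready)
    print(dependent_chain_steps(3, skewed=True),
          dependent_chain_steps(3, skewed=False))
```

In this toy model the least-significant block of a result is ready at step 0, so a dependent operation can consume it while the upper blocks are still being computed. That overlap is what the abstract describes as starting a subsequent operation as soon as the first 4-bit block completes.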


research
04/11/2020

Bit-Parallel Vector Composability for Neural Acceleration

Conventional neural accelerators rely on isolated self-sufficient functi...
research
02/13/2019

Fast Parallel Integer Adder in Binary Representation

An integer adder for integers in the binary representation is one of the...
research
02/14/2019

ERSFQ 8-bit Parallel Arithmetic Logic Unit

We have designed and tested a parallel 8-bit ERSFQ arithmetic logic unit...
research
05/01/2023

Ternary Instantaneous Noise-based Logic

One of the possible representations of three-valued instantaneous noise-...
research
02/27/2023

Novel Efficient Scalable QCA XOR and Full Adder Designs

Circuit design based on Quantum-dots Cellular Automata technology offers...
research
01/31/2019

Algorithmic counting of nonequivalent compact Huffman codes

It is known that the following five counting problems lead to the same i...
research
07/25/2017

Optimizing Scrubbing by Netlist Analysis for FPGA Configuration Bit Classification and Floorplanning

Existing scrubbing techniques for SEU mitigation on FPGAs do not guarant...
