Efficient Floating-Point Givens Rotation Unit

10/23/2020
by Javier Hormigo, et al.

High-throughput QR decomposition is a key operation in many advanced signal processing and communication applications. For some of these applications, floating-point computation is becoming almost compulsory. However, few works address hardware implementations of floating-point QR decomposition for embedded systems. In this paper, we propose a very efficient high-throughput floating-point Givens rotation unit for QR decomposition. Moreover, the initially proposed design for conventional number formats is enhanced by using the new Half-Unit Biased format. The provided error analysis shows the effectiveness of our proposals and the trade-offs among different implementation parameters. FPGA implementation results are also presented, along with a thorough comparison between both approaches. These implementation results also reveal outstanding improvements over previous similar designs in terms of area, latency, and throughput.
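
For readers unfamiliar with the operation the abstract refers to, the following is a minimal software sketch of the textbook Givens rotation and its use in QR decomposition. It is not the hardware unit proposed in the paper and does not model the Half-Unit Biased format; the function names `givens` and `qr_givens` are hypothetical and chosen only for illustration.

```python
import math


def givens(a, b):
    """Return (c, s, r) such that c*a + s*b = r and -s*a + c*b = 0."""
    if b == 0.0:
        # Nothing to eliminate: use the identity rotation.
        return 1.0, 0.0, a
    r = math.hypot(a, b)  # sqrt(a*a + b*b) without intermediate overflow
    return a / r, b / r, r


def qr_givens(A):
    """Textbook QR by Givens rotations on a list-of-lists matrix.

    Returns (Q, R) with A = Q R, Q orthogonal and R upper triangular.
    Illustrative only; assumes A is rectangular with float entries.
    """
    m, n = len(A), len(A[0])
    R = [row[:] for row in A]
    # Qt accumulates the product of all rotations, so that R = Qt * A.
    Qt = [[1.0 if i == j else 0.0 for j in range(m)] for i in range(m)]
    for j in range(n):
        # Zero column j below the diagonal, working from the bottom row up.
        for i in range(m - 1, j, -1):
            c, s, _ = givens(R[i - 1][j], R[i][j])
            # Apply the same rotation to rows i-1 and i of both R and Qt.
            for M in (R, Qt):
                for k in range(len(M[0])):
                    x, y = M[i - 1][k], M[i][k]
                    M[i - 1][k] = c * x + s * y
                    M[i][k] = -s * x + c * y
    # Q is the transpose of the accumulated rotations.
    Q = [[Qt[j][i] for j in range(m)] for i in range(m)]
    return Q, R
```

Each rotation touches only two rows and zeroes exactly one element. This locality is the general reason Givens rotations are a common choice for pipelined QR hardware, although the sketch above says nothing about the specific architecture the paper proposes.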


research
10/01/2019

An efficient floating point multiplier design for high speed applications using Karatsuba algorithm and Urdhva-Tiryagbhyam algorithm

Floating point multiplication is a crucial operation in high power compu...
research
04/29/2017

A floating point division unit based on Taylor-Series expansion algorithm and Iterative Logarithmic Multiplier

Floating point division, even though being an infrequent operation in th...
research
09/26/2021

Efficient Non-linear Calculators

A novel algorithm for producing smooth nonlinearities on digital hardwar...
research
05/21/2018

Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines

Deep learning as a means to inferencing has proliferated thanks to its v...
research
03/12/2020

Proposal of a Takagi-Sugeno Fuzzy-PI Controller Hardware

This work proposes dedicated hardware for an intelligent control system ...
research
02/03/2023

PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications

Posit has been a promising alternative to the IEEE-754 floating point fo...
research
06/15/2011

A Characterization of the SPARC T3-4 System

This technical report covers a set of experiments on the 64-core SPARC T...
