A Single-Channel Architecture for Algebraic Integer Based 8×8 2-D DCT Computation

10/27/2017
by   A. Edirisuriya, et al.
0

An area efficient row-parallel architecture is proposed for the real-time implementation of bivariate algebraic integer (AI) encoded 2-D discrete cosine transform (DCT) for image and video processing. The proposed architecture computes 8×8 2-D DCT transform based on the Arai DCT algorithm. An improved fast algorithm for AI based 1-D DCT computation is proposed along with a single channel 2-D DCT architecture. The design improves on the 4-channel AI DCT architecture that was published recently by reducing the number of integer channels to one and the number of 8-point 1-D DCT cores from 5 down to 2. The architecture offers exact computation of 8×8 blocks of the 2-D DCT coefficients up to the FRS, which converts the coefficients from the AI representation to fixed-point format using the method of expansion factors. Prototype circuits corresponding to FRS blocks based on two expansion factors are realized, tested, and verified on FPGA-chip, using a Xilinx Virtex-6 XC6VLX240T device. Post place-and-route results show a 20 of area compared to the 2-D DCT architecture requiring five 1-D AI cores. The area-time and area-time^2 complexity metrics are also reduced by 23 22 realizations are simulated up to place and route for ASICs using 45 nm CMOS standard cells. The maximum estimated clock rate is 951 MHz for the CMOS realizations indicating 7.608·10^9 pixels/seconds and a 8×8 block rate of 118.875 MHz.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2017

A Digital Hardware Fast Algorithm and FPGA-based Prototype for a Novel 16-point Approximate DCT for Image Compression Applications

The discrete cosine transform (DCT) is the key step in many image and vi...
research
06/02/2022

Block-Parallel Systolic-Array Architecture for 2-D NTT-based Fragile Watermark Embedding

Number-theoretic transforms (NTTs) have been applied in the fragile wate...
research
05/27/2016

An Orthogonal 16-point Approximate DCT for Image and Video Compression

A low-complexity orthogonal multiplierless approximation for the 16-poin...
research
10/30/2017

VLSI Computational Architectures for the Arithmetic Cosine Transform

The discrete cosine transform (DCT) is a widely-used and important signa...
research
04/05/2021

Near-Precise Parameter Approximation for Multiple Multiplications on A Single DSP Block

A multiply-accumulate (MAC) operation is the main computation unit for D...
research
06/05/2023

Segregated FLS Processing Cores for V/STOL Autonomous Landing Guidance Assistant System using FPGA

It is highly predicted that the roads and parking areas will be extremel...
research
12/24/2021

Fast and Scalable Computation of the Forward and Inverse Discrete Periodic Radon Transform

The Discrete Periodic Radon Transform (DPRT) has been extensively used i...

Please sign up or login with your details

Forgot password? Click here to reset