A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

03/16/2023
by   Wenqian Zhao, et al.
0

Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and accelerated via a novel dictionary selective strategy. Besides, the hardware programming architecture together with the model structure is analyzed to guide the optimal design of computation kernels to minimize the inference latency under the resource constraints. With these novel techniques, the communication and computation bottlenecks in the deep dictionary learning-based SR models are tackled perfectly. The experiments on the edge embedded NVIDIA NX and 2080Ti show that our method outperforms the state-of-the-art NVIDIA TensorRT significantly, and can achieve real-time performance.

READ FULL TEXT
research
03/29/2016

FAST: A Framework to Accelerate Super-Resolution Processing on Compressed Videos

State-of-the-art super-resolution (SR) algorithms require significant co...
research
06/11/2019

Hybrid Function Sparse Representation towards Image Super Resolution

Sparse representation with training-based dictionary has been shown succ...
research
03/12/2017

Local Patch Classification Based Framework for Single Image Super-Resolution

Recent learning-based super-resolution (SR) methods often focus on the d...
research
05/29/2015

Fast Computation of PERCLOS and Saccadic Ratio

This thesis describes the development of fast algorithms for the computa...
research
05/06/2021

Real-Time Video Super-Resolution by Joint Local Inference and Global Parameter Estimation

The state of the art in video super-resolution (SR) are techniques based...
research
07/12/2021

Real-Time Super-Resolution System of 4K-Video Based on Deep Learning

Video super-resolution (VSR) technology excels in reconstructing low-qua...
research
10/18/2021

Deploying Near-Optimal Delay-Constrained Paths with Segment Routing in Massive-Scale Networks

With a growing demand for quasi-instantaneous communication services suc...

Please sign up or login with your details

Forgot password? Click here to reset