R-TOSS: A Framework for Real-Time Object Detection using Semi-Structured Pruning

03/03/2023
by Abhishek Balasubramaniam, et al.

Object detectors used in autonomous vehicles can have high memory and computational overheads. In this paper, we introduce a novel semi-structured pruning framework called R-TOSS that overcomes the shortcomings of state-of-the-art model pruning techniques. Experimental results on the JetsonTX2 show that R-TOSS has a compression rate of 4.4x on the YOLOv5 object detector with a 2.15x speedup in inference time and a 57.01% decrease in energy usage. R-TOSS also enables 2.89x compression on RetinaNet with a 1.86x speedup in inference time and a 56.31% decrease in energy usage. We also demonstrate significant improvements compared to various state-of-the-art pruning techniques.
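
To put the reported ratios in more familiar terms, the short sketch below converts an N-times reduction into the fraction of the original quantity that is eliminated. It is only an illustration, not part of R-TOSS: the helper `fraction_removed` is a hypothetical name, and the sketch assumes that "compression rate" means original model size divided by pruned model size, and that "speedup" means original inference time divided by pruned inference time.

```python
def fraction_removed(ratio: float) -> float:
    """Fraction of the original quantity eliminated by an N-times reduction.

    Assumes ratio = original / reduced (hypothetical reading of the abstract).
    """
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return 1.0 - 1.0 / ratio


# Ratios quoted in the abstract; the labels are descriptive only.
reported = {
    "YOLOv5 compression (4.4x)": 4.4,
    "YOLOv5 inference speedup (2.15x)": 2.15,
    "RetinaNet compression (2.89x)": 2.89,
    "RetinaNet inference speedup (1.86x)": 1.86,
}

for label, ratio in reported.items():
    print(f"{label}: {fraction_removed(ratio):.1%} of the original removed")
```

Under these assumptions, a 4.4x compression rate corresponds to removing roughly three quarters of the original model size, and a 2.15x speedup to cutting inference time by a little over half.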


