Pruned Lightweight Encoders for Computer Vision

11/23/2022
by   Jakub Žádník, et al.
0

Latency-critical computer vision systems, such as autonomous driving or drone control, require fast image or video compression when offloading neural network inference to a remote computer. To ensure low latency on a near-sensor edge device, we propose the use of lightweight encoders with constant bitrate and pruned encoding configurations, namely, ASTC and JPEG XS. Pruning introduces significant distortion which we show can be recovered by retraining the neural network with compressed data after decompression. Such an approach does not modify the network architecture or require coding format modifications. By retraining with compressed datasets, we reduced the classification accuracy and segmentation mean intersection over union (mIoU) degradation due to ASTC compression to 4.9-5.0 percentage points (pp) and 4.4-4.0 pp, respectively. With the same method, the mIoU lost due to JPEG XS compression at the main profile was restored to 2.7-2.3 pp. In terms of encoding speed, our ASTC encoder implementation is 2.3x faster than JPEG. Even though the JPEG XS reference encoder requires optimizations to reach low latency, we showed that disabling significance flag coding saves 22-23 0.4-0.3 mIoU after retraining.

READ FULL TEXT

page 1

page 5

research
03/21/2023

Lightweight Hybrid Video Compression Framework Using Reference-Guided Restoration Network

Recent deep-learning-based video compression methods brought coding gain...
research
08/15/2022

Task Oriented Video Coding: A Survey

Video coding technology has been continuously improved for higher compre...
research
02/16/2022

Practical Network Acceleration with Tiny Sets

Network compression is effective in accelerating the inference of deep n...
research
11/24/2022

Attention-based Feature Compression for CNN Inference Offloading in Edge Computing

This paper studies the computational offloading of CNN inference in devi...
research
11/03/2021

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outp...
research
06/29/2023

End-to-End Learnable Multi-Scale Feature Compression for VCM

The proliferation of deep learning-based machine vision applications has...
research
04/05/2021

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency

We propose a dynamic encoder transducer (DET) for on-device speech recog...

Please sign up or login with your details

Forgot password? Click here to reset