Image coding for machines: an end-to-end learned approach

08/23/2021
by Nam Le, et al.

Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images. Given the dramatic explosion in the number of images generated per day, a question arises: how much better would an image codec targeting machine-consumption perform against state-of-the-art codecs targeting human-consumption? In this paper, we propose an image codec for machines which is neural network (NN) based and end-to-end learned. In particular, we propose a set of training strategies that address the delicate problem of balancing competing loss functions, such as computer vision task losses, image distortion losses, and rate loss. Our experimental results show that our NN-based codec outperforms the state-of-the-art Versatile Video Coding (VVC) standard on the object detection and instance segmentation tasks, achieving -37.87 thanks to its compact size. To the best of our knowledge, this is the first end-to-end learned machine-targeted image codec.
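
The abstract does not spell out the training strategies themselves, so the following is only a minimal sketch of the general idea of balancing a rate loss, an image distortion loss, and a task loss through a weighted sum whose weights change during training. The function names, the linear ramp schedule, and every numeric value are hypothetical and are not taken from the paper.

```python
def total_loss(rate, distortion, task, w_rate, w_dist, w_task):
    """Weighted sum of the three competing objectives named in the abstract."""
    return w_rate * rate + w_dist * distortion + w_task * task


def linear_ramp(step, start, end, start_value, end_value):
    """Hypothetical schedule: move a weight linearly from start_value to
    end_value between training steps `start` and `end`."""
    if step <= start:
        return start_value
    if step >= end:
        return end_value
    frac = (step - start) / (end - start)
    return start_value + frac * (end_value - start_value)


if __name__ == "__main__":
    # Assumed example: fade the distortion weight out while the task weight
    # ramps in, keeping the rate weight fixed. Loss values are placeholders.
    for step in (0, 500, 1000):
        w_dist = linear_ramp(step, 0, 1000, 1.0, 0.0)
        w_task = linear_ramp(step, 0, 1000, 0.0, 1.0)
        loss = total_loss(rate=0.8, distortion=0.02, task=1.5,
                          w_rate=0.1, w_dist=w_dist, w_task=w_task)
        print(step, w_dist, w_task, loss)
```

A schedule of this kind is one common way to let a codec first learn to reconstruct images and only later specialize toward the vision task; whether the paper does this is not stated in the abstract.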

