Image Coding for Machines with Object Region Learning

08/27/2023
by   Takahiro Shindo, et al.
0

Compression technology is essential for efficient image transmission and storage. With the rapid advances in deep learning, images are beginning to be used for image recognition as well as for human vision. For this reason, research has been conducted on image coding for image recognition, and this field is called Image Coding for Machines (ICM). There are two main approaches in ICM: the ROI-based approach and the task-loss-based approach. The former approach has the problem of requiring an ROI-map as input in addition to the input image. The latter approach has the problems of difficulty in learning the task-loss, and lack of robustness because the specific image recognition model is used to compute the loss function. To solve these problems, we propose an image compression model that learns object regions. Our model does not require additional information as input, such as an ROI-map, and does not use task-loss. Therefore, it is possible to compress images for various image recognition models. In the experiments, we demonstrate the versatility of the proposed method by using three different image recognition models and three different datasets. In addition, we verify the effectiveness of our model by comparing it with previous methods.

READ FULL TEXT

page 3

page 4

research
04/03/2023

Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

With advances in image recognition technology based on deep learning, au...
research
06/28/2021

Deep Learning Image Recognition for Non-images

Powerful deep learning algorithms open an opportunity for solving non-im...
research
05/30/2023

VVC Extension Scheme for Object Detection Using Contrast Reduction

In recent years, video analysis using Artificial Intelligence (AI) has b...
research
12/07/2022

Overview Of Satellite Image Recognition Models

In this article, the analysis of existing models of satellite image reco...
research
03/25/2018

Image Recognition Using Scale Recurrent Neural Networks

Convolutional Neural Network(CNN) has been widely used for image recogni...
research
02/11/2019

Pornographic Image Recognition via Weighted Multiple Instance Learning

In the era of Internet, recognizing pornographic images is of great sign...
research
07/29/2022

Forensic License Plate Recognition with Compression-Informed Transformers

Forensic license plate recognition (FLPR) remains an open challenge in l...

Please sign up or login with your details

Forgot password? Click here to reset