Learning to Resize Images for Computer Vision Tasks

03/17/2021
by   Hossein Talebi, et al.
31

For all the ways convolutional neural nets have revolutionized computer vision in recent years, one important aspect has received surprisingly little attention: the effect of image size on the accuracy of tasks being trained for. Typically, to be efficient, the input images are resized to a relatively small spatial resolution (e.g. 224x224), and both training and inference are carried out at this resolution. The actual mechanism for this re-scaling has been an afterthought: Namely, off-the-shelf image resizers such as bilinear and bicubic are commonly used in most machine learning software frameworks. But do these resizers limit the on task performance of the trained networks? The answer is yes. Indeed, we show that the typical linear resizer can be replaced with learned resizers that can substantially improve performance. Importantly, while the classical resizers typically result in better perceptual quality of the downscaled images, our proposed learned resizers do not necessarily give better visual quality, but instead improve task performance. Our learned image resizer is jointly trained with a baseline vision model. This learned CNN-based resizer creates machine friendly visual manipulations that lead to a consistent improvement of the end task metric over the baseline model. Specifically, here we focus on the classification task with the ImageNet dataset, and experiment with four different models to learn resizers adapted to each model. Moreover, we show that the proposed resizer can also be useful for fine-tuning the classification baselines for other vision tasks. To this end, we experiment with three different baselines to develop image quality assessment (IQA) models on the AVA dataset.

READ FULL TEXT

page 2

page 3

page 6

page 7

research
04/06/2023

MULLER: Multilayer Laplacian Resizer for Vision

Image resizing operation is a fundamental preprocessing module in modern...
research
02/16/2021

Does deep machine vision have just noticeable difference (JND)?

As an important perceptual characteristic of the Human Visual System (HV...
research
02/09/2016

The Role of Typicality in Object Classification: Improving The Generalization Capacity of Convolutional Neural Networks

Deep artificial neural networks have made remarkable progress in differe...
research
10/28/2021

Characterizing and Taming Resolution in Convolutional Neural Networks

Image resolution has a significant effect on the accuracy and computatio...
research
11/19/2021

Neural Image Beauty Predictor Based on Bradley-Terry Model

Image beauty assessment is an important subject of computer vision. Ther...
research
05/26/2023

Image Quality Is Not All You Want: Task-Driven Lens Design for Image Classification

In computer vision, it has long been taken for granted that high-quality...
research
07/24/2022

Inter-model Interpretability: Self-supervised Models as a Case Study

Since early machine learning models, metrics such as accuracy and precis...

Please sign up or login with your details

Forgot password? Click here to reset