On the Efficacy of Multi-scale Data Samplers for Vision Applications

09/08/2023
by   Elvis Nunez, et al.
0

Multi-scale resolution training has seen an increased adoption across multiple vision tasks, including classification and detection. Training with smaller resolutions enables faster training at the expense of a drop in accuracy. Conversely, training with larger resolutions has been shown to improve performance, but memory constraints often make this infeasible. In this paper, we empirically study the properties of multi-scale training procedures. We focus on variable batch size multi-scale data samplers that randomly sample an input resolution at each training iteration and dynamically adjust their batch size according to the resolution. Such samplers have been shown to improve model accuracy beyond standard training with a fixed batch size and resolution, though it is not clear why this is the case. We explore the properties of these data samplers by performing extensive experiments on ResNet-101 and validate our conclusions across multiple architectures, tasks, and datasets. We show that multi-scale samplers behave as implicit data regularizers and accelerate training speed. Compared to models trained with single-scale samplers, we show that models trained with multi-scale samplers retain or improve accuracy, while being better-calibrated and more robust to scaling and data distribution shifts. We additionally extend a multi-scale variable batch sampler with a simple curriculum that progressively grows resolutions throughout training, allowing for a compute reduction of more than 30 instance segmentation tasks, where we observe a 37 along with a 3-4

READ FULL TEXT
research
05/23/2018

SNIPER: Efficient Multi-Scale Training

We present SNIPER, an algorithm for performing efficient multi-scale tra...
research
03/29/2021

Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

This paper presents a new Vision Transformer (ViT) architecture Multi-Sc...
research
12/01/2022

ResFormer: Scaling ViTs with Multi-Resolution Training

Vision Transformers (ViTs) have achieved overwhelming success, yet they ...
research
11/30/2022

Adaptive adversarial training method for improving multi-scale GAN based on generalization bound theory

In recent years, multi-scale generative adversarial networks (GANs) have...
research
05/31/2022

CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping

We present a simple method, CropMix, for the purpose of producing a rich...
research
07/06/2022

Multi-scale Sinusoidal Embeddings Enable Learning on High Resolution Mass Spectrometry Data

Small molecules in biological samples are studied to provide information...
research
12/08/2017

Weaving Multi-scale Context for Single Shot Detector

Aggregating context information from multiple scales has been proved to ...

Please sign up or login with your details

Forgot password? Click here to reset