AdaZoom: Adaptive Zoom Network for Multi-Scale Object Detection in Large Scenes

06/19/2021
by   Jingtao Xu, et al.
0

Detection in large-scale scenes is a challenging problem due to small objects and extreme scale variation. It is essential to focus on the image regions of small objects. In this paper, we propose a novel Adaptive Zoom (AdaZoom) network as a selective magnifier with flexible shape and focal length to adaptively zoom the focus regions for object detection. Based on policy gradient, we construct a reinforcement learning framework for focus region generation, with the reward formulated by object distributions. The scales and aspect ratios of the generated regions are adaptive to the scales and distribution of objects inside. We apply variable magnification according to the scale of the region for adaptive multi-scale detection. We further propose collaborative training to complementarily promote the performance of AdaZoom and the detection network. To validate the effectiveness, we conduct extensive experiments on VisDrone2019, UAVDT, and DOTA datasets. The experiments show AdaZoom brings a consistent and significant improvement over different detection networks, achieving state-of-the-art performance on these datasets, especially outperforming the existing methods by AP of 4.64

READ FULL TEXT

page 1

page 4

page 5

page 8

research
03/29/2023

MuRAL: Multi-Scale Region-based Active Learning for Object Detection

Obtaining large-scale labeled object detection dataset can be costly and...
research
08/20/2019

Consistent Scale Normalization for Object Recognition

Scale variation remains a challenge problem for object detection. Common...
research
06/16/2022

Delving into the Scale Variance Problem in Object Detection

Object detection has made substantial progress in the last decade, due t...
research
09/11/2019

Distortion-adaptive Salient Object Detection in 360^∘ Omnidirectional Images

Image-based salient object detection (SOD) has been extensively explored...
research
12/14/2015

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

It is well known that contextual and multi-scale representations are imp...
research
07/20/2022

Exploiting Domain Transferability for Collaborative Inter-level Domain Adaptive Object Detection

Domain adaptation for object detection (DAOD) has recently drawn much at...
research
01/10/2018

From Superpixel to Human Shape Modelling for Carried Object Detection

Detecting carried objects is one of the requirements for developing syst...

Please sign up or login with your details

Forgot password? Click here to reset