Side-Aware Boundary Localization for More Precise Object Detection

by Jiaqi Wang, et al.

Current object detection frameworks mainly rely on bounding box regression to localize objects. Despite remarkable progress in recent years, the precision of bounding box regression remains unsatisfactory, which limits object detection performance. We observe that precise localization requires careful placement of each side of the bounding box. However, the mainstream approach, which focuses on predicting centers and sizes, is not the most effective way to accomplish this task, especially when there exist displacements with large variance between the anchors and the targets. In this paper, we propose an alternative approach, named Side-Aware Boundary Localization (SABL), in which each side of the bounding box is localized by a dedicated network branch. Moreover, to tackle the difficulty of precise localization in the presence of displacements with large variance, we further propose a two-step localization scheme, which first predicts a range of movement through bucket prediction and then pinpoints the precise position within the predicted bucket. We test the proposed method on both two-stage and single-stage detection frameworks. Replacing the standard bounding box regression branch with the proposed design leads to significant improvements on Faster R-CNN, RetinaNet, and Cascade R-CNN, by 3.0 respectively. Code and models will be available at
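
The two-step scheme described above (coarse bucket prediction, then fine positioning inside the chosen bucket) can be illustrated with a minimal sketch for a single box side along one axis. This is not the authors' implementation. The function name `localize_side`, the equal-width bucketing of a search range, and the convention that the fine offset is a fraction of the bucket width measured from the bucket center are all assumptions made for illustration.

```python
from typing import Sequence


def localize_side(
    range_start: float,
    range_end: float,
    bucket_scores: Sequence[float],
    bucket_offsets: Sequence[float],
) -> float:
    """Hypothetical two-step localization of one box side on one axis.

    Step 1 (coarse): split [range_start, range_end] into equal-width
    buckets and pick the bucket with the highest predicted score.
    Step 2 (fine): move from that bucket's center by a predicted offset,
    expressed as a fraction of the bucket width (an assumed convention).
    """
    num_buckets = len(bucket_scores)
    if num_buckets == 0 or num_buckets != len(bucket_offsets):
        raise ValueError("need one score and one offset per bucket")

    bucket_width = (range_end - range_start) / num_buckets

    # Coarse step: index of the most confident bucket.
    best = max(range(num_buckets), key=lambda i: bucket_scores[i])

    # Fine step: refine within the selected bucket.
    center = range_start + (best + 0.5) * bucket_width
    return center + bucket_offsets[best] * bucket_width


if __name__ == "__main__":
    # Made-up scores and offsets for 7 buckets over a 70-pixel range.
    scores = [0.1, 0.2, 0.9, 0.3, 0.1, 0.0, 0.0]
    offsets = [0.0, 0.0, 0.2, 0.0, 0.0, 0.0, 0.0]
    print(localize_side(0.0, 70.0, scores, offsets))
```

In this toy setup the coarse step only has to choose among a handful of buckets even when the side is far from its target, and the fine step then regresses a small offset bounded by the bucket width, which is the intuition the abstract gives for handling displacements with large variance.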




Boundary Distribution Estimation to Precise Object Detection

In principal modern detectors, the task of object localization is implem...

Lymphocyte counting – Error Analysis of Regression versus Bounding Box Detection Approaches

We consider the problem of counting cell nuclei from celltype-agnostic h...

nnDetection for Intracranial Aneurysms Detection and Localization

Intracranial aneurysms are a commonly occurring and life-threatening con...

Part Detector Discovery in Deep Convolutional Neural Networks

Current fine-grained classification approaches often rely on a robust lo...

Learning Fixation Point Strategy for Object Detection and Classification

We propose a novel recurrent attentional structure to localize and recog...

PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy

Many recently developed object detectors focused on coarse-to-fine frame...

I see what you hear: a vision-inspired method to localize words

This paper explores the possibility of using visual object detection tec...
