Revisiting the Sibling Head in Object Detector

by Guanglu Song, et al.

The “shared head for classification and localization” (sibling head), first introduced in Fast RCNN <cit.>, has been the prevailing design in the object detection community for the past five years. This paper observes that the spatial misalignment between the two objective functions in the sibling head can considerably hurt the training process, and that this misalignment can be resolved by a very simple operator called task-aware spatial disentanglement (TSD). TSD decouples classification and regression along the spatial dimension by generating two disentangled proposals, one for each task, both estimated from the shared proposal. This is inspired by the natural insight that, for one instance, the features in some salient area may carry rich information for classification, while those around the boundary may be better suited to bounding box regression. Surprisingly, this simple design boosts all backbones and models on both MS COCO and Google OpenImage consistently by about 3% mAP. Further, we propose a progressive constraint to enlarge the performance margin between the disentangled and the shared proposals, and gain about 1% more mAP. We show that TSD breaks through the upper bound of current single-model detectors by a large margin (mAP 49.4 with ResNet-101, 51.2 with SENet154), and that it is the core model of our 1st place solution in the Google OpenImage Challenge 2019.
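To make the idea of "two disentangled proposals estimated from the shared proposal" concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation. The names `shift_proposal` and `disentangle` and the scale factor `gamma` are hypothetical, and the offsets are supplied by hand, whereas in an actual detector they would be predicted from the shared proposal's features.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Offset = Tuple[float, float]             # (dx, dy) as fractions of box size


def shift_proposal(box: Box, delta: Offset, gamma: float = 0.1) -> Box:
    """Translate a box by an offset given relative to its width and height.

    gamma is an assumed scalar that limits how far the box may move.
    """
    x1, y1, x2, y2 = box
    w, h = x2 - x1, y2 - y1
    dx, dy = gamma * delta[0] * w, gamma * delta[1] * h
    return (x1 + dx, y1 + dy, x2 + dx, y2 + dy)


def disentangle(shared: Box, cls_delta: Offset, reg_delta: Offset,
                gamma: float = 0.1) -> Tuple[Box, Box]:
    """Derive one proposal per task from the same shared proposal.

    cls_delta and reg_delta stand in for task-specific offsets that a
    learned module would output; here they are placeholders.
    """
    cls_box = shift_proposal(shared, cls_delta, gamma)
    reg_box = shift_proposal(shared, reg_delta, gamma)
    return cls_box, reg_box


if __name__ == "__main__":
    shared = (10.0, 20.0, 50.0, 60.0)
    cls_box, reg_box = disentangle(shared, (1.0, 0.0), (0.0, -0.5))
    print("classification proposal:", cls_box)
    print("regression proposal:   ", reg_box)
```

Each task would then pool its features from its own box rather than from the shared one, which is how the two branches stop competing for the same spatial region.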



