Bootstrap Your Object Detector via Mixed Training

11/04/2021
by   Mengde Xu, et al.
29

We introduce MixTraining, a new training paradigm for object detection that can improve the performance of existing detectors for free. MixTraining enhances data augmentation by utilizing augmentations of different strengths while excluding the strong augmentations of certain training samples that may be detrimental to training. In addition, it addresses localization noise and missing labels in human annotations by incorporating pseudo boxes that can compensate for these errors. Both of these MixTraining capabilities are made possible through bootstrapping on the detector, which can be used to predict the difficulty of training on a strong augmentation, as well as to generate reliable pseudo boxes thanks to the robustness of neural networks to labeling error. MixTraining is found to bring consistent improvements across various detectors on the COCO dataset. In particular, the performance of Faster R-CNN <cit.> with a ResNet-50 <cit.> backbone is improved from 41.7 mAP to 44.0 mAP, and the accuracy of Cascade-RCNN <cit.> with a Swin-Small <cit.> backbone is raised from 50.9 mAP to 52.8 mAP. The code and models will be made publicly available at <https://github.com/MendelXu/MixTraining>.

READ FULL TEXT

page 2

page 4

page 5

page 9

research
09/09/2019

CBNet: A Novel Composite Backbone Network Architecture for Object Detection

In existing CNN based detectors, the backbone network is a very importan...
research
11/26/2020

TinaFace: Strong but Simple Baseline for Face Detection

Face detection has received intensive attention in recent years. Many wo...
research
07/24/2023

COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts

Practical object detection application can lose its effectiveness on ima...
research
10/15/2019

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

It is hard to detect on-road objects under various lighting conditions. ...
research
11/07/2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

We present a strong object detector with encoder-decoder pretraining and...
research
03/15/2022

Progressive End-to-End Object Detection in Crowded Scenes

In this paper, we propose a new query-based detection framework for crow...
research
04/15/2023

Align-DETR: Improving DETR with Simple IoU-aware BCE loss

DETR has set up a simple end-to-end pipeline for object detection by for...

Please sign up or login with your details

Forgot password? Click here to reset