Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

07/10/2022
by   Lei Yang, et al.
11

Monocular 3D object detection is an essential perception task for autonomous driving. However, the high reliance on large-scale labeled data make it costly and time-consuming during model optimization. To reduce such over-reliance on human annotations, we propose Mix-Teaching, an effective semi-supervised learning framework applicable to employ both labeled and unlabeled images in training stage. Mix-Teaching first generates pseudo-labels for unlabeled images by self-training. The student model is then trained on the mixed images possessing much more intensive and precise labeling by merging instance-level image patches into empty backgrounds or labeled images. This is the first to break the image-level limitation and put high-quality pseudo labels from multi frames into one image for semi-supervised training. Besides, as a result of the misalignment between confidence score and localization quality, it's hard to discriminate high-quality pseudo-labels from noisy predictions using only confidence-based criterion. To that end, we further introduce an uncertainty-based filter to help select reliable pseudo boxes for the above mixing operation. To the best of our knowledge, this is the first unified SSL framework for monocular 3D object detection. Mix-Teaching consistently improves MonoFlex and GUPNet by significant margins under various labeling ratios on KITTI dataset. For example, our method achieves around +6.34 improvement against the GUPNet baseline on validation set when using only 10 labeled data. Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4.65 on AP@0.7 for car detection, reaching 18.54 among all monocular based methods on KITTI test leaderboard. The code and pretrained models will be released at https://github.com/yanglei18/Mix-Teaching.

READ FULL TEXT

page 4

page 5

page 9

research
11/14/2022

Boosting Semi-Supervised 3D Object Detection with Semi-Sampling

Current 3D object detection methods heavily rely on an enormous amount o...
research
08/15/2022

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

Image-based 3D detection is an indispensable component of the perception...
research
09/02/2020

Monocular 3D Detection with Geometric Constraints Embedding and Semi-supervised Training

In this work, we propose a novel single-shot and keypoints-based framewo...
research
09/04/2023

SSVOD: Semi-Supervised Video Object Detection with Sparse Annotations

Despite significant progress in semi-supervised learning for image objec...
research
06/14/2023

Semi-supervised Cell Recognition under Point Supervision

Cell recognition is a fundamental task in digital histopathology image a...
research
03/17/2022

DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection

While numerous 3D detection works leverage the complementary relationshi...
research
07/17/2023

Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning

We propose a novel semi-supervised active learning (SSAL) framework for ...

Please sign up or login with your details

Forgot password? Click here to reset