Robust Object Detection with Multi-input Multi-output Faster R-CNN

11/25/2021
by   Sebastian Cygert, et al.
1

Recent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the real-world in out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. however, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work, a generalization of the MIMO approach is applied to the task of object detection using the general-purpose Faster R-CNN model. It was shown that using the MIMO framework allows building strong feature representation and obtains very competitive accuracy when using just two input/output pairs. Furthermore, it adds just 0.5% additional model parameters and increases the inference time by 15.9% when compared to the standard Faster R-CNN. It also works comparably to, or outperforms the Deep Ensemble approach in terms of model accuracy, robustness to out-of-distribution setting, and uncertainty calibration when the same number of predictions is used. This work opens up avenues for applying the MIMO approach in other high-level tasks such as semantic segmentation and depth estimation.

READ FULL TEXT
research
10/13/2020

Training independent subnetworks for robust prediction

Recent approaches to efficiently ensemble neural networks have shown tha...
research
06/10/2016

Face Detection with the Faster R-CNN

The Faster R-CNN has recently demonstrated impressive results on various...
research
06/01/2022

LiDAR-MIMO: Efficient Uncertainty Estimation for LiDAR-based 3D Object Detection

The estimation of uncertainty in robotic vision, such as 3D object detec...
research
02/10/2019

NeurAll: Towards a Unified Model for Visual Perception in Automated Driving

Convolutional Neural Networks (CNNs) are successfully used for the impor...
research
05/20/2022

Towards efficient feature sharing in MIMO architectures

Multi-input multi-output architectures propose to train multiple subnetw...
research
01/13/2018

Inverted Residuals and Linear Bottlenecks: Mobile Networks forClassification, Detection and Segmentation

In this paper we describe a new mobile architecture, MobileNetV2, that i...
research
12/06/2021

Simultaneously Predicting Multiple Plant Traits from Multiple Sensors via Deformable CNN Regression

Trait measurement is critical for the plant breeding and agricultural pr...

Please sign up or login with your details

Forgot password? Click here to reset