BEA: Revisiting anchor-based object detection DNN using Budding Ensemble Architecture

by   Syed Sha Qutub, et al.

This paper introduces the Budding Ensemble Architecture (BEA), a novel reduced ensemble architecture for anchor-based object detection models. Object detection models are crucial in vision-based tasks, particularly in autonomous systems. They should provide precise bounding box detections while also calibrating their predicted confidence scores, leading to higher-quality uncertainty estimates. However, current models may make erroneous decisions due to false positives receiving high scores or true positives being discarded due to low scores. BEA aims to address these issues. The proposed loss functions in BEA improve the confidence score calibration and lower the uncertainty error, which results in a better distinction of true and false positives and, eventually, higher accuracy of the object detection models. Both Base-YOLOv3 and SSD models were enhanced using the BEA method and its proposed loss functions. The BEA on Base-YOLOv3 trained on the KITTI dataset results in a 6 and 3.7 uncertainty estimation threshold to discard samples in real-time even leads to a 9.6 the area under the AP50-based retention curve used to measure the quality of calibration of confidence scores. Furthermore, BEA-YOLOV3 trained on KITTI provides superior out-of-distribution detection on Citypersons, BDD100K, and COCO datasets compared to the ensembles and vanilla models of YOLOv3 and Gaussian-YOLOv3.


page 2

page 4

page 17

page 18

page 19

page 20

page 21

page 22


Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection

Deep neural networks (DNNs) have enabled astounding progress in several ...

Improving Automated Sonar Video Analysis to Notify About Jellyfish Blooms

Human enterprise often suffers from direct negative effects caused by je...

XC: Exploring Quantitative Use Cases for Explanations in 3D Object Detection

Explainable AI (XAI) methods are frequently applied to obtain qualitativ...

Wildfire Smoke Detection with Computer Vision

Wildfires are becoming more frequent and their effects more devastating ...

MetaDetect: Uncertainty Quantification and Prediction Quality Estimates for Object Detection

In object detection with deep neural networks, the box-wise objectness s...

The MCC approaches the geometric mean of precision and recall as true negatives approach infinity

The performance of a binary classifier is described by a confusion matri...

Inner-Scene Similarities as a Contextual Cue for Object Detection

Using image context is an effective approach for improving object detect...

Please sign up or login with your details

Forgot password? Click here to reset