Seeing without Looking: Contextual Rescoring of Object Detections for AP Maximization

12/27/2019
by   Lourenço V. Pato, et al.
14

The majority of current object detectors lack context: class predictions are made independently from other detections. We propose to incorporate context in object detection by post-processing the output of an arbitrary detector to rescore the confidences of its detections. Rescoring is done by conditioning on contextual information from the entire set of detections: their confidences, predicted classes, and positions. We show that AP can be improved by simply reassigning the detection confidence values such that true positives that survive longer (i.e., those with the correct class and large IoU) are scored higher than false positives or detections with small IoU. In this setting, we use a bidirectional RNN with attention for contextual rescoring and introduce a training target that uses the IoU with ground truth to maximize AP for the given set of detections. The fact that our approach does not require access to visual features makes it computationally inexpensive and agnostic to the detection architecture. In spite of this simplicity, our model consistently improves AP over strong pre-trained baselines (Cascade R-CNN and Faster R-CNN with several backbones), particularly by reducing the confidence of duplicate detections (a learned form of non-maximum suppression) and removing out-of-context objects by conditioning on the confidences, classes, positions, and sizes of the co-occurrent detections (e.g., a high-confidence detection of bird makes a detection of sports ball less likely).

READ FULL TEXT

page 1

page 2

page 3

page 4

page 8

page 11

page 13

page 14

research
12/03/2017

Cascade R-CNN: Delving into High Quality Object Detection

In object detection, an intersection over union (IoU) threshold is requi...
research
12/10/2020

OneNet: Towards End-to-End One-Stage Object Detection

End-to-end one-stage object detection trailed thus far. This paper disco...
research
07/04/2018

Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Average precision (AP), the area under the recall-precision (RP) curve, ...
research
09/21/2018

Analysing object detectors from the perspective of co-occurring object categories

The accuracy of state-of-the-art Faster R-CNN and YOLO object detectors ...
research
07/24/2021

Rank Sort Loss for Object Detection and Instance Segmentation

We propose Rank Sort (RS) Loss, as a ranking-based loss function to ...
research
04/09/2020

Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

Weakly supervised learning has emerged as a compelling tool for object d...
research
02/01/2021

Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details

By design, average precision (AP) for object detection aims to treat all...

Please sign up or login with your details

Forgot password? Click here to reset