Multi-View Priors for Learning Detectors from Sparse Viewpoint Data

12/20/2013
by   Bojan Pepik, et al.
0

While the majority of today's object class models provide only 2D bounding boxes, far richer output hypotheses are desirable including viewpoint, fine-grained category, and 3D geometry estimate. However, models trained to provide richer output require larger amounts of training data, preferably well covering the relevant aspects such as viewpoint and fine-grained categories. In this paper, we address this issue from the perspective of transfer learning, and design an object class model that explicitly leverages correlations between visual features. Specifically, our model represents prior distributions over permissible multi-view detectors in a parametric way -- the priors are learned once from training data of a source object class, and can later be used to facilitate the learning of a detector for a target class. As we show in our experiments, this transfer is not only beneficial for detectors based on basic-level category representations, but also enables the robust learning of detectors that represent classes at finer levels of granularity, where training data is typically even scarcer and more unbalanced. As a result, we report largely improved performance in simultaneous 2D object localization and viewpoint estimation on a recent dataset of challenging street scenes.

READ FULL TEXT

page 5

page 7

page 13

research
03/25/2023

Viewpoint Equivariance for Multi-View 3D Object Detection

3D object detection from visual sensors is a cornerstone capability of r...
research
08/28/2020

Few-Shot Object Detection via Knowledge Transfer

Conventional methods for object detection usually require substantial am...
research
11/18/2014

Towards Scene Understanding with Detailed 3D Object Representations

Current approaches to semantic image and scene understanding typically e...
research
07/06/2020

Learning a Domain Classifier Bank for Unsupervised Adaptive Object Detection

In real applications, object detectors based on deep networks still face...
research
02/24/2017

Viewpoint Adaptation for Rigid Object Detection

An object detector performs suboptimally when applied to image data take...
research
12/22/2021

Class-aware Sounding Objects Localization via Audiovisual Correspondence

Audiovisual scenes are pervasive in our daily life. It is commonplace fo...
research
09/21/2018

Analysing object detectors from the perspective of co-occurring object categories

The accuracy of state-of-the-art Faster R-CNN and YOLO object detectors ...

Please sign up or login with your details

Forgot password? Click here to reset