Bringing Generalization to Deep Multi-view Detection

09/24/2021
by   Jeet Vora, et al.
18

Multi-view Detection (MVD) is highly effective for occlusion reasoning and is a mainstream solution in various applications that require accurate top-view occupancy maps. While recent works using deep learning have made significant advances in the field, they have overlooked the generalization aspect, which makes them impractical for real-world deployment. The key novelty of our work is to formalize three critical forms of generalization and propose experiments to investigate them: i) generalization across a varying number of cameras, ii) generalization with varying camera positions, and finally, iii) generalization to new scenes. We find that existing models show poor generalization by overfitting to a single scene and camera configuration. We propose modifications in terms of pre-training, pooling strategy, regularization, and loss function to an existing state-of-the-art framework, leading to successful generalization across new camera configurations and new scenes. We perform a comprehensive set of experiments on the and datasets to (a) motivate the necessity to evaluate MVD methods on generalization abilities and (b) demonstrate the efficacy of the proposed approach. The code is publicly available at <https://github.com/jeetv/GMVD>

READ FULL TEXT

page 3

page 6

page 12

page 13

page 14

research
03/15/2023

RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters

Novel view synthesis (NVS) is a challenging task in computer vision that...
research
05/30/2023

Occ-BEV: Multi-Camera Unified Pre-training via 3D Scene Reconstruction

Multi-camera 3D perception has emerged as a prominent research field in ...
research
04/27/2021

Multi-view Deep One-class Classification: A Systematic Exploration

One-class classification (OCC), which models one single positive class a...
research
05/05/2021

FLEX: Parameter-free Multi-view 3D Human Motion Reconstruction

The increasing availability of video recordings made by multiple cameras...
research
01/13/2020

A Bayesian 3D Multi-view Multi-object Tracking Filter

This paper proposes an online multi-camera multi-object tracker that onl...
research
12/07/2021

Voxelized 3D Feature Aggregation for Multiview Detection

Multi-view detection incorporates multiple camera views to alleviate occ...
research
05/26/2020

SurfaceNet+: An End-to-end 3D Neural Network for Very Sparse Multi-view Stereopsis

Multi-view stereopsis (MVS) tries to recover the 3D model from 2D images...

Please sign up or login with your details

Forgot password? Click here to reset