Multimodal Future Localization and Emergence Prediction for Objects in Egocentric View with a Reachability Prior

06/08/2020
by   Osama Makansi, et al.
9

In this paper, we investigate the problem of anticipating future dynamics, particularly the future location of other vehicles and pedestrians, in the view of a moving vehicle. We approach two fundamental challenges: (1) the partial visibility due to the egocentric view with a single RGB camera and considerable field-of-view change due to the egomotion of the vehicle; (2) the multimodality of the distribution of future states. In contrast to many previous works, we do not assume structural knowledge from maps. We rather estimate a reachability prior for certain classes of objects from the semantic map of the present image and propagate it into the future using the planned egomotion. Experiments show that the reachability prior combined with multi-hypotheses learning improves multimodal prediction of the future location of tracked objects and, for the first time, the emergence of new objects. We also demonstrate promising zero-shot transfer to unseen datasets. Source code is available at $\href{https://github.com/lmb-freiburg/FLN-EPN-RPN}{\text{this https URL.}}$

READ FULL TEXT

page 1

page 2

page 4

page 5

page 8

page 13

research
09/15/2022

PIZZA: A Powerful Image-only Zero-Shot Zero-CAD Approach to 6 DoF Tracking

Estimating the relative pose of a new object without prior knowledge is ...
research
08/06/2020

Zero-Shot Multi-View Indoor Localization via Graph Location Networks

Indoor localization is a fundamental problem in location-based applicati...
research
05/13/2022

KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning

The goal of open-world compositional zero-shot learning (OW-CZSL) is to ...
research
04/19/2023

Anything-3D: Towards Single-view Anything Reconstruction in the Wild

3D reconstruction from a single-RGB image in unconstrained real-world sc...
research
08/11/2021

Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation

Location and appearance are the key cues for video object segmentation. ...
research
09/29/2022

SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis

We study the problem of novel view synthesis of objects from a single im...
research
08/10/2023

Follow Anything: Open-set detection, tracking, and following in real-time

Tracking and following objects of interest is critical to several roboti...

Please sign up or login with your details

Forgot password? Click here to reset