A Parametric Top-View Representation of Complex Road Scenes

12/14/2018
by   Ziyan Wang, et al.
0

In this paper, we address the problem of inferring the layout of complex road scenes given a single camera as input. To achieve that, we first propose a novel parameterized model of road layouts in a top-view representation, which is not only intuitive for human visualization but also provides an interpretable interface for higher-level decision making. Moreover, the design of our top-view scene model allows for efficient sampling and thus generation of large-scale simulated data, which we leverage to train a deep neural network to infer our scene model's parameters. Specifically, our proposed training procedure uses supervised domain-adaptation techniques to incorporate both simulated as well as manually annotated data. Finally, we design a Conditional Random Field (CRF) that enforces coherent predictions for a single frame and encourages temporal smoothness among video frames. Experiments on two public data sets show that: (1) Our parametric top-view model is representative enough to describe complex road scenes, (2) The proposed method outperforms baselines trained on manually-annotated or simulated data only, thus getting the best of both, (3) Our CRF is able to generate temporally smoothed while semantically meaningful results.

READ FULL TEXT

page 3

page 9

research
07/02/2020

Understanding Road Layout from Videos as a Whole

In this paper, we address the problem of inferring the layout of complex...
research
03/28/2018

Learning to Look around Objects for Top-View Representations of Outdoor Scenes

Given a single RGB image of a complex outdoor road scene in the perspect...
research
02/19/2020

MonoLayout: Amodal scene layout from a single image

In this paper, we address the novel, highly challenging problem of estim...
research
11/24/2018

Spatio-Temporal Road Scene Reconstruction using Superpixel MRF

Scene models construction based on image rendering is a hot topic in the...
research
04/14/2021

Weakly But Deeply Supervised Occlusion-Reasoned Parametric Layouts

We propose an end-to-end network that takes a single perspective RGB ima...
research
08/06/2023

"Kurosawa": A Script Writer's Assistant

Storytelling is the lifeline of the entertainment industry – movies, TV ...
research
06/29/2014

Fusion Based Holistic Road Scene Understanding

This paper addresses the problem of holistic road scene understanding ba...

Please sign up or login with your details

Forgot password? Click here to reset