USegScene: Unsupervised Learning of Depth, Optical Flow and Ego-Motion with Semantic Guidance and Coupled Networks

07/15/2022
by Johan Vertens, et al.

In this paper we propose USegScene, a framework for semantically guided unsupervised learning of depth, optical flow and ego-motion estimation from stereo camera images using convolutional neural networks. Our framework leverages semantic information for improved regularization of depth and optical flow maps, for multimodal fusion, and for occlusion filling, treating dynamic rigid object motions as independent SE(3) transformations. Furthermore, complementary to pure photometric matching, we propose matching semantic features, pixel-wise classes and object instance borders between consecutive images. In contrast to previous methods, we propose a network architecture that jointly predicts all outputs using shared encoders and allows information to pass across the task domains, e.g., the prediction of optical flow can benefit from the prediction of depth. In addition, we explicitly learn the depth and optical flow occlusion maps inside the network and leverage them to improve the predictions in the respective regions. We present results on the popular KITTI dataset and show that our approach outperforms other methods by a large margin.
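To illustrate why an optical flow prediction can benefit from a depth prediction, the sketch below computes the flow that a depth map and a single SE(3) transformation induce under a pinhole camera model. This is a generic textbook construction and not the paper's implementation. The function name `rigid_flow`, the intrinsics `K`, and the 4x4 transform `T` are assumptions introduced for illustration only.

```python
import numpy as np

def rigid_flow(depth, K, T):
    """Flow (in pixels) induced by applying an SE(3) transform to a depth map.

    Hypothetical helper, not taken from the paper.
    depth: (H, W) array of positive depths in the first frame
    K:     (3, 3) pinhole camera intrinsics (assumed known)
    T:     (4, 4) transform from first-frame to second-frame camera coordinates
    Returns an (H, W, 2) array holding the (u, v) displacement of every pixel.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w, dtype=np.float64),
                       np.arange(h, dtype=np.float64))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)   # homogeneous pixels, (H, W, 3)

    # Back-project every pixel to a 3D point using its depth.
    rays = pix @ np.linalg.inv(K).T
    points = rays * depth[..., None]

    # Apply the rigid motion (rotation, then translation).
    moved = points @ T[:3, :3].T + T[:3, 3]

    # Project the moved points back into the image plane.
    proj = moved @ K.T
    uv2 = proj[..., :2] / proj[..., 2:3]

    return uv2 - pix[..., :2]


# Example: a fronto-parallel plane at depth 10 and a sideways translation of 0.5.
K = np.array([[100.0, 0.0, 32.0],
              [0.0, 100.0, 24.0],
              [0.0, 0.0, 1.0]])
depth = np.full((48, 64), 10.0)
T = np.eye(4)
T[0, 3] = 0.5
flow = rigid_flow(depth, K, T)
```

Applying the same function per object, with one transform per instance mask, would be one way to model independently moving rigid objects. How USegScene actually combines these quantities is described in the full text, not here.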

research
10/08/2018

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

Learning depth and optical flow via deep neural networks by watching vid...
research
11/16/2017

Occlusion Aware Unsupervised Learning of Optical Flow

It has been recently shown that a convolutional neural network can learn...
research
09/05/2018

DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency

We present an unsupervised learning framework for simultaneously trainin...
research
09/23/2018

Unsupervised Learning of Dense Optical Flow and Depth from Sparse Event Data

In this work we present unsupervised learning of depth and motion from s...
research
02/02/2015

Learning the Matching Function

The matching function for the problem of stereo reconstruction or optica...
research
04/06/2016

Exploiting Semantic Information and Deep Matching for Optical Flow

We tackle the problem of estimating optical flow from a monocular camera...
research
12/07/2016

DeMoN: Depth and Motion Network for Learning Monocular Stereo

In this paper we formulate structure from motion as a learning problem. ...
