Geometry-Based Next Frame Prediction from Monocular Video

09/20/2016
by Reza Mahjourian, et al.

We consider the problem of next-frame prediction from video input. A recurrent convolutional neural network is trained to predict depth from monocular video input; the predicted depth, along with the current video image and the camera trajectory, can then be used to compute the next frame. Unlike prior next-frame prediction approaches, we take advantage of the scene geometry and use the predicted depth to generate the next-frame prediction. Our approach can produce rich next-frame predictions that include depth information attached to each pixel. Another novel aspect of our approach is that it predicts depth from a sequence of images (e.g., in a video) rather than from a single still image. We evaluate the proposed approach on the KITTI dataset, a standard dataset for benchmarking tasks relevant to autonomous driving. The proposed method produces results that are visually and numerically superior to existing methods that directly predict the next frame. We show that the accuracy of depth prediction improves as more prior frames are considered.
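The abstract describes computing the next frame from the current image, a per-pixel depth map, and the camera trajectory, but does not spell out the warping step. The following is a minimal sketch of one common way to do this, assuming a pinhole camera model: back-project each pixel to 3D using its depth, move the points into the next camera pose, and re-project them with a z-buffer. The function name `predict_next_frame`, the intrinsics matrix `K`, and the pose convention (`R`, `t` map points from the current camera frame to the next one) are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def predict_next_frame(image, depth, K, R, t):
    """Forward-warp `image` into the next camera pose using per-pixel depth.

    Assumed conventions (not from the paper): K is a 3x3 pinhole intrinsics
    matrix, and a 3D point X in the current camera frame maps to R @ X + t
    in the next camera frame.
    """
    h, w = depth.shape
    # Homogeneous pixel coordinates, shape (3, H*W), in row-major order.
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pixels = np.stack([u.ravel(), v.ravel(), np.ones(h * w)])
    # Back-project every pixel to a 3D point using its depth.
    points = (np.linalg.inv(K) @ pixels) * depth.ravel()
    # Express the points in the next camera frame and project them.
    proj = K @ (R @ points + np.asarray(t, dtype=float).reshape(3, 1))
    z = proj[2]
    in_front = z > 1e-6
    safe_z = np.where(in_front, z, 1.0)  # avoid dividing by zero
    u2 = np.rint(proj[0] / safe_z).astype(int)
    v2 = np.rint(proj[1] / safe_z).astype(int)
    valid = in_front & (u2 >= 0) & (u2 < w) & (v2 >= 0) & (v2 < h)
    idx = np.flatnonzero(valid)

    # Z-buffer: keep the nearest point that lands on each target pixel.
    next_depth = np.full((h, w), np.inf)
    np.minimum.at(next_depth, (v2[idx], u2[idx]), z[idx])
    winners = idx[z[idx] == next_depth[v2[idx], u2[idx]]]

    # Copy colors of the winning points; unfilled pixels stay zero (holes).
    next_image = np.zeros_like(image)
    flat = image.reshape(h * w, *image.shape[2:])
    next_image[v2[winners], u2[winners]] = flat[winners]
    return next_image, next_depth
```

Because the warp is driven by depth, the output carries a depth value for every filled pixel, which matches the abstract's point about depth being attached to each predicted pixel. Pixels that no source point reaches are left as holes (zero color, infinite depth) in this sketch; a practical system would need some form of infilling.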


research
02/15/2018

Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints

We present a novel approach for unsupervised learning of depth and ego-m...
research
05/12/2022

Performing Video Frame Prediction of Microbial Growth with a Recurrent Neural Network

A Recurrent Neural Network (RNN) was used to perform video frame predict...
research
01/12/2021

Binary TTC: A Temporal Geofence for Autonomous Navigation

Time-to-contact (TTC), the time for an object to collide with the observ...
research
10/17/2021

Revealing Disocclusions in Temporal View Synthesis through Infilling Vector Prediction

We consider the problem of temporal view synthesis, where the goal is to...
research
12/10/2018

Visual Depth Mapping from Monocular Images using Recurrent Convolutional Neural Networks

A reliable sense-and-avoid system is critical to enabling safe autonomou...
research
03/15/2022

From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction

There have been numerous recently proposed methods for monocular depth p...
research
11/27/2018

Deep Learned Frame Prediction for Video Compression

Motion compensation is one of the most essential methods for any video c...
