Segmenting the Future

04/24/2019
by   Hsu-kuang Chiu, et al.
16

Predicting the future is an important aspect for decision-making in robotics or autonomous driving systems, which heavily rely upon visual scene understanding. While prior work attempts to predict future video pixels, anticipate activities or forecast future scene semantic segments from segmentation of the preceding frames, methods that predict future semantic segmentation solely from the previous frame RGB data in a single end-to-end trainable model do not exist. In this paper, we propose a temporal encoder-decoder network architecture that encodes RGB frames from the past and decodes the future semantic segmentation. The network is coupled with a new knowledge distillation training framework specifically for the forecasting task. Our method, only seeing preceding video frames, implicitly models the scene segments while simultaneously accounting for the object dynamics to infer the future scene semantic segments. Our results on Cityscapes outperform the baseline and current state-of-the-art methods. Code is available at https://github.com/eddyhkchiu/segmenting_the_future/.

READ FULL TEXT

page 1

page 4

page 8

page 12

page 13

research
03/22/2017

Predicting Deeper into the Future of Semantic Segmentation

The ability to predict and therefore to anticipate the future is an impo...
research
07/20/2018

Future Semantic Segmentation with Convolutional LSTM

We consider the problem of predicting semantic segmentation of future fr...
research
11/28/2018

Future Segmentation Using 3D Structure

Predicting the future to anticipate the outcome of events and actions is...
research
03/21/2019

Value of Temporal Dynamics Information in Driving Scene Segmentation

Semantic scene segmentation has primarily been addressed by forming repr...
research
05/20/2017

Forecasting Hands and Objects in Future Frames

This paper presents an approach to forecast future presence and location...
research
12/27/2018

Future semantic segmentation of time-lapsed videos with large temporal displacement

An important aspect of video understanding is the ability to predict the...
research
11/13/2019

Location-aware Upsampling for Semantic Segmentation

Many successful learning targets such as dice loss and cross-entropy los...

Please sign up or login with your details

Forgot password? Click here to reset