Separable Convolutional LSTMs for Faster Video Segmentation

07/16/2019
by   Andreas Pfeuffer, et al.
3

Semantic Segmentation is an important module for autonomous robots such as self-driving cars. The advantage of video segmentation approaches compared to single image segmentation is that temporal image information is considered, and their performance increases due to this. Hence, single image segmentation approaches are extended by recurrent units such as convolutional LSTM (convLSTM) cells, which are placed at suitable positions in the basic network architecture. However, a major critique of video segmentation approaches based on recurrent neural networks is their large parameter count and their computational complexity, and so, their inference time of one video frame takes up to 66 percent longer than their basic version. Inspired by the success of the spatial and depthwise separable convolutional neural networks, we generalize these techniques for convLSTMs in this work, so that the number of parameters and the required FLOPs are reduced significantly. Experiments on different datasets show that the segmentation approaches using the proposed, modified convLSTM cells achieve similar or slightly worse accuracy, but are up to 15 percent faster on a GPU than the ones using the standard convLSTM cells. Furthermore, a new evaluation metric is introduced, which measures the amount of flickering pixels in the segmented video sequence.

READ FULL TEXT

page 1

page 5

research
05/03/2019

Semantic Segmentation of Video Sequences with Convolutional LSTMs

Most of the semantic segmentation approaches have been developed for sin...
research
07/01/2020

Robust Semantic Segmentation in Adverse Weather Conditions by means of Fast Video-Sequence Segmentation

Computer vision tasks such as semantic segmentation perform very well in...
research
06/01/2016

Recurrent Fully Convolutional Networks for Video Segmentation

Image segmentation is an important step in most visual tasks. While conv...
research
09/15/2019

Comparison of UNet, ENet, and BoxENet for Segmentation of Mast Cells in Scans of Histological Slices

Deep neural networks show high accuracy in the problem of semantic and i...
research
04/18/2021

Gaussian Dynamic Convolution for Efficient Single-Image Segmentation

Interactive single-image segmentation is ubiquitous in the scientific an...
research
04/03/2018

Dynamic Video Segmentation Network

In this paper, we present a detailed design of dynamic video segmentatio...
research
03/03/2020

SELD-TCN: Sound Event Localization Detection via Temporal Convolutional Networks

The understanding of the surrounding environment plays a critical role i...

Please sign up or login with your details

Forgot password? Click here to reset