Local Memory Attention for Fast Video Semantic Segmentation

01/05/2021
by   Matthieu Paul, et al.
4

We propose a novel neural network module that transforms an existing single-frame semantic segmentation model into a video semantic segmentation pipeline. In contrast to prior works, we strive towards a simple and general module that can be integrated into virtually any single-frame architecture. Our approach aggregates a rich representation of the semantic information in past frames into a memory module. Information stored in the memory is then accessed through an attention mechanism. This provides temporal appearance cues from prior frames, which are then fused with an encoding of the current frame through a second attention-based module. The segmentation decoder processes the fused representation to predict the final semantic segmentation. We integrate our approach into two popular semantic segmentation networks: ERFNet and PSPNet. We observe an improvement in segmentation performance on Cityscapes by 1.7 by only 1.5ms.

READ FULL TEXT

page 1

page 3

page 4

page 8

research
02/17/2021

Temporal Memory Attention for Video Semantic Segmentation

Video semantic segmentation requires to utilize the complex temporal rel...
research
12/26/2019

Efficient Video Semantic Segmentation with Labels Propagation and Refinement

This paper tackles the problem of real-time semantic segmentation of hig...
research
08/03/2020

Frame-To-Frame Consistent Semantic Segmentation

In this work, we aim for temporally consistent semantic segmentation thr...
research
12/27/2018

Future semantic segmentation of time-lapsed videos with large temporal displacement

An important aspect of video understanding is the ability to predict the...
research
03/14/2022

Attention based Memory video portrait matting

We proposed a novel trimap free video matting method based on the attent...
research
04/03/2020

Temporally Distributed Networks for Fast Video Semantic Segmentation

We present TDNet, a temporally distributed network designed for fast and...
research
05/22/2023

Spatiotemporal Attention-based Semantic Compression for Real-time Video Recognition

This paper studies the computational offloading of video action recognit...

Please sign up or login with your details

Forgot password? Click here to reset