Memory Based Video Scene Parsing

09/01/2021
by Zhenchao Jin, et al.

Video scene parsing is a long-standing, challenging task in computer vision that aims to assign pre-defined semantic labels to every pixel of every frame in a given video. Compared with image semantic segmentation, this task places more emphasis on exploiting temporal information to achieve higher predictive accuracy. In this report, we introduce our solution for the 1st Video Scene Parsing in the Wild Challenge, which achieves an mIoU of 57.44 and took 2nd place (our team name is CharlesBLWX).
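For readers unfamiliar with the metric quoted above, mean intersection-over-union (mIoU) averages the per-class overlap between predicted and ground-truth label maps. The following is a minimal sketch of one common way to compute it with NumPy. The function name `mean_iou` and its arguments are illustrative assumptions, not the challenge's official evaluation code, and benchmarks typically accumulate the confusion matrix over all frames of all videos and report the result as a percentage.

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    Illustrative helper (hypothetical name); labels are assumed to be
    integers in [0, num_classes).
    """
    pred = np.asarray(pred, dtype=np.int64).ravel()
    target = np.asarray(target, dtype=np.int64).ravel()

    # Confusion matrix: rows index ground truth, columns index prediction.
    conf = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)

    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection

    # Skip classes absent from both prediction and ground truth.
    valid = union > 0
    return float((intersection[valid] / union[valid]).mean())


# Toy example: one misclassified pixel out of four.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

In the toy example, class 0 has IoU 1/2 and class 1 has IoU 2/3, so `score` is their average, 7/12. For video, the same confusion matrix could be summed across frames before taking the ratio.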

