ReConvNet: Video Object Segmentation with Spatio-Temporal Features Modulation

06/14/2018
by   Francesco Lattari, et al.
2

We introduce ReConvNet, a recurrent convolutional architecture for semi-supervised video object segmentation that is able to fast adapt its features to focus on the object of interest at inference time. Generalizing to new objects not observed during training is known to be an hard task for supervised approaches that need to be retrained on the new instances. To tackle this problem, we propose a more efficient solution that learns spatio-temporal features that can be adapted by the model itself through affine transformations conditioned on the object in the first frame of the sequence. This approach is simple, it can be trained end-to-end and does not require extra training steps at inference time. Our method shows comparable results on DAVIS2016 with respect to state-of-the art approaches that use online finetuning, and outperform them on DAVIS2017. ReConvNet shows also promising results on the DAVIS-Challenge 2018 placing in 10-th position.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/28/2019

Fast video object segmentation with Spatio-Temporal GANs

Learning descriptive spatio-temporal object models from data is paramoun...
research
03/13/2019

RVOS: End-to-End Recurrent Network for Video Object Segmentation

Multiple object video object segmentation is a challenging task, special...
research
07/14/2022

Tackling Background Distraction in Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to densely track ce...
research
08/08/2021

Joint Inductive and Transductive Learning for Video Object Segmentation

Semi-supervised video object segmentation is a task of segmenting the ta...
research
09/29/2022

4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation

In this work, we present a new paradigm, called 4D-StOP, to tackle the t...
research
02/01/2021

Consistent Recurrent Neural Networks for 3D Neuron Segmentation

We present a recurrent network for the 3D reconstruction of neurons that...
research
07/03/2018

Deep Spatio-Temporal Random Fields for Efficient Video Segmentation

In this work we introduce a time- and memory-efficient method for struct...

Please sign up or login with your details

Forgot password? Click here to reset