FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos

01/19/2017
by   Suyog Dutt Jain, et al.
0

We propose an end-to-end learning framework for segmenting generic objects in videos. Our method learns to combine appearance and motion information to produce pixel level segmentation masks for all prominent objects in videos. We formulate this task as a structured prediction problem and design a two-stream fully convolutional neural network which fuses together motion and appearance in a unified framework. Since large-scale video datasets with pixel level segmentations are problematic, we show how to bootstrap weakly annotated videos together with existing image recognition datasets for training. Through experiments on three challenging video segmentation benchmarks, our method substantially improves the state-of-the-art for segmenting generic (unseen) objects. Code and pre-trained models are available on the project website.

READ FULL TEXT

page 1

page 3

page 4

page 7

research
08/11/2018

Pixel Objectness: Learning to Segment Generic Objects Automatically in Images and Videos

We propose an end-to-end learning framework for segmenting generic objec...
research
12/01/2017

Learning to Segment Moving Objects

We study the problem of segmenting moving objects in unconstrained video...
research
06/25/2017

Decomposing Motion and Content for Natural Video Sequence Prediction

We propose a deep neural network for the prediction of future frames in ...
research
02/25/2022

Weakly Supervised Instance Segmentation using Motion Information via Optical Flow

Weakly supervised instance segmentation has gained popularity because it...
research
09/26/2022

EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations

We introduce VISOR, a new dataset of pixel annotations and a benchmark s...
research
09/21/2023

Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation

Referring Video Object Segmentation (RVOS) requires segmenting the objec...
research
10/07/2021

MGPSN: Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection

Head detection in real-world videos is an important research topic in co...

Please sign up or login with your details

Forgot password? Click here to reset