Learning to Segment Moving Objects

12/01/2017
by Pavel Tokmakov, et al.

We study the problem of segmenting moving objects in unconstrained videos. Given a video, the task is to segment all objects that exhibit independent motion in at least one frame. We formulate this as a learning problem and design our framework around three cues: (i) independent object motion between a pair of frames, which complements object recognition, (ii) object appearance, which helps to correct errors in motion estimation, and (iii) temporal consistency, which imposes additional constraints on the segmentation. The framework is a two-stream neural network with an explicit memory module. The two streams encode the appearance and motion cues of a video sequence, respectively, while the memory module captures the evolution of objects over time, exploiting temporal consistency. The motion stream is a convolutional neural network trained on synthetic videos to segment independently moving objects in the optical flow field. The module that builds a 'visual memory' of the video, i.e., a joint representation of all the video frames, is realized with a convolutional recurrent unit learned from a small number of training video sequences. For every pixel in a frame of a test video, our approach assigns an object or background label based on the learned spatio-temporal features as well as the 'visual memory' specific to the video. We evaluate our method extensively on three benchmarks: DAVIS, the Freiburg-Berkeley motion segmentation dataset, and SegTrack. In addition, we provide an extensive ablation study that investigates both the choice of training data and the influence of each component of the proposed framework.
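
The data flow described in the abstract (per-frame appearance and motion features, fused and passed through a convolutional recurrent memory, then read out as a per-pixel label) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the GRU-style gating, the single-channel feature maps, the 3x3 kernels, the sign-based readout, and all function names (`conv_same`, `conv_gru_step`, `segment_video`, `random_kernels`). In practice the two streams would be deep networks producing multi-channel features, and the weights would be learned rather than supplied by hand.

```python
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def conv_same(channels, kernels):
    """Sum of 3x3 zero-padded convolutions, one kernel per input channel."""
    h, w = len(channels[0]), len(channels[0][0])
    out = [[0.0] * w for _ in range(h)]
    for chan, k in zip(channels, kernels):
        for y in range(h):
            for x in range(w):
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        yy, xx = y + dy, x + dx
                        if 0 <= yy < h and 0 <= xx < w:
                            out[y][x] += k[dy + 1][dx + 1] * chan[yy][xx]
    return out


def apply(fn, grid):
    """Apply a scalar function to every pixel of a 2D grid."""
    return [[fn(v) for v in row] for row in grid]


def random_kernels(n, rng, scale=0.5):
    """n random 3x3 kernels (hypothetical stand-in for learned weights)."""
    return [[[rng.uniform(-scale, scale) for _ in range(3)] for _ in range(3)]
            for _ in range(n)]


def conv_gru_step(x_channels, h, params):
    """One GRU-style convolutional update of the memory state h (assumed form)."""
    w_update, w_reset, w_cand = params
    z = apply(sigmoid, conv_same(x_channels + [h], w_update))  # update gate
    r = apply(sigmoid, conv_same(x_channels + [h], w_reset))   # reset gate
    gated = [[rv * hv for rv, hv in zip(r_row, h_row)]
             for r_row, h_row in zip(r, h)]
    cand = apply(math.tanh, conv_same(x_channels + [gated], w_cand))
    # Convex combination of the old memory and the candidate state.
    return [[(1.0 - zv) * hv + zv * cv
             for zv, hv, cv in zip(z_row, h_row, c_row)]
            for z_row, h_row, c_row in zip(z, h, cand)]


def segment_video(frames, params):
    """frames: list of (appearance_map, motion_map) pairs of equal size.

    Returns one binary mask per frame (1 = object, 0 = background) and the
    final memory state. The sign-based readout is an illustrative choice.
    """
    height, width = len(frames[0][0]), len(frames[0][0][0])
    h = [[0.0] * width for _ in range(height)]
    masks = []
    for appearance, motion in frames:
        h = conv_gru_step([appearance, motion], h, params)
        masks.append([[1 if v > 0.0 else 0 for v in row] for row in h])
    return masks, h
```

Calling `segment_video` on a list of `(appearance_map, motion_map)` pairs with three sets of three kernels (two input channels plus the memory channel) yields one mask per frame. Because the memory state is carried from frame to frame, the label of a pixel in a later frame depends on earlier frames as well, which is the sense in which a recurrent unit can exploit temporal consistency.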

research
12/19/2014

Learning to Segment Moving Objects in Videos

We segment moving objects in videos by ranking spatio-temporal segment p...
research
02/11/2019

Towards Segmenting Everything That Moves

Video analysis is the task of perceiving the world as it changes. Often,...
research
11/23/2020

Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation

The objective of this paper is to design a computational architecture th...
research
09/16/2019

Temporally Consistent Depth Prediction with Flow-Guided Memory Units

Predicting depth from a monocular video sequence is an important task fo...
research
01/19/2017

FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos

We propose an end-to-end learning framework for segmenting generic objec...
research
03/14/2022

Implicit Motion Handling for Video Camouflaged Object Detection

We propose a new video camouflaged object detection (VCOD) framework tha...
research
04/17/2023

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

We study learning object segmentation from unlabeled videos. Humans can ...