Learning Motion Patterns in Videos

12/21/2016
by   Pavel Tokmakov, et al.
0

The problem of determining whether an object is in motion, irrespective of camera motion, is far from being solved. We address this challenging task by learning motion patterns in videos. The core of our approach is a fully convolutional network, which is learned entirely from synthetic video sequences, and their ground-truth optical flow and motion segmentation. This encoder-decoder style architecture first learns a coarse representation of the optical flow field features, and then refines it iteratively to produce motion labels at the original high-resolution. We further improve this labeling with an objectness map and a conditional random field, to account for errors in optical flow, and also to focus on moving "things" rather than "stuff". The output label of each pixel denotes whether it has undergone independent motion, i.e., irrespective of camera motion. We demonstrate the benefits of this learning framework on the moving object segmentation task, where the goal is to segment all objects in motion. Our approach outperforms the top method on the recently released DAVIS benchmark dataset, comprising real-world sequences, by 5.6 state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 7

page 8

research
09/20/2017

SegFlow: Joint Learning for Video Object Segmentation and Optical Flow

This paper proposes an end-to-end trainable network, SegFlow, for simult...
research
03/07/2016

Blur Robust Optical Flow using Motion Channel

It is hard to estimate optical flow given a realworld video sequence wit...
research
04/04/2023

Divided Attention: Unsupervised Multi-Object Discovery with Contextually Separated Slots

We introduce a method to segment the visual field into independently mov...
research
08/14/2018

Moving Object Segmentation in Jittery Videos by Stabilizing Trajectories Modeled in Kendall's Shape Space

Moving Object Segmentation is a challenging task for jittery/wobbly vide...
research
04/15/2020

Visual Descriptor Learning from Monocular Video

Correspondence estimation is one of the most widely researched and yet o...
research
11/07/2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Generic motion understanding from video involves not only tracking objec...

Please sign up or login with your details

Forgot password? Click here to reset