Learning Features by Watching Objects Move

12/19/2016
by   Deepak Pathak, et al.
0

This paper presents a novel yet intuitive approach to unsupervised feature learning. Inspired by the human visual system, we explore whether low-level motion-based grouping cues can be used to learn an effective visual representation. Specifically, we use unsupervised motion-based segmentation on videos to obtain segments, which we use as 'pseudo ground truth' to train a convolutional network to segment objects from a single frame. Given the extensive evidence that motion plays a key role in the development of the human visual system, we hope that this straightforward approach to unsupervised learning will be more effective than cleverly designed 'pretext' tasks studied in the literature. Indeed, our extensive experiments show that this is the case. When used for transfer learning on object detection, our representation significantly outperforms previous unsupervised approaches across multiple settings, especially when training data for the target task is scarce.

READ FULL TEXT

page 1

page 2

page 4

page 5

research
12/13/2018

Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation

One major technique debt in video object segmentation is to label the ob...
research
05/08/2015

Learning image representations tied to ego-motion

Understanding how images of objects and scenes behave in response to spe...
research
04/17/2023

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

We study learning object segmentation from unlabeled videos. Humans can ...
research
06/13/2022

Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation

The task of unsupervised semantic segmentation aims to cluster pixels in...
research
05/31/2023

Permutation-Aware Action Segmentation via Unsupervised Frame-to-Segment Alignment

This paper presents a novel transformer-based framework for unsupervised...
research
12/20/2018

Unsupervised Meta-learning of Figure-Ground Segmentation via Imitating Visual Effects

This paper presents a "learning to learn" approach to figure-ground imag...
research
06/10/2021

Unsupervised Co-part Segmentation through Assembly

Co-part segmentation is an important problem in computer vision for its ...

Please sign up or login with your details

Forgot password? Click here to reset