Learning image representations tied to ego-motion

05/08/2015
by Dinesh Jayaraman, et al.

Understanding how images of objects and scenes behave in response to specific ego-motions is a crucial aspect of proper visual development, yet existing visual learning methods are conspicuously disconnected from the physical source of their images. We propose to exploit proprioceptive motor signals to provide unsupervised regularization in convolutional neural networks that learn visual representations from egocentric video. Specifically, we enforce that our learned features exhibit equivariance, i.e., they respond predictably to transformations associated with distinct ego-motions. With three datasets, we show that our unsupervised feature learning approach significantly outperforms previous approaches on visual recognition and next-best-view prediction tasks. In the most challenging test, we show that features learned from video captured on an autonomous driving platform improve large-scale scene recognition in static images from a disjoint domain.
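
To make the equivariance idea concrete, here is a minimal NumPy sketch of one way such a regularizer could be written. It is an illustration under stated assumptions, not the paper's actual objective: it assumes ego-motions are discretized into a small set of classes, that each class `g` has its own linear map `M_g` acting on feature space, and that the penalty is the squared error between `M_g z(x)` and `z(g x)`. The names `equivariance_loss` and `motion_maps`, and the toy data, are hypothetical.

```python
import numpy as np

def equivariance_loss(z_before, z_after, motion_ids, motion_maps):
    """Mean squared error between M_g z(x) and z(g x) over frame pairs.

    z_before, z_after: (n_pairs, d) features of frames before/after the motion.
    motion_ids:        (n_pairs,) integer ego-motion class for each pair.
    motion_maps:       (n_motions, d, d) one linear map per motion class
                       (an assumption of this sketch).
    """
    # Pick the map that belongs to each pair's motion class: (n_pairs, d, d).
    maps = motion_maps[motion_ids]
    # Apply each map to its "before" feature to predict the "after" feature.
    predicted = np.einsum("nij,nj->ni", maps, z_before)
    return float(np.mean(np.sum((predicted - z_after) ** 2, axis=1)))

# Toy data: features that are exactly equivariant under known maps.
rng = np.random.default_rng(0)
d, n_motions, n_pairs = 4, 3, 32
true_maps = rng.normal(size=(n_motions, d, d))
motion_ids = rng.integers(0, n_motions, size=n_pairs)
z_before = rng.normal(size=(n_pairs, d))
z_after = np.einsum("nij,nj->ni", true_maps[motion_ids], z_before)

# The loss vanishes for the correct maps and is positive for a wrong guess
# (here: identity maps, i.e. assuming features do not change with motion).
loss_true = equivariance_loss(z_before, z_after, motion_ids, true_maps)
identity_maps = np.stack([np.eye(d)] * n_motions)
loss_identity = equivariance_loss(z_before, z_after, motion_ids, identity_maps)
print(loss_true, loss_identity)
```

In a training setup, a term like this would be added to the network's loss so that features and maps are learned jointly; here the features are fixed toy vectors so that only the penalty itself is shown.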


research
12/19/2016

Learning Features by Watching Objects Move

This paper presents a novel yet intuitive approach to unsupervised featu...
research
07/21/2020

Video Representation Learning by Recognizing Temporal Transformations

We introduce a novel self-supervised learning approach to learn represen...
research
01/24/2019

Learning Vector Representation of Content and Matrix Representation of Change: Towards a Representational Model of V1

This paper entertains the hypothesis that the primary purpose of the cel...
research
03/21/2019

Value of Temporal Dynamics Information in Driving Scene Segmentation

Semantic scene segmentation has primarily been addressed by forming repr...
research
07/14/2022

Egocentric Scene Understanding via Multimodal Spatial Rectifier

In this paper, we study a problem of egocentric scene understanding, i.e...
research
07/27/2023

A Memory-Augmented Multi-Task Collaborative Framework for Unsupervised Traffic Accident Detection in Driving Videos

Identifying traffic accidents in driving videos is crucial to ensuring t...
research
01/27/2016

Deep Learning Driven Visual Path Prediction from a Single Image

Capabilities of inference and prediction are significant components of v...
