Object Permanence Emerges in a Random Walk along Memory

04/04/2022
by Pavel Tokmakov, et al.

This paper proposes a self-supervised objective for learning representations that localize objects under occlusion, a property known as object permanence. A central question is the choice of learning signal in cases of total occlusion. Rather than directly supervising the locations of invisible objects, we propose a self-supervised objective that requires neither human annotation nor assumptions about object dynamics. We show that object permanence can emerge by optimizing for temporal coherence of memory: we fit a Markov walk along a space-time graph of memories, where the states in each time step are non-Markovian features from a sequence encoder. This leads to a memory representation that stores occluded objects and predicts their motion, to better localize them. The resulting model outperforms existing approaches on several datasets of increasing complexity and realism while requiring minimal supervision and few assumptions, which makes it broadly applicable.
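
To make the idea of a Markov walk along a space-time graph more concrete, here is a minimal NumPy sketch. It is not the authors' implementation. It assumes that each time step provides a set of node features, that transitions between consecutive steps are a softmax over cosine similarities, and that the loss is the negative log-probability of the walker reaching a chosen target node. The function names, the `temperature` parameter, and the choice of target are all hypothetical and used only for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def transition_matrix(a, b, temperature=0.1):
    """Row-stochastic transitions from nodes a (N, D) to nodes b (M, D).

    Assumption: affinity is cosine similarity scaled by a temperature.
    """
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return softmax(a @ b.T / temperature, axis=1)


def walk_distribution(features, start, temperature=0.1):
    """Propagate a start distribution through all time steps.

    features: array of shape (T, N, D), one set of node features per step.
    start:    array of shape (N,), distribution over nodes at the first step.
    Returns the distribution over nodes at the last step.
    """
    p = start
    for t in range(len(features) - 1):
        p = p @ transition_matrix(features[t], features[t + 1], temperature)
    return p


def walk_loss(features, start, target_index, temperature=0.1):
    """Negative log-probability that the walker ends at target_index."""
    p = walk_distribution(features, start, temperature)
    return -np.log(p[target_index] + 1e-12)


if __name__ == "__main__":
    # Toy example: 3 time steps, 4 nodes whose features do not change,
    # so a walker starting at node 0 should stay at node 0.
    features = np.tile(np.eye(4), (3, 1, 1))
    start = np.array([1.0, 0.0, 0.0, 0.0])
    print(walk_distribution(features, start))
    print(walk_loss(features, start, target_index=0))
```

In this toy setting the features are fixed vectors. In the setting the abstract describes, the states at each time step are features produced by a sequence encoder, and a loss of this kind would be backpropagated through that encoder.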

research
06/25/2020

Space-Time Correspondence as a Contrastive Random Walk

This paper proposes a simple self-supervised approach for learning repre...
research
03/09/2021

Self-Supervision by Prediction for Object Discovery in Videos

Despite their irresistible success, deep learning algorithms still heavi...
research
01/20/2022

Learning Pixel Trajectories with Multiscale Contrastive Random Walks

A range of video modeling tasks, from optical flow to multiple object tr...
research
05/21/2022

AutoLink: Self-supervised Learning of Human Skeletons and Object Outlines by Linking Keypoints

Structured representations such as keypoints are widely used in pose tra...
research
11/28/2022

Mix and Localize: Localizing Sound Sources in Mixtures

We present a method for simultaneously localizing multiple sound sources...
research
06/10/2019

Online Object Representations with Contrastive Learning

We propose a self-supervised approach for learning representations of ob...