Learning Internal Representations of 3D Transformations from 2D Projected Inputs

03/31/2023
by   Marissa Connor, et al.
0

When interacting in a three dimensional world, humans must estimate 3D structure from visual inputs projected down to two dimensional retinal images. It has been shown that humans use the persistence of object shape over motion-induced transformations as a cue to resolve depth ambiguity when solving this underconstrained problem. With the aim of understanding how biological vision systems may internally represent 3D transformations, we propose a computational model, based on a generative manifold model, which can be used to infer 3D structure from the motion of 2D points. Our model can also learn representations of the transformations with minimal supervision, providing a proof of concept for how humans may develop internal representations on a developmental or evolutionary time scale. Focused on rotational motion, we show how our model infers depth from moving 2D projected points, learns 3D rotational transformations from 2D training stimuli, and compares to human performance on psychophysical structure-from-motion experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2021

Equivariant Deep Dynamical Model for Motion Prediction

Learning representations through deep generative modeling is a powerful ...
research
05/19/2018

Capturing human category representations by sampling in deep feature spaces

Understanding how people represent categories is a core problem in cogni...
research
06/22/2021

Learning Identity-Preserving Transformations on Data Manifolds

Many machine learning techniques incorporate identity-preserving transfo...
research
12/13/2018

Using Motion and Internal Supervision in Object Recognition

In this thesis we address two related aspects of visual object recogniti...
research
07/16/2023

Multi-Object Discovery by Low-Dimensional Object Motion

Recent work in unsupervised multi-object segmentation shows impressive r...
research
12/14/2021

On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation

Self-supervised learning is a powerful way to learn useful representatio...
research
12/25/2021

Evolutionary Generation of Visual Motion Illusions

Why do we sometimes perceive static images as if they were moving? Visua...

Please sign up or login with your details

Forgot password? Click here to reset