Attend, Infer, Repeat: Fast Scene Understanding with Generative Models

03/28/2016
by S. M. Ali Eslami, et al.

We present a framework for efficient inference in structured image models that explicitly reason about objects. We achieve this by performing probabilistic inference using a recurrent neural network that attends to scene elements and processes them one at a time. Crucially, the model itself learns to choose the appropriate number of inference steps. We use this scheme to learn to perform inference in partially specified 2D models (variable-sized variational auto-encoders) and fully specified 3D models (probabilistic renderers). We show that such models learn to identify multiple objects (counting, locating and classifying the elements of a scene) without any supervision, e.g., decomposing 3D images with various numbers of objects in a single forward pass of a neural network. We further show that the networks produce accurate inferences when compared to supervised counterparts, and that their structure leads to improved generalization.
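
The control flow described in the abstract (attend to one scene element per step, and let a sampled presence variable decide when to stop) can be illustrated with a minimal sketch. This is not the authors' implementation: `infer_scene`, `toy_step`, `ObjectLatent`, and `StepFn` are hypothetical names, and `toy_step` is a hand-written stand-in for the learned recurrent network. It only shows how a variable number of inference steps can emerge from a per-step Bernoulli presence draw.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class ObjectLatent:
    """Hypothetical per-object latent: where the object is and what it looks like."""
    where: Tuple[float, float, float]  # assumed (scale, x, y) of an attention window
    what: List[float]                  # assumed appearance code


# Hypothetical step signature standing in for the recurrent inference network:
# (image, hidden state) -> (presence probability, proposed latent, new hidden state)
StepFn = Callable[[List[float], List[float]], Tuple[float, ObjectLatent, List[float]]]


def infer_scene(image: List[float], step_fn: StepFn, max_steps: int = 3,
                rng: Optional[random.Random] = None) -> List[ObjectLatent]:
    """Process scene elements one at a time until a presence draw says stop."""
    rng = rng or random.Random(0)
    hidden = [0.0]
    objects: List[ObjectLatent] = []
    for _ in range(max_steps):
        p_present, latent, hidden = step_fn(image, hidden)
        # Draw a Bernoulli(p_present) presence variable; a zero ends inference.
        if rng.random() >= p_present:
            break
        objects.append(latent)
    return objects


def toy_step(image: List[float], hidden: List[float]):
    """Toy stand-in for the network: treats each bright pixel as one object."""
    explained = int(hidden[0])  # number of objects already accounted for
    bright = [i for i, v in enumerate(image) if v > 0.5]
    if explained < len(bright):
        idx = bright[explained]
        latent = ObjectLatent(where=(1.0, float(idx), 0.0), what=[image[idx]])
        return 1.0, latent, [float(explained + 1)]
    # Nothing left to explain: presence probability 0 stops the loop.
    return 0.0, ObjectLatent(where=(0.0, 0.0, 0.0), what=[]), hidden


scene = infer_scene([0.0, 0.9, 0.0, 0.8], toy_step)
print(len(scene), [obj.where for obj in scene])
```

In the paper's setting the step function would be a trained network and the presence probability would be learned, so the number of steps adapts to each image; the `max_steps` cap here is an assumed upper bound on the object count.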

research
05/29/2017

Auto-Encoding Sequential Monte Carlo

We introduce AESMC: a method for using deep neural networks for simultan...
research
02/07/2023

Structured Generative Models for Scene Understanding

This position paper argues for the use of structured generative models (...
research
07/12/2022

Scalable Bayesian Inference for Detection and Deblending in Astronomical Images

We present a new probabilistic method for detecting, deblending, and cat...
research
03/19/2021

Knowledge-Guided Object Discovery with Acquired Deep Impressions

We present a framework called Acquired Deep Impressions (ADI) which cont...
research
10/27/2022

ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images

The problem of inferring object shape from a single 2D image is undercon...
research
02/06/2019

Simultaneous x, y Pixel Estimation and Feature Extraction for Multiple Small Objects in a Scene: A Description of the ALIEN Network

We present a deep-learning network that detects multiple small objects (...
research
06/07/2016

Towards a Neural Statistician

An efficient learner is one who reuses what they already know to tackle ...