Inference and Learning for Generative Capsule Models

09/07/2022
by   Alfredo Nazabal, et al.

Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the object models, based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to (i) data generated from multiple geometric objects like squares and triangles ("constellations"), and (ii) data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders (SCAEs) to tackle this problem – our results show that we significantly outperform them where we can make comparisons (on the constellations data).
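To make the RANSAC-style alternative concrete, the following is a minimal sketch of hypothesise-and-verify matching of one template object to a cluttered scene of 2D points. It is an illustration only, not the authors' algorithm: it assumes a single object, noise-free parts, and a 2D similarity transformation (scale, rotation, translation), none of which the abstract specifies. The names `fit_similarity`, `ransac_similarity`, and the toy `square` and `clutter` data are hypothetical.

```python
import numpy as np

def fit_similarity(p0, p1, q0, q1):
    """Similarity transform z -> a*z + b (complex form) sending p0->q0, p1->q1."""
    a = (q1 - q0) / (p1 - p0)  # |a| is the scale, angle(a) the rotation
    b = q0 - a * p0            # translation
    return a, b

def ransac_similarity(template, scene, n_iters=2000, tol=1e-6, seed=0):
    """Return (n_inliers, a, b) for the best hypothesis found by random sampling."""
    rng = np.random.default_rng(seed)
    t = template[:, 0] + 1j * template[:, 1]  # template parts as complex numbers
    s = scene[:, 0] + 1j * scene[:, 1]        # observed parts as complex numbers
    best = (0, None, None)
    for _ in range(n_iters):
        # Hypothesise: two template parts correspond to two observed parts.
        i, j = rng.choice(len(t), size=2, replace=False)
        k, l = rng.choice(len(s), size=2, replace=False)
        a, b = fit_similarity(t[i], t[j], s[k], s[l])
        # Verify: count template parts that land on distinct observed parts.
        pred = a * t + b
        dist = np.abs(pred[:, None] - s[None, :])  # shape (n_template, n_scene)
        nearest = dist.argmin(axis=1)
        ok = dist.min(axis=1) < tol
        n_inliers = len(set(nearest[ok]))
        if n_inliers > best[0]:
            best = (n_inliers, a, b)
    return best

# Toy "constellation": one transformed unit square plus four clutter points.
square = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
theta, scale, shift = 0.7, 2.0, np.array([3.0, -1.0])
R = scale * np.array([[np.cos(theta), -np.sin(theta)],
                      [np.sin(theta),  np.cos(theta)]])
obj = square @ R.T + shift
clutter = np.array([[10.0, 10.0], [-5.0, 2.0], [7.0, -8.0], [0.5, 6.0]])
scene = np.vstack([obj, clutter])

n_inliers, a, b = ransac_similarity(square, scene)
print(n_inliers, abs(a))
```

Because a square is rotationally symmetric, the recovered rotation is only determined up to multiples of 90 degrees; the scale and the set of matched parts are what this toy example pins down. With noisy observations, `tol` would need to be set from the noise level.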


