Why Capsule Neural Networks Do Not Scale: Challenging the Dynamic Parse-Tree Assumption

01/04/2023
by   Matthias Mitterreiter, et al.
0

Capsule neural networks replace simple, scalar-valued neurons with vector-valued capsules. They are motivated by the pattern recognition system in the human brain, where complex objects are decomposed into a hierarchy of simpler object parts. Such a hierarchy is referred to as a parse-tree. Conceptually, capsule neural networks have been defined to realize such parse-trees. The capsule neural network (CapsNet), by Sabour, Frosst, and Hinton, is the first actual implementation of the conceptual idea of capsule neural networks. CapsNets achieved state-of-the-art performance on simple image recognition tasks with fewer parameters and greater robustness to affine transformations than comparable approaches. This sparked extensive follow-up research. However, despite major efforts, no work was able to scale the CapsNet architecture to more reasonable-sized datasets. Here, we provide a reason for this failure and argue that it is most likely not possible to scale CapsNets beyond toy examples. In particular, we show that the concept of a parse-tree, the main idea behind capsule neuronal networks, is not present in CapsNets. We also show theoretically and experimentally that CapsNets suffer from a vanishing gradient problem that results in the starvation of many capsules during training.

READ FULL TEXT

page 3

page 5

page 12

page 24

research
01/29/2020

Examining the Benefits of Capsule Neural Networks

Capsule networks are a recently developed class of neural networks that ...
research
02/07/2020

Subspace Capsule Network

Convolutional neural networks (CNNs) have become a key asset to most of ...
research
07/08/2020

Quaternion Capsule Networks

Capsules are grouping of neurons that allow to represent sophisticated i...
research
03/21/2022

HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network

Capsule networks are designed to present the objects by a set of parts a...
research
03/20/2023

Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D Images

The function of constructing the hierarchy of objects is important to th...
research
11/09/2022

Affordance detection with Dynamic-Tree Capsule Networks

Affordance detection from visual input is a fundamental step in autonomo...
research
04/04/2022

REM: Routing Entropy Minimization for Capsule Networks

Capsule Networks ambition is to build an explainable and biologically-in...

Please sign up or login with your details

Forgot password? Click here to reset