DeepAI AI Chat
Log In Sign Up

Quaternion Capsule Networks

by   Barış Özcan, et al.
policy of Özyeğin University

Capsules are grouping of neurons that allow to represent sophisticated information of a visual entity such as pose and features. In the view of this property, Capsule Networks outperform CNNs in challenging tasks like object recognition in unseen viewpoints, and this is achieved by learning the transformations between the object and its parts with the help of high dimensional representation of pose information. In this paper, we present Quaternion Capsules (QCN) where pose information of capsules and their transformations are represented by quaternions. Quaternions are immune to the gimbal lock, have straightforward regularization of the rotation representation for capsules, and require less number of parameters than matrices. The experimental results show that QCNs generalize better to novel viewpoints with fewer parameters, and also achieve on-par or better performances with the state-of-the-art Capsule architectures on well-known benchmarking datasets.


page 1

page 2

page 3

page 4


Capsule Routing via Variational Bayes

Capsule Networks are a recently proposed alternative for constructing Ne...

Dynamic Routing Between Capsules

A capsule is a group of neurons whose activity vector represents the ins...

Why Capsule Neural Networks Do Not Scale: Challenging the Dynamic Parse-Tree Assumption

Capsule neural networks replace simple, scalar-valued neurons with vecto...

Geometric Capsule Autoencoders for 3D Point Clouds

We propose a method to learn object representations from 3D point clouds...

Path Capsule Networks

Capsule network (CapsNet) was introduced as an enhancement over convolut...

Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data

Equivariance is a nice property to have as it produces much more paramet...

On the Capability of Neural Networks to Generalize to Unseen Category-Pose Combinations

Recognizing an object's category and pose lies at the heart of visual un...