Free Hyperbolic Neural Networks with Limited Radii

07/23/2021
by   Yunhui Guo, et al.
0

Non-Euclidean geometry with constant negative curvature, i.e., hyperbolic space, has attracted sustained attention in the community of machine learning. Hyperbolic space, owing to its ability to embed hierarchical structures continuously with low distortion, has been applied for learning data with tree-like structures. Hyperbolic Neural Networks (HNNs) that operate directly in hyperbolic space have also been proposed recently to further exploit the potential of hyperbolic representations. While HNNs have achieved better performance than Euclidean neural networks (ENNs) on datasets with implicit hierarchical structure, they still perform poorly on standard classification benchmarks such as CIFAR and ImageNet. The traditional wisdom is that it is critical for the data to respect the hyperbolic geometry when applying HNNs. In this paper, we first conduct an empirical study showing that the inferior performance of HNNs on standard recognition datasets can be attributed to the notorious vanishing gradient problem. We further discovered that this problem stems from the hybrid architecture of HNNs. Our analysis leads to a simple yet effective solution called Feature Clipping, which regularizes the hyperbolic embedding whenever its norm exceeding a given threshold. Our thorough experiments show that the proposed method can successfully avoid the vanishing gradient problem when training HNNs with backpropagation. The improved HNNs are able to achieve comparable performance with ENNs on standard image recognition datasets including MNIST, CIFAR10, CIFAR100 and ImageNet, while demonstrating more adversarial robustness and stronger out-of-distribution detection capability.

READ FULL TEXT
research
10/28/2019

Hyperbolic Graph Convolutional Neural Networks

Graph convolutional neural networks (GCNs) embed nodes in a graph into E...
research
04/15/2021

Lorentzian Graph Convolutional Networks

Graph convolutional networks (GCNs) have received considerable research ...
research
05/25/2022

A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning

We present a rotated hyperbolic wrapped normal distribution (RoWN), a si...
research
06/07/2022

Towards Scalable Hyperbolic Neural Networks using Taylor Series Approximations

Hyperbolic networks have shown prominent improvements over their Euclide...
research
02/10/2021

Hyperbolic Generative Adversarial Network

Recently, Hyperbolic Spaces in the context of Non-Euclidean Deep Learnin...
research
02/28/2022

Hyperbolic Graph Neural Networks: A Review of Methods and Applications

Graph neural networks generalize conventional neural networks to graph-s...
research
10/05/2020

A Fully Hyperbolic Neural Model for Hierarchical Multi-Class Classification

Label inventories for fine-grained entity typing have grown in size and ...

Please sign up or login with your details

Forgot password? Click here to reset